Applied Mathematical Sciences 




Partial 
Differential 
Equations I


Third Edition 


Springer


Applied Mathematical Sciences 


Founding Editors 


F. John 
J. P. LaSalle 
L. Sirovich 


Volume 115 


Series Editors 


Anthony Bloch, Department of Mathematics, University of Michigan, Ann 
Arbor, MI, USA 


C. L. Epstein, Department of Mathematics, University of Pennsylvania, 
Philadelphia, PA, USA 


Alain Goriely, Department of Mathematics, University of Oxford, Oxford, UK 
Leslie Greengard, New York University, New York, NY, USA 


Advisory Editors 


J. Bell, Center for Computational Sciences and Engineering, Lawrence Berkeley 
National Laboratory, Berkeley, CA, USA 


P. Constantin, Department of Mathematics, Princeton University, Princeton, NJ, 
USA 


R. Durrett, Department of Mathematics, Duke University, Durham, NC, USA


R. Kohn, Courant Institute of Mathematical Sciences, New York University, 
New York, NY, USA 


R. Pego, Department of Mathematical Sciences, Carnegie Mellon University, 
Pittsburgh, PA, USA 


L. Ryzhik, Department of Mathematics, Stanford University, Stanford, CA, USA 
A. Singer, Department of Mathematics, Princeton University, Princeton, NJ, USA 


A. Stevens, Department of Applied Mathematics, University of Münster,
Münster, Germany


S. Wright, Computer Sciences Department, University of Wisconsin, Madison, 
WI, USA 


The mathematization of all sciences, the fading of traditional scientific 
boundaries, the impact of computer technology, the growing importance of 
computer modeling and the necessity of scientific planning all create the need 
both in education and research for books that are introductory to and abreast 
of these developments. The purpose of this series is to provide such books, 
suitable for the user of mathematics, the mathematician interested in applications, 
and the student scientist. In particular, this series will provide an outlet for topics 
of immediate interest because of the novelty of its treatment of an application or 
of mathematics being applied or lying close to applications. These books should 
be accessible to readers versed in mathematics or science and engineering, and 
will feature a lively tutorial style, a focus on topics of current interest, and present 
clear exposition of broad appeal. A complement to the Applied Mathematical
Sciences series is the Texts in Applied Mathematics series, which publishes 
textbooks suitable for advanced undergraduate and beginning graduate courses. 


Michael E. Taylor 


Partial Differential 
Equations I


Basic Theory 


Third Edition 


Springer


Michael E. Taylor 
Department of Mathematics 
University of North Carolina 
Chapel Hill, NC, USA 


ISSN 0066-5452 ISSN 2196-968X (electronic) 
Applied Mathematical Sciences 
ISBN 978-3-031-33858-8 ISBN 978-3-031-33859-5 (eBook) 


https://doi.org/10.1007/978-3-031-33859-5 
Mathematics Subject Classification: 35-01 


1st & 2nd editions: © Springer Science+Business Media, LLC 1996, 2011
3rd edition: © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer
Nature Switzerland AG 2023 


This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, 
whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, 
reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical 
way, and transmission or information storage and retrieval, electronic adaptation, computer software, 
or by similar or dissimilar methodology now known or hereafter developed. 

The use of general descriptive names, registered names, trademarks, service marks, etc. in this 
publication does not imply, even in the absence of a specific statement, that such names are exempt 
from the relevant protective laws and regulations and therefore free for general use. 

The publisher, the authors, and the editors are safe to assume that the advice and information in this 
book are believed to be true and accurate at the date of publication. Neither the publisher nor the 
authors or the editors give a warranty, expressed or implied, with respect to the material contained 
herein or for any errors or omissions that may have been made. The publisher remains neutral with 
regard to jurisdictional claims in published maps and institutional affiliations. 


This Springer imprint is published by the registered company Springer Nature Switzerland AG 
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland 


To my wife and daughter, Jane Hawkins 
and Diane Hart 


Contents of Volumes II and III 


Volume II: Qualitative Studies of Linear Equations

7 Pseudodifferential Operators
8 Spectral Theory
9 Scattering by Obstacles
10 Dirac Operators and Index Theory
11 Brownian Motion and Potential Theory
12 The $\overline{\partial}$-Neumann Problem
C Connections and Curvature

Volume III: Nonlinear Equations

13 Function Space and Operator Theory for Nonlinear Analysis
14 Nonlinear Elliptic Equations
15 Nonlinear Parabolic Equations
16 Nonlinear Hyperbolic Equations
17 Euler and Navier-Stokes Equations for Incompressible Fluids
18 Einstein’s Equations




Preface 


Partial differential equations form a many-faceted subject. Created to describe the
mechanical behavior of objects such as vibrating strings and blowing winds, the subject
has developed into a body of material that interacts with many branches of
mathematics, such as differential geometry, complex analysis, and harmonic 
analysis, as well as a ubiquitous factor in the description and elucidation of 
problems in mathematical physics. 

This work is intended to provide a course of study of some of the major 
aspects of PDE. It is addressed to readers with a background in the basic
introductory graduate mathematics courses in American universities: elementary real
and complex analysis, differential geometry, and measure theory. 

Chapter 1 provides background material on the theory of ordinary differential
equations (ODE). This includes both very basic material (on topics such as the
existence and uniqueness of solutions to ODE and explicit solutions to equations
with constant coefficients and relations to linear algebra) and more sophisticated
results (on flows generated by vector fields, connections with differential
geometry, the calculus of differential forms, stationary action principles in mechanics,
and their relation to Hamiltonian systems). We discuss equations of relativistic
motion as well as equations of classical Newtonian mechanics. There are also 
applications to topological results, such as degree theory, the Brouwer fixed-point 
theorem, and the Jordan-Brouwer separation theorem. In this chapter, we also 
treat scalar first-order PDE, via the Hamilton–Jacobi theory.

Chapters 2–6 constitute a survey of basic linear PDE. Chapter 2 begins with
the derivation of some equations of continuum mechanics in a fashion similar to 
the derivation of ODE in mechanics in Chap. 1, via variational principles. We 
obtain equations for vibrating strings and membranes; these equations are not 
necessarily linear, and hence they will also provide sources of problems later, 
when nonlinear PDE is taken up. Further material in Chap. 2 centers around the 
Laplace operator, which on Euclidean space $\mathbb{R}^n$ is


(1)  $\Delta = \frac{\partial^2}{\partial x_1^2} + \cdots + \frac{\partial^2}{\partial x_n^2}$,


and the linear wave equation,


(2)  $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0$.


We also consider the Laplace operator on a general Riemannian manifold and the
wave equation on a general Lorentz manifold. We discuss the basic consequences 
of Green’s formula, including energy conservation and finite propagation speed 
for solutions to linear wave equations. We also discuss Maxwell’s equations for 
electromagnetic fields and their relation with special relativity. Before we can 
establish general results on the solvability of these equations, it is necessary to 
develop some analytical techniques. This is done in the next couple of chapters. 

Chapter 3 is devoted to Fourier analysis and the theory of distributions. These 
topics are crucial for the study of linear PDE. We give a number of basic 
applications to the study of linear PDE with constant coefficients. Among these 
applications are results on harmonic and holomorphic functions in the plane, 
including a short treatment of elementary complex function theory. We derive 
explicit formulas for solutions to Laplace and wave equations on Euclidean 
space, and also the heat equation, 


(3)  $\frac{\partial u}{\partial t} - \Delta u = 0$.


We also produce solutions on certain subsets, such as rectangular regions, using 
the method of images. We include material on the discrete Fourier transform, 
germane to the discrete approximation of PDE, and on the fast evaluation of this 
transform, the FFT. Chapter 3 is the first chapter to make extensive use of 
functional analysis. Basic results on this topic are compiled in Appendix A, 
Outline of Functional Analysis. 
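
As a small illustration of the discrete Fourier transform and its fast evaluation, the following Python sketch compares a direct evaluation of the defining sums with NumPy's FFT routine. The function name `naive_dft` is hypothetical, and the sign and normalization convention is that of `numpy.fft.fft`, which is an assumption made for this example and need not match the convention adopted in Chap. 3.

```python
import numpy as np

def naive_dft(x):
    """Direct O(N^2) DFT: X[k] = sum_j x[j] * exp(-2*pi*i*j*k/N)."""
    x = np.asarray(x, dtype=complex)
    n = x.shape[0]
    j = np.arange(n)
    # n-by-n matrix of roots of unity; entry (k, j) is exp(-2*pi*i*j*k/n)
    w = np.exp(-2j * np.pi * np.outer(j, j) / n)
    return w @ x

rng = np.random.default_rng(0)
signal = rng.standard_normal(64)
direct = naive_dft(signal)
fast = np.fft.fft(signal)   # same sums, computed in O(N log N) operations
max_diff = float(np.max(np.abs(direct - fast)))
```

Both computations produce the same vector up to rounding error; the difference lies only in the operation count, which is what makes the FFT practical for large N.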

Sobolev spaces have proven to be a very effective tool in the existence theory 
of PDE, and in the study of regularity of solutions. In Chap. 4 we introduce 
Sobolev spaces and study some of their basic properties. We restrict attention to 
$L^2$-Sobolev spaces, such as $H^k(\mathbb{R}^n)$, which consists of $L^2$ functions whose
derivatives of order $\le k$ (defined in a distributional sense, in Chap. 3) belong to
$L^2(\mathbb{R}^n)$, when k is a positive integer. We also replace k by a general real number
s. The $L^p$-Sobolev spaces, which are very useful for nonlinear PDE, are treated
later, in Chap. 13. 
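
For orientation, one standard way to pass from a positive integer k to a general real number s uses the Fourier transform. The formula below is the usual textbook definition, given here as an assumption about the convention rather than a quotation from Chap. 4; the symbols $\hat{u}$ (Fourier transform) and $\mathcal{S}'$ (tempered distributions) are not introduced in this preface.

```latex
H^s(\mathbb{R}^n)
  = \bigl\{ u \in \mathcal{S}'(\mathbb{R}^n) :
      (1+|\xi|^2)^{s/2}\,\hat{u}(\xi) \in L^2(\mathbb{R}^n) \bigr\},
\qquad
\|u\|_{H^s}^2 = \int_{\mathbb{R}^n} (1+|\xi|^2)^{s}\,|\hat{u}(\xi)|^2 \, d\xi .
```

When s = k is a positive integer, the Plancherel theorem shows that this space coincides with the one described through derivatives of order at most k, with an equivalent norm.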

Chapter 5 is devoted to the study of the existence and regularity of solutions to 
linear elliptic PDE, on bounded regions. We begin with the Dirichlet problem for 
the Laplace operator, 


(4)  $\Delta u = f$ on $\Omega$,  $u = g$ on $\partial\Omega$,


and then treat the Neumann problem and various other boundary problems, 
including some that apply to electromagnetic fields. We also study general 
boundary problems for linear elliptic operators, giving a condition that guarantees
regularity and solvability (perhaps given a finite number of linear conditions on
the data). Also in Chap. 5 are some applications to other areas, such as a proof 
of the Riemann mapping theorem, first for smooth simply connected domains in 
the complex plane C, then, after a treatment of the Dirichlet problem for the 
Laplace operator on domains with rough boundary, for general simply connected 
domains in C. We also develop the Hodge theory and apply it to de Rham 
cohomology, extending the study of topological applications of differential forms 
begun in Chap. 1. 

In Chap. 6 we study linear evolution equations, in which there is a “time” 
variable t, and initial data are given at t = 0. We discuss the heat and wave
equations. We also treat Maxwell’s equations, for an electromagnetic field, and
more general hyperbolic systems. We prove the Cauchy–Kowalewsky theorem,
in the linear case, establishing local solvability of the Cauchy initial value 
problem for general linear PDE with analytic coefficients, and analytic data, as 
long as the initial surface is “noncharacteristic.” The nonlinear case is treated in 
Chap. 16. Also in Chap. 6 we treat geometrical optics, providing approximations 
to solutions of wave equations whose initial data either are highly oscillatory or 
possess simple singularities, such as a jump across a smooth hypersurface. 

Chapters 1-6, together with Appendix A and Appendix B, Manifolds, Vector 
Bundles, and Lie Groups, make up the first volume of this work. The second 
volume consists of Chaps. 7–12, covering a selection of more advanced topics in
linear PDE, together with Appendix C, Connections and Curvature. 

Chapter 7 deals with pseudodifferential operators (ψDOs). This class of
operators includes both differential operators and parametrices of elliptic
operators, that is, inverses modulo smoothing operators. There is a “symbol calculus”
allowing one to analyze products of ψDOs, useful for such a parametrix
construction. The $L^2$-boundedness of operators of order zero and the Gårding
inequality for elliptic ψDOs with positive symbol provide very useful tools in
linear PDE, which will be used in many subsequent chapters. 

Chapter 8 is devoted to spectral theory, particularly for self-adjoint elliptic 
operators. First we give a proof of the spectral theorem for general self-adjoint 
operators on Hilbert space. Then we discuss conditions under which a differential 
operator yields a self-adjoint operator. We then discuss the asymptotic
distribution of eigenvalues of the Laplace operator on a bounded domain, making use of
a construction of a parametrix for the heat equation from Chap. 7. Further 
material in Chap. 8 includes results on the spectral behavior of various specific 
differential operators, such as the Laplace operator on a sphere, and on hyperbolic 
space, the “harmonic oscillator” 


(5)  $-\Delta + |x|^2$,


and the operator


(6)  $-\Delta - \frac{K}{|x|}$,


which arises in the simplest quantum mechanical model of the hydrogen atom.
We also consider the Laplace operator on cones. 

In Chap. 9 we study the scattering of waves by a compact obstacle K in $\mathbb{R}^3$.
This scattering theory is to some degree an extension of the spectral theory of the
Laplace operator on $\mathbb{R}^3 \setminus K$, with the Dirichlet boundary condition. In addition to
studying how a given obstacle scatters waves, we consider the inverse problem: 
how to determine an obstacle given data on how it scatters waves. 

Chapter 10 is devoted to the Atiyah–Singer index theorem. This gives a
formula for the index of an elliptic operator D on a compact manifold M, defined by


(7)  $\operatorname{Index} D = \dim \ker D - \dim \ker D^*$.


We establish this formula, which is an integral over M of a certain differential 
form defined by a pair of “curvatures,” when D is a first-order differential 
operator of “Dirac type,” a class that contains many important operators arising 
from differential geometry and complex analysis. Special cases of such a formula 
include the Chern–Gauss–Bonnet formula and the Riemann–Roch formula. We
also discuss the significance of the latter formula in the study of Riemann 
surfaces. 
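
In finite dimensions the index in (7) can be computed directly, which may help fix the definition. The Python sketch below is only an analogue under stated assumptions: D is a real m-by-n matrix, D* is taken to be its transpose, and the helper name `index_of_matrix` is hypothetical. It is not the elliptic-operator setting of Chap. 10.

```python
import numpy as np

def index_of_matrix(d):
    """dim ker D - dim ker D^T for a real m-by-n matrix D."""
    m, n = d.shape
    rank = np.linalg.matrix_rank(d)
    dim_ker = n - rank        # rank-nullity for D : R^n -> R^m
    dim_coker = m - rank      # rank-nullity for D^T : R^m -> R^n
    return int(dim_ker - dim_coker)

rng = np.random.default_rng(1)
d = rng.standard_normal((3, 5))
d_degenerate = d.copy()
d_degenerate[2] = d_degenerate[0] + d_degenerate[1]   # lower the rank by one
idx = index_of_matrix(d)
idx_degenerate = index_of_matrix(d_degenerate)
```

By the rank-nullity theorem the result is always n - m, whatever the rank of D. Both kernels grow when the rank drops, but their difference does not change, a toy version of the stability of the index under perturbation.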

In Chap. 11 we study Brownian motion, described mathematically by Wiener 
measure on the space of continuous paths in $\mathbb{R}^n$. This provides a probabilistic
approach to diffusion and it both uses and provides new tools for the analysis 
of the heat equation and variants, such as 


(8)  $\frac{\partial u}{\partial t} = -\Delta u + Vu$,


where V is a real-valued function. There is an integral formula for solutions to (8), 
known as the Feynman–Kac formula; it is an integral over path space with respect
to the Wiener measure, of a fairly explicit integrand. We also derive an analogous 
integral formula for solutions to 


(9)  $\frac{\partial u}{\partial t} = -\Delta u + Xu$,


where X is a vector field. In this case, another tool is involved in constructing the 
integrand, the stochastic integral. We also study stochastic differential equations 
and applications to more general diffusion equations. 

In Chap. 12 we tackle the $\overline{\partial}$-Neumann problem, a boundary problem for an
elliptic operator (essentially the Laplace operator) on a domain $\Omega \subset \mathbb{C}^n$, which is
very important in the theory of functions of several complex variables. From a
technical point of view, it is of particular interest that this boundary problem does
not satisfy the regularity criteria investigated in Chap. 5. If $\Omega$ is “strongly
pseudo-convex,” one has instead certain “subelliptic estimates,” which are
established in Chap. 12. 

The third and final volume of this work contains Chaps. 13-18. It is here that 
we study nonlinear PDE. 

We prepare the way in Chap. 13 with a further development of function space 
and operator theory, for use in nonlinear analysis. This includes the theory of
$L^p$-Sobolev spaces and Hölder spaces. We derive estimates in these spaces on
nonlinear functions F(u), known as “Moser estimates,” which are very useful. We
extend the theory of pseudodifferential operators to cases where the symbols have
limited smoothness, and also develop a variant of ψDO theory, the theory of
“paradifferential operators,” which has had a significant impact on nonlinear PDE 
since about 1980. We also estimate these operators, acting on the function spaces 
mentioned above. Other topics treated in Chap. 13 include Hardy spaces,
compensated compactness, and “fuzzy functions.”

Chapter 14 is devoted to nonlinear elliptic PDE, with an emphasis on 
second-order equations. There are three successive degrees of nonlinearity: 
semilinear equations, such as 


(10)  $\Delta u = F(x, u, \nabla u)$,

quasi-linear equations, such as

(11)  $\sum a^{jk}(x, u, \nabla u)\,\partial_j \partial_k u = F(x, u, \nabla u)$,

and completely nonlinear equations, of the form

(12)  $G(x, D^2 u) = 0$.

Differential geometry provides a rich source of such PDE, and Chap. 14 contains 
a number of geometrical applications. For example, to deform conformally a 
metric on a surface so its Gauss curvature changes from k(x) to K(x), one needs to 
solve the semilinear equation 


(13)  $\Delta u = k(x) - K(x)e^{2u}$.


As another example, the graph of a function y = u(x) is a minimal submanifold of 
Euclidean space provided u solves the quasi-linear equation 


(14)  $(1 + |\nabla u|^2)\,\Delta u + (\nabla u) \cdot H(u)(\nabla u) = 0$,


called the minimal surface equation. Here, $H(u) = (\partial_j \partial_k u)$ is the Hessian matrix
of u. On the other hand, this graph has Gauss curvature K(x) provided u solves the 
completely nonlinear equation 


(15)  $\det H(u) = K(x)\,(1 + |\nabla u|^2)^{(n+2)/2}$,


a Monge–Ampère equation. Equations (13)–(15) are all scalar, and the maximum
principle plays a useful role in the analysis, together with a number of other tools. 
Chapter 14 also treats nonlinear systems. Important physical examples arise in 
studies of elastic bodies, as well as in other areas, such as the theory of liquid 
crystals. Geometric examples of systems considered in Chap. 14 include
equations for harmonic maps and equations for isometric embeddings of a
Riemannian manifold in Euclidean space. 

In Chap. 15, we treat nonlinear parabolic equations. Partly echoing Chap. 14, 
we progress from a treatment of semilinear equations, 


(16)  $\frac{\partial u}{\partial t} = Lu + F(x, u, \nabla u)$,


where L is a linear operator, such as $L = \Delta$, to a treatment of quasi-linear
equations, such as 


(We do very little with completely nonlinear equations in this chapter.) We study 
systems as well as scalar equations. The first application of (16) we consider is to 
the parabolic equation method of constructing harmonic maps. We also consider 
“reaction-diffusion” equations, $\ell \times \ell$ systems of the form (16), in which
$F(x, u, \nabla u) = X(u)$, where X is a vector field on $\mathbb{R}^\ell$, and L is a diagonal
operator, with diagonal elements $a_j \Delta$, $a_j > 0$. These equations arise in
mathematical models in biology and in chemistry. For example, $u = (u_1, \dots, u_\ell)$ might
represent the population densities of each of $\ell$ species of living creatures,
distributed over an area of land, interacting in a manner described by X and diffusing
in a manner described by $a_j \Delta$. If there is a nonlinear (density-dependent)
diffusion, one might have a system of the form (17).
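
To make the structure of such a reaction-diffusion system concrete, here is a minimal numerical sketch in Python. Everything specific in it is an assumption chosen for illustration and not taken from Chap. 15: one space dimension with periodic boundary conditions, an explicit Euler time step, two species, a hypothetical predator-prey interaction field `lotka_volterra`, and the particular diffusion constants.

```python
import numpy as np

def step(u, a, x_field, dx, dt):
    """One explicit Euler step of du_j/dt = a_j * Laplacian(u_j) + X(u)_j
    on a periodic 1-D grid; u has shape (ell, n_points)."""
    lap = (np.roll(u, 1, axis=1) - 2.0 * u + np.roll(u, -1, axis=1)) / dx**2
    return u + dt * (a[:, None] * lap + x_field(u))

def lotka_volterra(u):
    # Hypothetical interaction field X(u) for prey u[0] and predator u[1].
    prey, pred = u
    return np.stack([prey * (1.0 - pred), pred * (prey - 1.0)])

n_points = 100
dx = 1.0 / n_points
a = np.array([1e-3, 5e-4])            # diffusion constants a_j > 0
dt = 0.2 * dx**2 / a.max()            # within the explicit stability limit
grid = np.linspace(0.0, 1.0, n_points, endpoint=False)
u = np.stack([1.0 + 0.5 * np.sin(2 * np.pi * grid),
              1.0 + 0.5 * np.cos(2 * np.pi * grid)])

# With X = 0 the scheme reduces to pure diffusion and conserves total mass.
mass_before = u.sum(axis=1) * dx
no_reaction = step(u, a, lambda w: np.zeros_like(w), dx, dt)
mass_after = no_reaction.sum(axis=1) * dx

for _ in range(200):
    u = step(u, a, lotka_volterra, dx, dt)
```

The diagonal operator L appears as the row-wise factor `a[:, None] * lap`, and the vector field X enters pointwise. An explicit scheme is used only because it is short; it is stable here because the time step is tied to the grid spacing.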

Another problem considered in Chap. 15 models the melting of ice; one has a 
linear heat equation in a region (filled with water) whose boundary (where the 
water touches the ice) is moving (as the ice melts). The nonlinearity in the 
problem involves the description of the boundary. We confine our analysis to a 
relatively simple one-dimensional case. 

Nonlinear hyperbolic equations are studied in Chap. 16. Here continuum 
mechanics is the major source of examples, and most of them are systems, rather 
than scalar equations. We establish local existence for solutions to first-order
hyperbolic systems, which are either “symmetric” or “symmetrizable.” An
example of the latter class is the following system describing compressible fluid 
flow: 


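(18)  $\frac{\partial v}{\partial t} + \nabla_v v + \frac{1}{\rho}\,\operatorname{grad} p = 0$,  $\frac{\partial \rho}{\partial t} + \nabla_v \rho + \rho \operatorname{div} v = 0$,


for a fluid with velocity v, density $\rho$, and pressure p, assumed to satisfy a relation
$p = p(\rho)$, called an “equation of state.” Solutions to such nonlinear systems tend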
to break down, due to shock formation. We devote a bit of attention to the study 
of weak solutions to nonlinear hyperbolic systems, with shocks. 

We also study second-order hyperbolic systems, such as systems for a 
k-dimensional membrane vibrating in $\mathbb{R}^n$, derived in Chap. 2. Another topic
covered in Chap. 16 is the Cauchy–Kowalewsky theorem, in the nonlinear case.
We use a method introduced by P. Garabedian to transform the Cauchy problem 
for an analytic equation into a symmetric hyperbolic system. 

In Chap. 17 we study incompressible fluid flow. This is governed by the Euler 
equation 


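(19)  $\frac{\partial v}{\partial t} + \nabla_v v = -\operatorname{grad} p$,  $\operatorname{div} v = 0$,


in the absence of viscosity, and by the Navier-Stokes equation


(20)  $\frac{\partial v}{\partial t} + \nabla_v v = \nu \mathcal{L} v - \operatorname{grad} p$,  $\operatorname{div} v = 0$,


in the presence of viscosity. Here $\mathcal{L}$ is a second-order operator, the Laplace
operator for a flow on flat space; the “viscosity” $\nu$ is a positive quantity. Equation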
(19) shares some features with quasi-linear hyperbolic systems, though there are 
also significant differences. Similarly, (20) has a lot in common with semilinear 
parabolic systems. 

Chapter 18, the last chapter of this work, is devoted to Einstein’s gravitational 
equations: 


(21)  $G_{jk} = 8\pi\kappa T_{jk}$.


Here $G_{jk}$ is the Einstein tensor, given by $G_{jk} = \mathrm{Ric}_{jk} - (1/2) S g_{jk}$, where $\mathrm{Ric}_{jk}$
is the Ricci tensor and S the scalar curvature, of a Lorentz manifold (or
“spacetime”) with metric tensor $g_{jk}$. On the right side of (21), $T_{jk}$ is the
stress-energy tensor of the matter in the spacetime, and $\kappa$ is a positive constant, which
can be identified with the gravitational constant of the Newtonian theory of
gravity. In local coordinates, $G_{jk}$ has a nonlinear expression in terms of $g_{jk}$ and
its second-order derivatives. In the empty-space case, where $T_{jk} = 0$, (21) is a
quasi-linear second-order system for $g_{jk}$. The freedom to change coordinates
provides an obstruction to this equation being hyperbolic, but one can impose the 
use of “harmonic” coordinates as a constraint and transform (21) into a
hyperbolic system. In the presence of matter one couples (21) to other systems,
obtaining more elaborate PDE. We treat this in two cases, in the presence of an 
electromagnetic field, and in the presence of a relativistic fluid. 

In addition to the 18 chapters just described, there are three appendices, 
already mentioned above. Appendix A gives definitions and basic properties of 
Banach and Hilbert spaces (of which $L^p$-spaces and Sobolev spaces are
examples), Fréchet spaces (such as $C^\infty(\mathbb{R}^n)$), and other locally convex spaces (such as
spaces of distributions). It discusses some basic facts about bounded linear 
operators, including some special properties of compact operators, and also 
considers certain classes of unbounded linear operators. This functional analytic 
material plays a major role in the development of PDE from Chap. 3 onward. 

Appendix B gives definitions and basic properties of manifolds and vector 
bundles. It also discusses some elementary properties of Lie groups, including a 
little representation theory, useful in Chap. 8, on spectral theory, as well as in the 
Chern–Weil construction.

Appendix C, Connections and Curvature, contains material of a differential 
geometric nature, crucial for understanding many things done in Chaps. 10-18. 
We consider connections on general vector bundles, and their curvature. We 
discuss in detail the special properties of the primary case: the Levi-Civita
connection and Riemann curvature tensor on a Riemannian manifold. We discuss
the basic properties of the geometry of submanifolds, relating the second
fundamental form to curvature via the Gauss–Codazzi equations. We describe how
vector bundles arise from principal bundles, which themselves carry various
connections and curvature forms. We then discuss the Chern–Weil construction,
yielding certain closed differential forms associated to curvatures of connections
on principal bundles. We give several proofs of the classical Gauss–Bonnet
theorem and some related results on two-dimensional surfaces, which are useful
particularly in Chaps. 10 and 14. We also give a geometrical proof of the
Chern–Gauss–Bonnet theorem, which can be contrasted with the proof in Chap. 10, as a
consequence of the Atiyah–Singer index theorem.

We mention that, in addition to these “global” appendices, there are
appendices to some chapters. For example, Chap. 3 has an appendix on the gamma
function. Chapter 6 has two appendices; Appendix A has some results on Banach
spaces of harmonic functions useful for the proof of the linear
Cauchy–Kowalewsky theorem, and Appendix B deals with the stationary phase formula,
useful for the study of geometrical optics in Chap. 6 and also for results later, in
Chap. 9. There are other chapters with such “local” appendices. Furthermore,
there are two sections, both in Chap. 14, with appendices. Section 6, on minimal
surfaces, has a companion, §6B, on the second variation of area and
consequences, and §12, on nonlinear elliptic systems, has a companion, §12B, with
complementary material.

Having described the scope of this work, we find it necessary to mention a
number of topics in PDE that are not covered here or are touched on only very 
briefly. 

For example, we devote little attention to the real analytic theory of PDE. We 
note that harmonic functions on domains in $\mathbb{R}^n$ are real analytic, but we do not
discuss the analyticity of solutions to more general elliptic equations. We do
prove the Cauchy–Kowalewsky theorem, on analytic PDE with analytic Cauchy
data. We derive some simple results on unique continuation from these few 
analyticity results, but there is a large body of lore on unique continuation, for 
solutions to nonanalytic PDE, neglected here. 

There is little material on numerical methods. There are a few references to 
applications of the FFT and of “splitting methods.” Difference schemes for PDE 
are mentioned just once, in a set of exercises on scalar conservation laws. Finite 
element methods are neglected, as are many other numerical techniques. 

There is a large body of work on free boundary problems, but the only one 
considered here is a simple one-space dimensional problem, in Chap. 15. 

While we have considered a variety of equations arising from classical physics 
and from relativity, we have devoted relatively little attention to quantum 
mechanics. We have considered a few quantum systems in Chap. 8, including 
models of the hydrogen atom and the deuteron. Also, there are some exercises on 
potential scattering mentioned in Chap. 9. However, the physical theories behind 
these equations are not discussed here. 

There are a number of nonlinear evolution equations, such as the Korteweg–de Vries
equation, that have been perceived to provide infinite dimensional
analogues of completely integrable Hamiltonian systems, and to arise “universally”
in asymptotic analyses of solutions to various nonlinear wave equations. They are 
not here. Nor is there a treatment of the Yang—Mills equations for gauge fields, 
with their wonderful applications to the geometry and topology of 
four-dimensional manifolds. 

Of course, this is not a complete list of omitted material. One can go on and on 
listing important topics in this vast subject. The author can at best hope that the 
reader will find it easier to understand many of these topics with this book, than 
without it.


Introduction to the Second Edition


In addition to making numerous small corrections to this work, collected over the 
past dozen years, I have taken the opportunity to make some very significant 
changes, some of which broaden the scope of the work, some of which clarify 
previous presentations, and a few of which correct errors that have come to my 
attention. 

There are seven additional sections in this edition, two in Volume 1, two in 
Volume 2, and three in Volume 3. Chapter 4 has a new section, “Sobolev spaces 
on rough domains,” which serves to clarify the treatment of the Dirichlet problem 
on rough domains in Chap. 5. Chapter 6 has a new section, “Boundary layer 
phenomena for the heat equation,” which will prove useful in one of the new 
sections in Chap. 17. Chapter 7 has a new section, “Operators of harmonic 
oscillator type,” and Chap. 10 has a section that presents an index formula for 
elliptic systems of operators of harmonic oscillator type. Chapter 13 has a new 
appendix, “Variations on complex interpolation,” which has material that is 
useful in the study of Zygmund spaces. Finally, Chap. 17 has two new sections, 
“Vanishing viscosity limits” and “From velocity convergence to flow 
convergence.” 

In addition, several other sections have been substantially rewritten, and 
numerous others polished to reflect insights gained through the use of these books 
over time. 


Introduction to the Third Edition 


I have provided further polishings and supplements for this third edition. New 
material in Volume 1 includes a section on rigid body motion in Chapter 1,
which will tie in to the derivation of the Euler equation of incompressible fluid 
flow in Chapter 17. Chapter 3 has a new appendix on the central limit theorem, 
related to a random walk, which will tie in to the treatment of Brownian motion in 
Chapter 11. In addition there is an expanded treatment of the Poisson integral in 
Chapter 5, a section on the Schrödinger equation in Chapter 6, and an expanded
treatment of holomorphic functional calculus in Appendix A.

New material in Volume 2 includes sections on a quantum model of the
deuteron, a quantum adiabatic theorem, and a quantum ergodic theorem, and 
appendices on the classical ergodic theorem and on shifted wave equations in 
Chapter 8, as well as expanded treatments of the spectral theorem and of analysis 
on hyperbolic space in that chapter. In Chapter 11 I have added a section on 
diffusion on Riemannian manifolds, with application to models of relativistic 
diffusion. 

New material in Volume 3 includes a section on overdetermined elliptic 
systems in Chapter 14 and a section on Euler flows on rotating surfaces,
influenced by the Coriolis force, in Chapter 17.


Chapel Hill, USA Michael E. Taylor


Basic Theory of ODE and Vector Fields


Introduction 


This chapter examines basic topics in the field of ordinary differential equations 
(ODE), as it has developed from the era of Newton into modern times. This is 
closely tied to the development of a number of concepts in advanced calculus. 
We begin with a brief discussion of the derivative of a vector-valued function of 
several variables as a linear map. We then establish in §2 the fundamental local 
existence and uniqueness of solutions to ODE, of the form 


(0.1)  $\frac{dy}{dt} = F(t, y)$,  $y(t_0) = y_0$,


where F(t, y) is continuous in both arguments and Lipschitz in y, and y takes
values in $\mathbb{R}^k$. The proof uses a nice tool known as the contraction mapping
principle; next we use this principle to establish the inverse and implicit
function theorems in §3. After a discussion of constant-coefficient linear equations,
in which we recall the basic results of linear algebra, in §4, we treat
variable-coefficient linear ODE in §5, emphasizing a result known as Duhamel’s principle,
and then use this to examine smooth dependence on parameters for solutions to
nonlinear ODE in §6.

The first six sections have a fairly purely analytic character and present ODE 
from a perspective similar to that seen in introductory courses. It is expected that 
the reader has seen much of this material before. A more leisurely treatment of 
the material in these sections can be found in [T], Introduction to Differential 
Equations. 

Beginning in §7, the material begins to acquire a geometrical flavor as well. 
This section interprets solutions to (0.1) in terms of a flow generated by a vector 
field. The next two sections examine the Lie derivative of vector fields and some 
of its implications for ODE. While we initially work on domains in $\mathbb{R}^n$, here we 
begin a transition to global constructions, involving working on manifolds and 
hence making use of concepts that are invariant under changes of coordinates. By 
the end of §13, on differential forms, this transition is complete. Appendix B, at 
the end of this volume, collects some of the basic facts about manifolds which 
are useful for such an approach to analysis. One can find further background on 
calculus on surfaces and manifolds, on vector fields and flows, and on differential 
forms in [T2], Introduction to Analysis in Several Variables: Advanced Calculus. 

Physics is a major source of differential equations, and in §10 we discuss 
some of the basic ODE arising from Newton’s force law, converting the result- 
ing second-order ODE to first-order systems known as Hamiltonian systems. 
The study of Hamiltonian vector fields is a major focus for the subsequent sections 
in this chapter. In §11 we deal with an apparently disjoint topic, the equations of 
geodesics on a Riemannian manifold. We introduce the covariant derivative as a 
tool for expressing the geodesic equations, and later show that these equations 
can also be cast in Hamiltonian form. In §12 we study a general class of varia- 
tional problems, giving rise to both the equations of mechanics and the equations 
of geodesics, all expressible in Hamiltonian form. 

In §13 we develop the theory of differential forms, one of E. Cartan’s great con- 
tributions to analysis. There is a differential operator, called the exterior derivative, 
acting on differential forms. In beginning courses in multivariable calculus, one 
learns of div, grad, and curl as the major first-order differential operators; from 
a more advanced perspective, it is reasonable to think of the Lie derivative, the 
covariant derivative, and the exterior derivative as filling this role. The relevance 
of differential forms to ODE has many roots, but its most direct relevance for 
Hamiltonian systems is through the symplectic form, discussed in §14. 

Results on Hamiltonian systems are applied in §15 to the study of first-order 
nonlinear PDE for a single unknown. The next section studies “completely inte- 
grable” systems, reversing the perspective, to apply solutions to certain nonlinear 
PDE to the study of Hamiltonian systems. These two sections comprise what is 
known as Hamilton–Jacobi theory. In §17 we make a further study of integrable 
systems arising from central force problems, particularly the one involving the 
gravitational attraction of two bodies, the solution to which was Newton’s tri- 
umph. In §18 we derive and analyze equations for the motion of a rigid body in 
$\mathbb{R}^n$. This can be seen as a precursor to the derivation of the equations of motion 
of an ideal fluid in Chapter 17. Section 19 gives a brief relativistic treatment of 
the equations of motion arising from the electromagnetic force, which ushered in 
Einstein’s theory of relativity. 

In §20 we apply material from §13 on differential forms to some topological 
results, such as the Brouwer fixed-point theorem, the study of the degree of a 
map between compact oriented manifolds, and the Jordan–Brouwer separation 
theorem. We apply the degree theory in §21 to a study of the index of a vector 
field, which reflects the behavior of its critical points. Other applications, and 
extensions, of results on degree theory and index theory in §§20–21 can be found 
in Appendix C and in Chaps. 5 and 10. Also the Brouwer fixed-point theorem will 
be extended to the Leray–Schauder fixed-point theorem, and applied to problems 
in nonlinear PDE, in Chap. 14. 

The appendix at the end of this chapter discusses the existence and uniqueness 
of solutions to (0.1) when $F$ satisfies a condition weaker than Lipschitz in $y$. 
Results established here are applicable to the study of ideal fluid flow, as will be 
seen in Chap. 17. 


1. The derivative 


Let $O$ be an open subset of $\mathbb{R}^n$, and let $F : O \to \mathbb{R}^m$ be a continuous function. We 
say that $F$ is differentiable at a point $x \in O$, with derivative $L$, if $L : \mathbb{R}^n \to \mathbb{R}^m$ 
is a linear transformation such that, for small $y \in \mathbb{R}^n$, 

(1.1)    $F(x + y) = F(x) + Ly + R(x, y),$

with

(1.2)    $\dfrac{\|R(x, y)\|}{\|y\|} \to 0 \quad \text{as } y \to 0.$


We denote the derivative at $x$ by $DF(x) = L$. With respect to the standard bases 
of $\mathbb{R}^n$ and $\mathbb{R}^m$, $DF(x)$ is simply the matrix of partial derivatives, 

(1.3)    $DF(x) = \left( \dfrac{\partial F_j}{\partial x_k} \right),$

so that, if $v = (v_1, \dots, v_n)$ (regarded as a column vector), then 

(1.4)    $DF(x)v = \left( \sum_k \dfrac{\partial F_1}{\partial x_k} v_k, \ \dots, \ \sum_k \dfrac{\partial F_m}{\partial x_k} v_k \right).$


It will be shown that $F$ is differentiable whenever all the partial derivatives exist 
and are continuous on $O$. In such a case we say that $F$ is a $C^1$-function on $O$. 
In general, $F$ is said to be $C^k$ if all its partial derivatives of order $\le k$ exist and 
are continuous. 

In (1.2) we can use the Euclidean norm on $\mathbb{R}^n$ and $\mathbb{R}^m$. This norm is defined 
by 

(1.5)    $\|x\| = \left( x_1^2 + \cdots + x_n^2 \right)^{1/2}$

for $x = (x_1, \dots, x_n) \in \mathbb{R}^n$. Any other norm would do equally well. Some basic 
results on the Euclidean norm are derived in §4. 

More generally, the definition of the derivative given by (1.1) and (1.2) extends 
to a function $F : O \to Y$, where $O$ is an open subset of $X$, and $X$ and $Y$ 
are Banach spaces. Basic material on Banach spaces appears in Appendix A, 
Functional Analysis. In this case, we require $L$ to be a bounded linear map from 
$X$ to $Y$. The notion of differentiable function in this context is useful in the study 
of nonlinear PDE. 

We now derive the chain rule for the derivative. Let $F : O \to \mathbb{R}^m$ be 
differentiable at $x \in O$, as above; let $U$ be a neighborhood of $z = F(x)$ in 
$\mathbb{R}^m$; and let $G : U \to \mathbb{R}^k$ be differentiable at $z$. Consider $H = G \circ F$. We have 

(1.6)    $H(x + y) = G(F(x + y))$
         $= G\bigl(F(x) + DF(x)y + R(x, y)\bigr)$
         $= G(z) + DG(z)\bigl(DF(x)y + R(x, y)\bigr) + R_1(x, y)$
         $= G(z) + DG(z)DF(x)y + R_2(x, y),$

with

         $\dfrac{\|R_2(x, y)\|}{\|y\|} \to 0 \quad \text{as } y \to 0.$

Thus $G \circ F$ is differentiable at $x$, and 

(1.7)    $D(G \circ F)(x) = DG(F(x)) \cdot DF(x).$

This result works equally well if $\mathbb{R}^n$, $\mathbb{R}^m$, and $\mathbb{R}^k$ are replaced by general Banach 
spaces. 
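
As a numerical illustration of (1.7), the following sketch compares a finite-difference Jacobian of $G \circ F$ with the product of the finite-difference Jacobians of $G$ and $F$. The particular maps `F` and `G`, the base point, and the helper names `jacobian` and `matmul` are illustrative choices and are not taken from the text.

```python
import math

def jacobian(f, x, h=1e-6):
    """Approximate DF(x) by central differences; returns a list of rows."""
    fx = f(x)
    m, n = len(fx), len(x)
    J = [[0.0] * n for _ in range(m)]
    for k in range(n):
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        fp, fm = f(xp), f(xm)
        for j in range(m):
            J[j][k] = (fp[j] - fm[j]) / (2 * h)
    return J

def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][l] * B[l][j] for l in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def F(x):  # hypothetical smooth map R^2 -> R^3
    return [x[0] * x[1], math.sin(x[0]), x[0] + x[1] ** 2]

def G(z):  # hypothetical smooth map R^3 -> R^2
    return [z[0] + z[1] * z[2], math.exp(z[0]) - z[2]]

def H(x):  # the composition G o F
    return G(F(x))

x0 = [0.3, -0.7]
lhs = jacobian(H, x0)                               # D(G o F)(x0)
rhs = matmul(jacobian(G, F(x0)), jacobian(F, x0))   # DG(F(x0)) DF(x0)
max_gap = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
```

The two matrices agree only up to the finite-difference error, so `max_gap` is small but not exactly zero.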

Another useful remark is that, by the fundamental theorem of calculus, applied 
to $\varphi(t) = F(x + ty)$, 

(1.8)    $F(x + y) = F(x) + \int_0^1 DF(x + ty)\, y \, dt,$

provided $F$ is $C^1$. For a typical application, see (6.8). 

A closely related application of the fundamental theorem of calculus is that if 
we assume that $F : O \to \mathbb{R}^n$ is differentiable in each variable separately, and 
that each $\partial F / \partial x_j$ is continuous on $O$, then 

(1.9)    $F(x + y) = F(x) + \sum_{j=1}^n \bigl[ F(x + z_j) - F(x + z_{j-1}) \bigr] = F(x) + \sum_{j=1}^n A_j(x, y)\, y_j,$

         $A_j(x, y) = \int_0^1 \dfrac{\partial F}{\partial x_j}\bigl(x + z_{j-1} + t y_j e_j\bigr)\, dt,$

where $z_0 = 0$, $z_j = (y_1, \dots, y_j, 0, \dots, 0)$, and $\{e_j\}$ is the standard basis of 
$\mathbb{R}^n$. Now (1.9) implies that $F$ is differentiable on $O$, as we stated beneath (1.4). 
As is shown in many calculus texts, by using the mean value theorem instead of 




the fundamental theorem of calculus, one can obtain a slightly sharper result. We 
leave the reconstruction of this argument to the reader. 

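We now describe two convenient notations to express higher-order derivatives 
of a $C^k$-function $f : \Omega \to \mathbb{R}$, where $\Omega \subset \mathbb{R}^n$ is open. In the first, let $J$ be a 
$k$-tuple of integers between 1 and $n$; $J = (j_1, \dots, j_k)$. We set 

(1.10)    $f^{(J)}(x) = \partial_{j_k} \cdots \partial_{j_1} f(x), \quad \partial_j = \dfrac{\partial}{\partial x_j}.$

Also, we set $|J| = k$, the total order of differentiation. As will be seen in 
the exercises, $\partial_i \partial_j f = \partial_j \partial_i f$, provided $f \in C^2(\Omega)$. Hence, if $f \in C^k(\Omega)$, 
then $\partial_{j_k} \cdots \partial_{j_1} f = \partial_{\ell_k} \cdots \partial_{\ell_1} f$ whenever $\{\ell_1, \dots, \ell_k\}$ is a permutation of 
$\{j_1, \dots, j_k\}$. Thus, another convenient notation to use is the following. Let $\alpha$ 
be an $n$-tuple of nonnegative integers, $\alpha = (\alpha_1, \dots, \alpha_n)$. Then we set 

(1.11)    $f^{(\alpha)}(x) = \partial_1^{\alpha_1} \cdots \partial_n^{\alpha_n} f(x), \quad |\alpha| = \alpha_1 + \cdots + \alpha_n.$

Note that if $|J| = |\alpha| = k$ and $f \in C^k(\Omega)$, then 

(1.12)    $f^{(J)}(x) = f^{(\alpha)}(x), \quad \text{with } \alpha_i = \#\{\ell : j_\ell = i\}.$

Correspondingly, there are two expressions for monomials in $x$: 

(1.13)    $x^J = x_{j_1} \cdots x_{j_k}, \quad x^\alpha = x_1^{\alpha_1} \cdots x_n^{\alpha_n},$

and $x^J = x^\alpha$, provided $J$ and $\alpha$ are related as in (1.12). Both of these notations 
are called “multi-index” notations. 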

We now derive Taylor’s formula with remainder for a smooth function $F : \Omega \to \mathbb{R}$, 
making use of these multi-index notations. We will apply the one-variable 
formula, 

(1.14)    $\varphi(t) = \varphi(0) + \varphi'(0)\, t + \dfrac{1}{2} \varphi''(0)\, t^2 + \cdots + \dfrac{1}{k!} \varphi^{(k)}(0)\, t^k + r_k(t),$

with 

(1.15)    $r_k(t) = \dfrac{1}{k!} \int_0^t (t - s)^k \varphi^{(k+1)}(s)\, ds,$

given $\varphi \in C^{k+1}(I)$, $I = (-a, a)$. Let us assume that $0 \in \Omega$ and that the line 
segment from 0 to $x$ is contained in $\Omega$. We set $\varphi(t) = F(tx)$ and apply (1.14) and 
(1.15) with $t = 1$. Applying the chain rule, we have 

(1.16)    $\varphi'(t) = \sum_{j=1}^n \partial_j F(tx)\, x_j = \sum_{|J|=1} F^{(J)}(tx)\, x^J.$

Differentiating again, we have 

(1.17)    $\varphi''(t) = \sum_{|J|=1,\, |K|=1} F^{(J+K)}(tx)\, x^{J+K} = \sum_{|J|=2} F^{(J)}(tx)\, x^J,$

where, if $|J| = k$ and $|K| = \ell$, we take $J + K = (j_1, \dots, j_k, k_1, \dots, k_\ell)$. 
Inductively, we have 

(1.18)    $\varphi^{(k)}(t) = \sum_{|J|=k} F^{(J)}(tx)\, x^J.$

Hence, from (1.14), with $t = 1$, 

          $F(x) = F(0) + \sum_{|J|=1} F^{(J)}(0)\, x^J + \cdots + \dfrac{1}{k!} \sum_{|J|=k} F^{(J)}(0)\, x^J + R_k(x),$

or, more briefly, 

(1.19)    $F(x) = \sum_{|J| \le k} \dfrac{1}{|J|!} F^{(J)}(0)\, x^J + R_k(x),$

where 

(1.20)    $R_k(x) = \dfrac{1}{k!} \sum_{|J|=k+1} \left( \int_0^1 (1 - s)^k F^{(J)}(sx)\, ds \right) x^J.$

This gives Taylor’s formula with remainder for $F \in C^{k+1}(\Omega)$, in the $J$-multi-index 
notation. 
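We also want to write the formula in the $\alpha$-multi-index notation. We have 

(1.21)    $\sum_{|J|=k} F^{(J)}(tx)\, x^J = \sum_{|\alpha|=k} \nu(\alpha)\, F^{(\alpha)}(tx)\, x^\alpha,$

where 

(1.22)    $\nu(\alpha) = \#\{J : \alpha = \alpha(J)\},$

and we define the relation $\alpha = \alpha(J)$ to hold provided (1.12) holds or, equivalently, 
provided $x^J = x^\alpha$. Thus, $\nu(\alpha)$ is uniquely defined by 

(1.23)    $\sum_{|\alpha|=k} \nu(\alpha)\, x^\alpha = \sum_{|J|=k} x^J = (x_1 + \cdots + x_n)^k.$

One sees that, if $|\alpha| = k$, then $\nu(\alpha)$ is equal to the product of the number of 
combinations of $k$ objects, taken $\alpha_1$ at a time, times the number of combinations 
of $k - \alpha_1$ objects, taken $\alpha_2$ at a time, and so on, times the number of combinations 
of $k - (\alpha_1 + \cdots + \alpha_{n-1})$ objects, taken $\alpha_n$ at a time. Thus 

(1.24)    $\nu(\alpha) = \dbinom{k}{\alpha_1} \dbinom{k - \alpha_1}{\alpha_2} \cdots \dbinom{k - \alpha_1 - \cdots - \alpha_{n-1}}{\alpha_n} = \dfrac{k!}{\alpha_1!\, \alpha_2! \cdots \alpha_n!}.$

In other words, for $|\alpha| = k$, 

(1.25)    $\nu(\alpha) = \dfrac{k!}{\alpha!}, \quad \text{where } \alpha! = \alpha_1! \cdots \alpha_n!.$

Thus, the Taylor formula (1.19) can be rewritten as 

(1.26)    $F(x) = \sum_{|\alpha| \le k} \dfrac{1}{\alpha!} F^{(\alpha)}(0)\, x^\alpha + R_k(x),$

where 

(1.27)    $R_k(x) = \sum_{|\alpha|=k+1} \dfrac{k+1}{\alpha!} \left( \int_0^1 (1 - s)^k F^{(\alpha)}(sx)\, ds \right) x^\alpha.$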
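
The count (1.22) and the closed form (1.25) can be compared directly for small $n$ and $k$. The following sketch enumerates all $k$-tuples $J$, groups them by the associated $\alpha(J)$ from (1.12), and checks the result against $k!/\alpha!$. The function names `nu_by_counting` and `nu_formula` and the choice $n = 3$, $k = 4$ are illustrative assumptions.

```python
from collections import Counter
from itertools import product
from math import factorial

def nu_by_counting(n, k):
    """Count, for each alpha, the k-tuples J in {1..n}^k with alpha = alpha(J)."""
    counts = Counter()
    for J in product(range(1, n + 1), repeat=k):
        # alpha_i = number of entries of J equal to i, as in (1.12)
        alpha = tuple(sum(1 for j in J if j == i) for i in range(1, n + 1))
        counts[alpha] += 1
    return counts

def nu_formula(alpha):
    """Closed form k!/alpha! from (1.25), with k = |alpha|."""
    denom = 1
    for a in alpha:
        denom *= factorial(a)
    return factorial(sum(alpha)) // denom

counts = nu_by_counting(3, 4)
all_match = all(c == nu_formula(alpha) for alpha, c in counts.items())
```

Summing `counts` over all $\alpha$ recovers $n^k$, which is (1.23) evaluated at $x = (1, \dots, 1)$.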
Exercises 


1. Let $M_{n \times n}$ be the space of complex $n \times n$ matrices, and let $\det : M_{n \times n} \to \mathbb{C}$ denote 
the determinant. Show that if $I$ is the identity matrix, then 

          $D\det(I)B = \operatorname{Tr} B,$

i.e., 

          $\dfrac{d}{dt} \det(I + tB) \Big|_{t=0} = \operatorname{Tr} B.$

2. If $A(t) = (a_{jk}(t))$ is a curve in $M_{n \times n}$, use the expansion of $(d/dt) \det A(t)$ as a sum 
of $n$ determinants, in which the rows of $A(t)$ are successively differentiated, to show 
that, for $A \in M_{n \times n}$, 

          $D\det(A)B = \operatorname{Tr}\bigl(\operatorname{Cof}(A)^t \cdot B\bigr),$

where $\operatorname{Cof}(A)$ is the cofactor matrix of $A$. 
3. Suppose $A \in M_{n \times n}$ is invertible. Using 

          $\det(A + tB) = (\det A) \det(I + tA^{-1}B),$

show that 

          $D\det(A)B = (\det A) \operatorname{Tr}(A^{-1}B).$

Comparing the result of Exercise 2, deduce Cramer’s formula: 

(1.28)    $(\det A)\, A^{-1} = \operatorname{Cof}(A)^t.$


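4. Identify $\mathbb{R}^2$ and $\mathbb{C}$ via $z = x + iy$. Then multiplication by $i$ on $\mathbb{C}$ corresponds to 
applying 

          $J = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}.$

Let $O \subset \mathbb{R}^2$ be open, and let $f : O \to \mathbb{R}^2$ be $C^1$. Say $f = (u, v)$. Regard $Df(x, y)$ 
as a $2 \times 2$ real matrix. One says $f$ is holomorphic, or complex analytic, provided the 
Cauchy–Riemann equations hold: 

(1.29)    $\dfrac{\partial u}{\partial x} = \dfrac{\partial v}{\partial y}, \quad \dfrac{\partial u}{\partial y} = -\dfrac{\partial v}{\partial x}.$

Show that this is equivalent to the condition 

          $Df(x, y)\, J = J\, Df(x, y).$

Generalize to $O$ open in $\mathbb{C}^m$, $f : O \to \mathbb{C}^n$. 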
5. If $R(x)$ is a $C^\infty$-function near the origin in $\mathbb{R}^n$, satisfying $R(0) = 0$ and $DR(0) = 0$, 
show that there exist smooth functions $r_{jk}(x)$ such that 

          $R(x) = \sum r_{jk}(x)\, x_j x_k.$

(Hint: Using (1.8), write $R(x) = \Phi(x)x$, $\Phi(x) = \int_0^1 DR(tx)\, dt$, since $R(0) = 0$. 
Then $\Phi(0) = DR(0) = 0$, so (1.8) can be applied again, to give $\Phi(x) = \Psi(x)x$.) 
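6. If $f$ is $C^1$ on a region in $\mathbb{R}^2$ containing $[a, b] \times \{y\}$, show that 

          $\dfrac{d}{dy} \int_a^b f(x, y)\, dx = \int_a^b \dfrac{\partial f}{\partial y}(x, y)\, dx.$

(Hint: Show that the left side is equal to 

          $\lim_{h \to 0} \dfrac{1}{h} \int_a^b \int_0^h \dfrac{\partial f}{\partial y}(x, y + s)\, ds\, dx.)$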


7. Suppose $F : O \to \mathbb{R}^m$ is a $C^2$-function. Applying the fundamental theorem of 
calculus, first to 

          $G_j(x) = F(x + he_j) - F(x)$

(as a function of $h$) and then to 

          $H_{jk}(x) = G_j(x + he_k) - G_j(x),$

where $\{e_j\}$ is the standard basis of $\mathbb{R}^n$, show that if $x \in O$ and $h$ is small, then 

          $F(x + he_j + he_k) - F(x + he_k) - F(x + he_j) + F(x) = \int_0^h \int_0^h \dfrac{\partial}{\partial x_k} \dfrac{\partial F}{\partial x_j}(x + se_j + te_k)\, ds\, dt.$

Similarly, show that this quantity is equal to 

          $\int_0^h \int_0^h \dfrac{\partial}{\partial x_j} \dfrac{\partial F}{\partial x_k}(x + se_j + te_k)\, dt\, ds.$

Deduce that 

          $\dfrac{\partial^2 F}{\partial x_k\, \partial x_j}(x) = \dfrac{\partial^2 F}{\partial x_j\, \partial x_k}(x).$

(Hint: Use Exercise 6.) 
Arguments that use the mean value theorem instead of the fundamental theorem of 
calculus can be found in many calculus texts. 


2. Fundamental local existence theorem for ODE 
The goal of this section is to establish the existence of solutions to an ODE: 


(2.1)    $\dfrac{dy}{dt} = F(t, y), \quad y(t_0) = y_0.$


We will prove the following fundamental result. 


Theorem 2.1. Let $y_0 \in O$, an open subset of $\mathbb{R}^n$, $I \subset \mathbb{R}$ an interval containing 
$t_0$. Suppose $F$ is continuous on $I \times O$ and satisfies the following Lipschitz 
estimate in $y$: 

(2.2)    $\|F(t, y_1) - F(t, y_2)\| \le L \|y_1 - y_2\|,$

for $t \in I$, $y_j \in O$. Then the equation (2.1) has a unique solution on some $t$-interval 
containing $t_0$. 


To begin the proof, we note that the equation (2.1) is equivalent to the integral equation 

(2.3)    $y(t) = y_0 + \int_{t_0}^t F(s, y(s))\, ds.$

Existence will be established via the Picard iteration method, which is the 
following. Guess $y_0(t)$, e.g., $y_0(t) = y_0$. Then set 

(2.4)    $y_k(t) = y_0 + \int_{t_0}^t F(s, y_{k-1}(s))\, ds.$

We aim to show that, as $k \to \infty$, $y_k(t)$ converges to a (unique) solution of (2.3), 
at least for $t$ close enough to $t_0$. To do this, we will use the following tool, known 
as the contraction mapping principle. 


Theorem 2.2. Let $X$ be a complete metric space, and let $T : X \to X$ satisfy 

(2.5)    $\operatorname{dist}(Tx, Ty) \le r \operatorname{dist}(x, y),$

for some $r < 1$. (We say that $T$ is a contraction.) Then $T$ has a unique fixed 
point $x$. For any $y_0 \in X$, $T^k y_0 \to x$ as $k \to \infty$. 


Proof. Pick $y_0 \in X$, and let $y_k = T^k y_0$. Then 

         $\operatorname{dist}(y_{k+1}, y_k) \le r^k \operatorname{dist}(y_1, y_0),$

so 

(2.6)    $\operatorname{dist}(y_{k+m}, y_k) \le \operatorname{dist}(y_{k+m}, y_{k+m-1}) + \cdots + \operatorname{dist}(y_{k+1}, y_k)$
         $\le \bigl(r^k + \cdots + r^{k+m-1}\bigr) \operatorname{dist}(y_1, y_0)$
         $\le r^k (1 - r)^{-1} \operatorname{dist}(y_1, y_0).$

It follows that $(y_k)$ is a Cauchy sequence, so it converges; $y_k \to x$. Since 
$Ty_k = y_{k+1}$ and $T$ is continuous, it follows that $Tx = x$, that is, $x$ is a fixed point. 
The uniqueness of the fixed point is clear from the estimate $\operatorname{dist}(Tx, Tx') \le r 
\operatorname{dist}(x, x')$, which implies $\operatorname{dist}(x, x') = 0$ if $x$ and $x'$ are fixed points. This 
completes the proof. 
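
A minimal sketch of Theorem 2.2 in action, under the assumption that $X = [0, 1]$ and $T = \cos$ (an illustrative choice, not from the text): by the mean value theorem $T$ maps $X$ to itself and satisfies (2.5) with $r = \sin 1 < 1$. The code iterates $y_k = T^k y_0$ and checks the Cauchy estimate (2.6) along the way.

```python
import math

def iterate(T, y0, steps):
    """Return the list [y_0, T y_0, T^2 y_0, ..., T^steps y_0]."""
    ys = [y0]
    for _ in range(steps):
        ys.append(T(ys[-1]))
    return ys

r = math.sin(1.0)              # contraction constant for cos on [0, 1]
ys = iterate(math.cos, 1.0, 100)
x_star = ys[-1]                # approximate fixed point
d0 = abs(ys[1] - ys[0])        # dist(y_1, y_0)

# Estimate (2.6) with m = 5: dist(y_{k+5}, y_k) <= r^k (1 - r)^{-1} dist(y_1, y_0).
bound_ok = all(
    abs(ys[k + 5] - ys[k]) <= r ** k / (1 - r) * d0 + 1e-15
    for k in range(50)
)
```

The bound in (2.6) is far from sharp here, since near the fixed point the local contraction rate is smaller than $\sin 1$.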


Tackling the solvability of (2.3), we look for a fixed point of $T$, defined by 

(2.7)    $(Ty)(t) = y_0 + \int_{t_0}^t F(s, y(s))\, ds.$

Let 

(2.8)    $X = \bigl\{ u \in C(J, \mathbb{R}^n) : u(t_0) = y_0, \ \sup_{t \in J} \|u(t) - y_0\| \le K \bigr\}.$

Here $J = [t_0 - \varepsilon, t_0 + \varepsilon]$, where $\varepsilon$ will be chosen, sufficiently small, below. $K$ is 
picked so $\{y : \|y - y_0\| \le K\}$ is contained in $O$, and we also suppose $J \subset I$. 
Then there exists an $M$ such that 

(2.9)    $\sup_{s \in J,\ \|y - y_0\| \le K} \|F(s, y)\| \le M.$

Then, provided 

(2.10)    $\varepsilon \le \dfrac{K}{M},$

we have 

(2.11)    $T : X \to X.$

Now, using the Lipschitz hypothesis (2.2), we have, for $t \in J$, 

(2.12)    $\|(Ty)(t) - (Tz)(t)\| \le \left| \int_{t_0}^t L \|y(s) - z(s)\|\, ds \right| \le \varepsilon L \sup_{s \in J} \|y(s) - z(s)\|,$

assuming $y$ and $z$ belong to $X$. It follows that $T$ is a contraction on $X$ provided 
one has 

(2.13)    $\varepsilon < \dfrac{1}{L},$

in addition to the hypotheses above. This proves Theorem 2.1. 
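
To make the Picard iteration concrete, here is a small sketch for the model problem $y' = y$, $y(0) = 1$ (an illustrative choice, with $t_0 = 0$). Each iterate is a polynomial, stored as a list of exact rational coefficients, and the iteration reproduces the Taylor partial sums of $e^t$. The function name `picard_step` is hypothetical.

```python
from fractions import Fraction

def picard_step(coeffs):
    """For y' = y, y(0) = 1: map y_k to y_{k+1}(t) = 1 + integral_0^t y_k(s) ds.

    Polynomials are lists of coefficients, constant term first.
    """
    # Integrate term by term: c * s^i  ->  c / (i + 1) * t^(i + 1).
    integrated = [Fraction(0)] + [c / (i + 1) for i, c in enumerate(coeffs)]
    integrated[0] += 1  # add the initial value y_0 = 1
    return integrated

y = [Fraction(1)]       # initial guess y_0(t) = 1
for _ in range(5):
    y = picard_step(y)  # after k steps: 1 + t + ... + t^k / k!
```

For this linear example the iterates converge for every $t$, although the proof above only guarantees convergence on a short interval $J$.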

In view of the lower bound on the length of the interval $J$ on which the existence 
theorem works, it is easy to show that the only way a solution can fail to be 
globally defined, that is, to exist for all $t \in I$, is for $y(t)$ to “explode to infinity” 
by leaving every compact set $K \subset O$, as $t \to t_1$, for some $t_1 \in I$. 

We remark that the local existence proof given above works if R” is replaced 
by any Banach space. 

Often one wants to deal with a higher-order ODE. There is a standard method 
of reducing an nth-order ODE 


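(2.14)    $y^{(n)} = f\bigl(t, y, y', \dots, y^{(n-1)}\bigr)$

to a first-order system. One sets $u = (u_0, \dots, u_{n-1})$, with 

(2.15)    $u_0 = y, \quad u_j = y^{(j)},$

and then 

(2.16)    $\dfrac{du}{dt} = \bigl(u_1, \dots, u_{n-1}, f(t, u_0, \dots, u_{n-1})\bigr) = g(t, u).$

If $y$ takes values in $\mathbb{R}^k$, then $u$ takes values in $\mathbb{R}^{kn}$. 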
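
The reduction (2.14)–(2.16) can be sketched in a few lines for scalar $y$. The helper `to_first_order` builds $g(t, u)$ from $f$, and a classical fourth-order Runge–Kutta integrator (a standard numerical scheme, not something used in the text) is applied to the test equation $y'' = -y$, $y(0) = 1$, $y'(0) = 0$, whose solution is $\cos t$. All names here are illustrative.

```python
import math

def to_first_order(f):
    """Given f(t, u0, ..., u_{n-1}) with y^(n) = f, return g with du/dt = g(t, u)."""
    def g(t, u):
        # (u_1, ..., u_{n-1}, f(t, u_0, ..., u_{n-1})), as in (2.16)
        return list(u[1:]) + [f(t, *u)]
    return g

def rk4(g, u, t0, t1, steps):
    """Integrate du/dt = g(t, u) from t0 to t1 with the classical RK4 scheme."""
    h = (t1 - t0) / steps
    t, u = t0, list(u)
    for _ in range(steps):
        k1 = g(t, u)
        k2 = g(t + h / 2, [a + h / 2 * b for a, b in zip(u, k1)])
        k3 = g(t + h / 2, [a + h / 2 * b for a, b in zip(u, k2)])
        k4 = g(t + h, [a + h * b for a, b in zip(u, k3)])
        u = [a + h / 6 * (p + 2 * q + 2 * r + s)
             for a, p, q, r, s in zip(u, k1, k2, k3, k4)]
        t += h
    return u

g = to_first_order(lambda t, y, yp: -y)     # y'' = -y
u_end = rk4(g, [1.0, 0.0], 0.0, 1.0, 200)   # u = (y, y') at t = 1
```

The first component of `u_end` approximates $\cos 1$ and the second approximates $-\sin 1$.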

If the system (2.1) is nonautonomous, that is, if $F$ explicitly depends on $t$, it 
can be converted to an autonomous system (one with no explicit $t$-dependence) as 
follows. Set $z = (t, y)$. We then have 

(2.17)    $z' = (1, y') = (1, F(z)) = G(z).$


Sometimes this process destroys important features of the original system (2.1). 
For example, if (2.1) is linear, (2.17) might be nonlinear. Nevertheless, the trick 
of converting (2.1) to (2.17) has some uses. 

Many systems of ODE are difficult to solve explicitly. One very basic class of 
ODE can be solved explicitly, in terms of integrals, namely the single first-order 
linear ODE: 


(2.18)    $\dfrac{dy}{dt} = a(t)y + b(t), \quad y(0) = y_0,$

where $a(t)$ and $b(t)$ are continuous real- or complex-valued functions. Set 

(2.19)    $A(t) = \int_0^t a(s)\, ds.$

Then (2.18) can be written as 

(2.20)    $e^{A(t)} \dfrac{d}{dt}\bigl(e^{-A(t)} y\bigr) = b(t),$

which yields 

(2.21)    $y(t) = e^{A(t)} y_0 + e^{A(t)} \int_0^t e^{-A(s)} b(s)\, ds.$

Compare this result with formulas (4.58) and (5.13), in subsequent sections of this 
chapter. 
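
Formula (2.21) can be checked numerically. The sketch below evaluates the right side of (2.21) with composite Simpson quadrature (a standard rule, chosen here for illustration) for the hypothetical data $a(t) = b(t) = \cos t$, for which $A(t) = \sin t$ and the exact solution is $y(t) = (y_0 + 1)e^{\sin t} - 1$. The names `simpson` and `solve_linear` are assumptions of this example.

```python
import math

def simpson(f, a, b, n=200):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def solve_linear(a, b, y0, t):
    """Evaluate (2.21): y(t) = e^{A(t)} y0 + e^{A(t)} * integral_0^t e^{-A(s)} b(s) ds."""
    A = lambda s: simpson(a, 0.0, s)   # A(s) from (2.19)
    integral = simpson(lambda s: math.exp(-A(s)) * b(s), 0.0, t)
    return math.exp(A(t)) * y0 + math.exp(A(t)) * integral

t, y0 = 1.5, 2.0
approx = solve_linear(math.cos, math.cos, y0, t)
exact = (y0 + 1.0) * math.exp(math.sin(t)) - 1.0
```

The nested quadrature is deliberately naive; it is meant only to mirror the structure of (2.19) and (2.21).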


Exercises 


1. Solve the initial-value problem 

          $\dfrac{dy}{dt} = y^2, \quad y(0) = a,$

given $a \in \mathbb{R}$. On what $t$-interval is the solution defined? 

2. Under the hypotheses of Theorem 2.1, if $y$ solves (2.1) for $t \in [T_0, T_1]$, and $y(t) \in K$, 
compact in $O$, for all such $t$, prove that $y(t)$ extends to a solution for $t \in [S_0, S_1]$, with 
$S_0 < T_0$, $S_1 > T_1$, as stated beneath (2.13). 

3. Let $M$ be a compact, smooth surface in $\mathbb{R}^n$. Suppose $F : \mathbb{R}^n \to \mathbb{R}^n$ is a smooth 
map (vector field) such that, for each $x \in M$, $F(x)$ is tangent to $M$, that is, the line 
$\gamma_x(t) = x + tF(x)$ is tangent to $M$ at $x$, at $t = 0$. Show that if $x \in M$, then the 
initial-value problem 

          $y' = F(y), \quad y(0) = x$

has a solution for all $t \in \mathbb{R}$, and $y(t) \in M$ for all $t$. 
(Hint: Locally, straighten out $M$ to be a linear subspace of $\mathbb{R}^n$, to which $F$ is tangent. 
Use uniqueness. Material in §3 will help do this local straightening.) 
Reconsider this problem after reading §7. 
4. Show that the initial-value problem 

          $\dfrac{dx}{dt} = -x(x^2 + y^2), \quad \dfrac{dy}{dt} = -y(x^2 + y^2), \quad x(0) = x_0, \ y(0) = y_0$

has a solution for all $t \ge 0$, but not for all $t < 0$, unless $(x_0, y_0) = (0, 0)$. 


3. Inverse function and implicit function theorems 


We will use the contraction mapping principle to establish the inverse function 
theorem, which together with its corollary, the implicit function theorem, is a 
fundamental result in multivariable calculus. First we state the inverse function 
theorem. 


Theorem 3.1. Let $F$ be a $C^k$-map from an open neighborhood $\Omega$ of $p_0 \in \mathbb{R}^n$ to 
$\mathbb{R}^n$, with $q_0 = F(p_0)$. Suppose the derivative $DF(p_0)$ is invertible. Then there is 
a neighborhood $U$ of $p_0$ and a neighborhood $V$ of $q_0$ such that $F : U \to V$ is 
one-to-one and onto, and $F^{-1} : V \to U$ is a $C^k$-map. (One says that $F : U \to V$ 
is a diffeomorphism.) 


First we show that $F$ is one-to-one on a neighborhood of $p_0$, under these 
hypotheses. The following result does this, and also points to situations where 
we can say that $F$ is a global diffeomorphism. 


Proposition 3.2. Let $\Omega \subset \mathbb{R}^n$ be open and convex, $F : \Omega \to \mathbb{R}^n$ of class $C^1$. 
Assume there exists $a < 1$ such that for all $u \in \Omega$, $\|DF(u) - I\| \le a$. Then $F$ is 
one-to-one on $\Omega$. 


Proof. We can write 

          $F(u) = u + R(u), \quad DR(u) = DF(u) - I, \quad \|DR(u)\| \le a.$

Hence, for $u, v \in \Omega$, 

          $u - v = \{F(u) - F(v)\} - \{R(u) - R(v)\},$

and the hypotheses imply $\|R(u) - R(v)\| \le a\|u - v\|$, so 

          $(1 - a)\|u - v\| \le \|F(u) - F(v)\|, \quad \forall\, u, v \in \Omega.$

This shows $F$ is one-to-one. 


Proof of Theorem 3.1. Using the chain rule, we can reduce to the case $p_0 = q_0 = 0$ 
and $DF(p_0) = I$, the identity matrix, so we suppose this has been done. Thus, 

(3.1)    $F(u) = u + R(u), \quad R(0) = 0, \quad DR(0) = 0.$

For $v$ small, we want to solve 

(3.2)    $F(u) = v.$

This is equivalent to $u + R(u) = v$, so let 

(3.3)    $T_v(u) = v - R(u).$

Thus, solving (3.2) is equivalent to solving 

(3.4)    $T_v(u) = u.$
We look for a fixed point $u = K(v) = F^{-1}(v)$. Also, we want to prove that 
$DK(0) = I$, that is, that $K(v) = v + r(v)$, with $r(v) = o(\|v\|)$. If we succeed in 
doing this, it follows easily that, for general $x$ close to 0, 

          $DK(x) = \bigl(DF(K(x))\bigr)^{-1},$

and a simple inductive argument shows that $K$ is $C^k$ if $F$ is $C^k$. Now consider 

(3.5)    $T_v : X_v \to X_v,$

with 

(3.6)    $X_v = \{u \in \Omega : \|u - v\| \le A_v\},$

where we set 

(3.7)    $A_v = \sup_{\|w\| \le 2\|v\|} \|R(w)\|.$


We claim that (3.5) holds if $\|v\|$ is sufficiently small. To prove this, note that 
$T_v(u) - v = -R(u)$, so we need to show that, provided $\|v\|$ is small, $u \in X_v$ 
implies $\|R(u)\| \le A_v$. But indeed, if $u \in X_v$, then $\|u\| \le \|v\| + A_v$, which is 
$\le 2\|v\|$ if $\|v\|$ is small, so then 

          $\|R(u)\| \le \sup_{\|w\| \le 2\|v\|} \|R(w)\| = A_v;$

this establishes (3.5). 
Note that if $\|v\|$ is small enough, the map (3.5) is a contraction map, so there 
exists a unique fixed point $u = K(v) \in X_v$. Also note that since $u \in X_v$, 

(3.8)    $\|K(v) - v\| \le A_v = o(\|v\|).$

Hence, the inverse function theorem is proved. 
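
The fixed-point scheme (3.3)–(3.4) is easy to try numerically. In the following sketch the map $R$ on $\mathbb{R}^2$ is a hypothetical quadratic example with $R(0) = 0$ and $DR(0) = 0$, and the right-hand side $v$ is chosen small so that $T_v$ is a contraction near $v$; the function name `invert` is illustrative.

```python
def R(u):
    """A hypothetical remainder term with R(0) = 0 and DR(0) = 0."""
    return (u[0] * u[1], u[0] ** 2 - u[1] ** 2)

def F(u):
    """F(u) = u + R(u), as in (3.1)."""
    r = R(u)
    return (u[0] + r[0], u[1] + r[1])

def invert(v, steps=50):
    """Solve F(u) = v by iterating T_v(u) = v - R(u), starting from u = v."""
    u = v
    for _ in range(steps):
        r = R(u)
        u = (v[0] - r[0], v[1] - r[1])
    return u

v = (0.05, -0.03)
u = invert(v)                                   # u approximates K(v) = F^{-1}(v)
residual = max(abs(F(u)[0] - v[0]), abs(F(u)[1] - v[1]))
offset = max(abs(u[0] - v[0]), abs(u[1] - v[1]))  # size of K(v) - v
```

Consistent with (3.8), `offset` is of the order of $\|v\|^2$, much smaller than $\|v\|$ itself.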


We can obtain the following implicit function theorem as a consequence of the 
inverse function theorem. 
Theorem 3.3. Suppose $U$ is a neighborhood of $x_0 \in \mathbb{R}^k$, $V$ is a neighborhood 
of $z_0 \in \mathbb{R}^\ell$, and 

(3.9)    $F : U \times V \to \mathbb{R}^\ell$

is a $C^k$-map. Assume $D_z F(x_0, z_0)$ is invertible; say $F(x_0, z_0) = u_0$. Then the 
equation $F(x, z) = u_0$ defines $z = f(x, u_0)$ for $x$ near $x_0$, with $f$ a $C^k$-map. 


Proof. Consider $H : U \times V \to \mathbb{R}^k \times \mathbb{R}^\ell$ defined by 

(3.10)    $H(x, z) = (x, F(x, z)).$

We have 

(3.11)    $DH = \begin{pmatrix} I & 0 \\ D_x F & D_z F \end{pmatrix}.$

Thus $DH(x_0, z_0)$ is invertible, so $J = H^{-1}$ exists and is $C^k$, by the inverse 
function theorem. It is clear that $J(x, u_0)$ has the form 

(3.12)    $J(x, u_0) = (x, f(x, u_0)),$

and $f$ is the desired map. 


As in §2, we remark that the inverse function theorem generalizes. One can 
replace R” by any Banach space and the proof of Theorem 3.1 given above 
extends with no change. Such generalizations are useful in nonlinear PDE, as 
we will see in Chap. 14. 


Exercises 


1. Suppose that $F : U \to \mathbb{R}^n$ is a $C^2$-map, $U$ is open in $\mathbb{R}^n$, $p \in U$, and $DF(p)$ is 
invertible. With $q = F(p)$, define a map $N$ on a neighborhood of $p$ by 

(3.13)    $N(x) = x + DF(x)^{-1}\bigl(q - F(x)\bigr).$

Show that there exists $\varepsilon > 0$ and $C < \infty$ such that, for $0 \le r < \varepsilon$, 

          $\|x - p\| \le r \implies \|N(x) - p\| \le Cr^2.$

Conclude that if $\|x_1 - p\| \le r$, with $r < \min(\varepsilon, 1/2C)$, then $x_{j+1} = N(x_j)$ defines a 
sequence converging very rapidly to $p$. This is the basis of Newton’s method, for solving 
$F(p) = q$ for $p$. 

(Hint: Write $x = p + y$, $F(x) = F(p) + DF(x)y + R$, with $R$ given as in (1.27), with 
$k = 2$. Then $N(x) = p + \tilde{y}$, $\tilde{y} = -DF(x)^{-1}R$.) 

2. Applying Newton’s method to $f(x) = 1/x$, show that you get a fast approximation to 
division using only addition and multiplication. 

(Hint: Carry out the calculation of N(x) in this case and notice a “miracle.”) 

3. Identify $\mathbb{R}^{2n}$ with $\mathbb{C}^n$ via $z = x + iy$, as in Exercise 4 of §1. Let $U \subset \mathbb{R}^{2n}$ be open, and 
let $F : U \to \mathbb{R}^{2n}$ be $C^1$. Assume that $p \in U$ and $DF(p)$ is invertible. If $F^{-1} : V \to U$ 
is given as in Theorem 3.1, show that $F^{-1}$ is holomorphic provided $F$ is. 

4. Let $O \subset \mathbb{R}^n$ be open. We say that a function $f \in C^\infty(O)$ is real analytic provided 
that, for each $x_0 \in O$, we have a convergent power-series expansion 

(3.14)    $f(x) = \sum_{\alpha \ge 0} \dfrac{1}{\alpha!} f^{(\alpha)}(x_0)\, (x - x_0)^\alpha,$

valid in a neighborhood of $x_0$. Show that we can let $x$ be complex in (3.14), and obtain 
an extension of $f$ to a neighborhood of $O$ in $\mathbb{C}^n$. Show that the extended function is 
holomorphic, that is, satisfies the Cauchy–Riemann equations. 


Remark. It can be shown that, conversely, any holomorphic function has a power-series 
expansion. See (2.30) of Chap. 3 for one such proof. For the next exercise, assume this 
to be known. 

5. Let $O \subset \mathbb{R}^n$ be open, $p \in O$, and $f : O \to \mathbb{R}^n$ be real analytic, with $Df(p)$ invertible. 
Take $f^{-1} : V \to U$ as in Theorem 3.1. Show $f^{-1}$ is real analytic. 
(Hint: Consider a holomorphic extension $F : \Omega \to \mathbb{C}^n$ of $f$, and apply Exercise 3.) 


4. Constant-coefficient linear systems; exponentiation 
of matrices 


Let $A$ be an $n \times n$ matrix, real or complex. We consider the linear ODE 

(4.1)    $\dfrac{dy}{dt} = Ay, \quad y(0) = y_0.$

In analogy to the scalar case, we can produce the solution in the form 

(4.2)    $y(t) = e^{tA} y_0,$

where we define the matrix exponential 

(4.3)    $e^{tA} = \sum_{k=0}^\infty \dfrac{t^k}{k!} A^k.$

We will establish estimates implying the convergence of this infinite series for 
all real $t$, indeed for all complex $t$. Then term-by-term differentiation is valid and 
gives (4.1). To discuss convergence of (4.3), we need the notion of the norm of 
a matrix. This is a special case of results discussed in Appendix A, Functional 
Analysis. 
If $u = (u_1, \dots, u_n)$ belongs to $\mathbb{R}^n$ or to $\mathbb{C}^n$, set, as in (1.5), 

(4.4)    $\|u\| = \bigl( |u_1|^2 + \cdots + |u_n|^2 \bigr)^{1/2}.$

Then, if $A$ is an $n \times n$ matrix, set 

(4.5)    $\|A\| = \sup\{ \|Au\| : \|u\| \le 1 \}.$

The norm (4.4) possesses the following properties: 

(4.6)    $\|u\| \ge 0, \quad \|u\| = 0$ if and only if $u = 0$, 
(4.7)    $\|cu\| = |c|\, \|u\|$, for real or complex $c$, 
(4.8)    $\|u + v\| \le \|u\| + \|v\|.$

The last property, known as the triangle inequality, follows from Cauchy’s 
inequality: 

(4.9)    $|(u, v)| \le \|u\| \cdot \|v\|,$

where the inner product is $(u, v) = u_1 \bar{v}_1 + \cdots + u_n \bar{v}_n$. To deduce (4.8) from 
(4.9), just square both sides of (4.8). To prove (4.9), use $(u - v, u - v) \ge 0$ to get 

          $2 \operatorname{Re}(u, v) \le \|u\|^2 + \|v\|^2.$

Then replace $u$ by $e^{i\theta}u$ to deduce 

          $2|(u, v)| \le \|u\|^2 + \|v\|^2.$

Next, replace $u$ by $tu$ and $v$ by $t^{-1}v$, to get 

          $2|(u, v)| \le t^2 \|u\|^2 + t^{-2} \|v\|^2,$

for any $t > 0$. Picking $t$ so that $t^2 = \|v\| / \|u\|$, we have Cauchy’s inequality (4.9). 
Given (4.6)–(4.8), we easily get 

(4.10)    $\|A\| \ge 0, \quad \|cA\| = |c|\, \|A\|, \quad \|A + B\| \le \|A\| + \|B\|.$

Also, $\|A\| = 0$ if and only if $A = 0$. The fact that $\|A\|$ is the smallest constant $K$ 
such that $\|Au\| \le K\|u\|$ gives 

(4.11)    $\|AB\| \le \|A\| \cdot \|B\|.$

In particular, 

(4.12)    $\|A^k\| \le \|A\|^k.$

This makes it easy to check the convergence of the power-series (4.3). Let us note 
that 

          $\dfrac{d}{dt} e^{tA} = A e^{tA} = e^{tA} A.$

Power-series manipulations can be used to establish the identity 

(4.13)    $e^{sA} e^{tA} = e^{(s+t)A}.$
Another way to prove this is as follows. Regard $t$ as fixed; denote the left side of 
(4.13) as $X(s)$ and the right side as $Y(s)$. Then differentiation with respect to $s$ 
gives, respectively, 

(4.14)    $X'(s) = AX(s), \quad X(0) = e^{tA},$
          $Y'(s) = AY(s), \quad Y(0) = e^{tA},$

so the uniqueness of solutions to the ODE implies $X(s) = Y(s)$ for all $s$. For 
another approach, we compute 

(4.15)    $\dfrac{d}{dt}\bigl( e^{(s+t)A} e^{-tA} \bigr) = e^{(s+t)A} A e^{-tA} - e^{(s+t)A} A e^{-tA} = 0,$

hence $e^{(s+t)A} e^{-tA}$ is independent of $t$, so 

(4.16)    $e^{(s+t)A} e^{-tA} = e^{sA}, \quad \forall\, s, t \in \mathbb{R}.$

Taking $s = 0$ gives $e^{tA} e^{-tA} = I$, so $e^{-tA} = (e^{tA})^{-1}$. Then multiplying (4.16) 
on the right by $e^{tA}$ gives (4.13). 
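
The series (4.3) and the identity (4.13) can be explored with a short pure-Python sketch. The truncation at 40 terms, the $2 \times 2$ test matrix, and the helper names `mat_mul`, `mat_exp`, and `max_diff` are illustrative assumptions; the estimate (4.12) is what makes such a truncation reasonable for matrices of moderate norm.

```python
def mat_mul(A, B):
    """Product of two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(A, t=1.0, terms=40):
    """Partial sum of the series (4.3): sum over k < terms of t^k A^k / k!."""
    n = len(A)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]          # current term, starts at I
    for k in range(1, terms):
        term = mat_mul(term, A)
        term = [[t * x / k for x in row] for row in term]   # now t^k A^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

def max_diff(X, Y):
    """Largest entrywise difference between two matrices."""
    return max(abs(x - y) for rx, ry in zip(X, Y) for x, y in zip(rx, ry))

A = [[0.0, 1.0], [-1.0, 0.0]]
s, t = 0.4, 1.1
gap = max_diff(mat_mul(mat_exp(A, s), mat_exp(A, t)), mat_exp(A, s + t))  # (4.13)
inv_gap = max_diff(mat_mul(mat_exp(A, t), mat_exp(A, -t)),
                   [[1.0, 0.0], [0.0, 1.0]])                              # e^{tA} e^{-tA} = I
```

For this particular $A$ the exponential $e^{tA}$ is a rotation matrix, so both gaps are at the level of rounding error.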
We note that (4.13) is a special case of the following. 

Proposition 4.1. $e^{t(A+B)} = e^{tA} e^{tB}$ for all $t$, if and only if $A$ and $B$ commute. 

Proof. Let 

(4.17)    $Y(t) = e^{t(A+B)}, \quad Z(t) = e^{tA} e^{tB}.$

Note that $Y(0) = Z(0) = I$, so it suffices to show that $Y(t)$ and $Z(t)$ satisfy the 
same ODE, to deduce that they coincide. Clearly, 

(4.18)    $Y'(t) = (A + B)Y(t).$

Meanwhile, 

(4.19)    $Z'(t) = A e^{tA} e^{tB} + e^{tA} B e^{tB}.$

Thus we get the equation (4.18) for $Z(t)$ provided we know that 

(4.20)    $e^{tA} B = B e^{tA}$ if $AB = BA$. 

This follows from the power-series expansion for $e^{tA}$, together with the fact that 

(4.21)    $A^k B = B A^k, \quad \forall\, k \ge 0,$ if $AB = BA$. 

For the converse, if $Y(t) = Z(t)$ for all $t$, then $e^{tA} B = B e^{tA}$, by (4.19), and 
hence, taking the $t$-derivative, $e^{tA} AB = BA e^{tA}$; setting $t = 0$ gives $AB = BA$. 
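
A small worked instance of Proposition 4.1 at $t = 1$, using a hypothetical pair of non-commuting $2 \times 2$ matrices chosen so that every exponential can be written down in closed form from the series (4.3). The matrix names and the helper `mat_mul` are illustrative.

```python
import math

def mat_mul(X, Y):
    """Product of two 2 x 2 matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0, 0.0], [1.0, 0.0]]

# AB - BA is nonzero, so A and B do not commute.
commutator = [[p - q for p, q in zip(r1, r2)]
              for r1, r2 in zip(mat_mul(A, B), mat_mul(B, A))]

# A and B are nilpotent (A^2 = B^2 = 0), so the series (4.3) stops after two terms:
exp_A = [[1.0, 1.0], [0.0, 1.0]]   # I + A
exp_B = [[1.0, 0.0], [1.0, 1.0]]   # I + B
prod = mat_mul(exp_A, exp_B)       # e^A e^B

# (A + B)^2 = I, so the series (4.3) sums to cosh(1) I + sinh(1) (A + B):
c, s = math.cosh(1.0), math.sinh(1.0)
exp_sum = [[c, s], [s, c]]         # e^{A+B}
```

Here $e^A e^B$ has upper-left entry 2 while $e^{A+B}$ has upper-left entry $\cosh 1 \approx 1.543$, so the two sides of Proposition 4.1 differ, as the proposition predicts for a non-commuting pair.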


For an alternative approach to the first part, we can compute 

          $\dfrac{d}{dt}\bigl( e^{t(A+B)} e^{-tA} e^{-tB} \bigr)$

and use (4.20) to show this is zero if $AB = BA$. We leave the details to the reader. 
If $A$ is in diagonal form, 

(4.22)    $A = \begin{pmatrix} a_1 & & \\ & \ddots & \\ & & a_n \end{pmatrix},$

then clearly 

(4.23)    $e^{tA} = \begin{pmatrix} e^{ta_1} & & \\ & \ddots & \\ & & e^{ta_n} \end{pmatrix}.$

The following result makes it useful to diagonalize $A$ in order to compute $e^{tA}$. 

Proposition 4.2. If $K$ is an invertible matrix and $B = KAK^{-1}$, then 

(4.24)    $e^{tB} = K e^{tA} K^{-1}.$

Proof. This follows from the power-series expansion (4.3), given the observation 
that 

(4.25)    $B^k = K A^k K^{-1}.$

In view of (4.22)–(4.24), it is convenient to record a few standard results about 
eigenvalues and eigenvectors here. Let $A$ be an $n \times n$ matrix over $F$, $F = \mathbb{R}$ or 
$\mathbb{C}$. An eigenvector of $A$ is a nonzero $u \in F^n$ such that 

(4.26)    $Au = \lambda u,$

for some $\lambda \in F$. Such an eigenvector exists if and only if $A - \lambda I : F^n \to F^n$ is 
not invertible, that is, if and only if 

(4.27)    $\det(A - \lambda I) = 0.$

Now (4.27) is a polynomial equation, so it always has a complex root. This proves 
the following. 

Proposition 4.3. Given an $n \times n$ matrix $A$, there exists at least one (complex) 
eigenvector $u$. 


Of course, if $A$ is real, and we know there is a real root of (4.27) (e.g., if $n$ is 
odd), then a real eigenvector exists. One important class of matrices guaranteed to 
have real eigenvalues is the class of self-adjoint matrices. The adjoint of an $n \times n$ 
complex matrix is specified by the identity $(Au, v) = (u, A^*v)$. 


Proposition 4.4. If $A = A^*$, then all eigenvalues of $A$ are real. 

Proof. $Au = \lambda u$ implies 

(4.28)    $\lambda \|u\|^2 = (\lambda u, u) = (Au, u) = (u, Au) = (u, \lambda u) = \bar{\lambda} \|u\|^2.$

Hence $\lambda = \bar{\lambda}$, if $u \ne 0$. 
We now establish the following important result. 


Theorem 4.5. If $A = A^*$, then there is an orthonormal basis of $\mathbb{C}^n$ consisting of 
eigenvectors of $A$. 


Proof. Let $u_1$ be one unit eigenvector; $Au_1 = \lambda u_1$. Existence is guaranteed by 
Proposition 4.3. Let $V = (u_1)^\perp$ be the orthogonal complement of the linear span 
of $u_1$. Then $\dim V$ is $n - 1$ and 

(4.29)    $A : V \to V$, if $A = A^*$. 

The result follows by induction on $n$. 


Corollary 4.6. If $A = A^t$ is a real symmetric matrix, then there is an 
orthonormal basis of $\mathbb{R}^n$ consisting of eigenvectors of $A$. 


Proof. By Proposition 4.4 and the remarks following Proposition 4.3, there is one 
unit eigenvector $u_1 \in \mathbb{R}^n$. The rest of the proof is as above. 


The proofs of the last four results rest on the fact that every nonconstant 
polynomial has a complex root. This is the fundamental theorem of algebra. A 
proof is given in §20 (Exercise 5), and another after Corollary 4.7 of Chap. 3. 
An alternative approach to Proposition 4.3 when $A = A^*$, yielding Proposition 
4.4–Corollary 4.6, is given in one of the exercises at the end of this section. 

Given an ODE in upper triangular form, 

(4.30)    $\dfrac{dy}{dt} = \begin{pmatrix} a_{11} & * & * \\ & \ddots & * \\ & & a_{nn} \end{pmatrix} y,$

you can solve the last ODE for $y_n$, as it is just $dy_n/dt = a_{nn} y_n$. Then you get a 
single nonhomogeneous ODE for $y_{n-1}$, which can be solved as demonstrated in 
(2.18)–(2.21), and you can continue inductively to solve. Thus, it is often useful 
to be able to put an $n \times n$ matrix $A$ in upper triangular form, with respect to a 
convenient choice of basis. We will establish two results along these lines. The 
first is due to Schur. 


Theorem 4.7. For any n x n matrix A, there is an orthonormal basis uy, ..., Un 
of C” with respect to which A is in upper triangular form. 


This result is equivalent to the following proposition. 


Proposition 4.8. For any $A$, there is a sequence of vector spaces $V_j$ of dimension
$j$, contained in $\mathbb{C}^n$, with

(4.31) $V_n \supset V_{n-1} \supset \cdots \supset V_1$

and

(4.32) $A : V_j \to V_j.$


To see the equivalence, if we are granted (4.31)-(4.32), pick $u_n \perp V_{n-1}$, a unit
vector, then pick $u_{n-1} \in V_{n-1}$ such that $u_{n-1} \perp V_{n-2}$, and so forth. Meanwhile,
Proposition 4.8 is a simple inductive consequence of the following result.


Lemma 4.9. For any matrix $A$ acting on $V_n$, there is a linear subspace $V_{n-1}$, of
codimension 1, such that $A : V_{n-1} \to V_{n-1}$.


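Proof. Use Proposition 4.3, applied to $A^*$. There is a vector $v_1$ such that $A^*v_1 =
\lambda v_1$. Let $V_{n-1} = (v_1)^\perp$. If $v \in V_{n-1}$, then $(Av, v_1) = (v, A^*v_1) = \bar{\lambda}(v, v_1) = 0$,
so $A : V_{n-1} \to V_{n-1}$. This completes the proof of the lemma, hence of
Theorem 4.7.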


Let us look more closely at what you can say about solutions to an ODE that
has been put in the form (4.30). As mentioned, we can obtain $y_j$ inductively by
solving nonhomogeneous scalar ODEs

(4.33) $\frac{dy_j}{dt} = a_{jj}y_j + b_j(t),$

where $b_j(t)$ is a linear combination of $y_{j+1}(t), \dots, y_n(t)$, and the formula (2.21)
applies, with $A(t) = a_{jj}t$. We have $y_n(t) = Ce^{a_{nn}t}$, so $b_{n-1}(t)$ is a multiple
of $e^{a_{nn}t}$. If $a_{n-1,n-1} \neq a_{nn}$, $y_{n-1}(t)$ will be a linear combination of $e^{a_{nn}t}$
and $e^{a_{n-1,n-1}t}$, but if $a_{n-1,n-1} = a_{nn}$, $y_{n-1}(t)$ may be a linear combination
of $e^{a_{nn}t}$ and $te^{a_{nn}t}$. Further integration will involve $\int p(t)e^{\alpha t}\,dt$, where $p(t)$ is a
polynomial. That no other sort of function will arise is guaranteed by the following
result.

Lemma 4.10. If $p(t) \in P_n$, the space of polynomials of degree $\le n$, and $\alpha \neq 0$,
then

(4.34) $\int p(t)e^{\alpha t}\,dt = q(t)e^{\alpha t} + C,$

for some $q(t) \in P_n$.


Proof. The map $p = Tq$ defined by $(d/dt)(q(t)e^{\alpha t}) = p(t)e^{\alpha t}$ is a map on $P_n$;
in fact, we have

(4.35) $Tq(t) = \alpha q(t) + q'(t).$

It suffices to show that $T : P_n \to P_n$ is invertible. But $D = d/dt$ is nilpotent on
$P_n$; $D^{n+1} = 0$. Hence

$T^{-1} = \alpha^{-1}(I + \alpha^{-1}D)^{-1} = \alpha^{-1}\bigl(I - \alpha^{-1}D + \cdots + \alpha^{-n}(-D)^n\bigr).$


Note that this gives a neat formula for the integral (4.34). For example,

(4.36) $\int t^n e^{-t}\,dt = -(t^n + nt^{n-1} + \cdots + n!)e^{-t} + C = -n!\Bigl(1 + t + \frac{1}{2}t^2 + \cdots + \frac{1}{n!}t^n\Bigr)e^{-t} + C.$

This could also be established by integration by parts and induction. Of course,
when $\alpha = 0$ in (4.34), the result is different; $q(t)$ is a polynomial of degree $n + 1$.
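The inversion of $T$ in the proof of Lemma 4.10 can be carried out mechanically. The following is a minimal sketch in Python, using exact rational arithmetic; the function names `derivative` and `exp_antiderivative` and the convention of storing polynomial coefficients lowest degree first are our own illustrative choices, not notation from the text.

```python
from fractions import Fraction

def derivative(coeffs):
    """Coefficients of p'(t), for p given lowest degree first."""
    return [k * c for k, c in enumerate(coeffs)][1:] or [Fraction(0)]

def exp_antiderivative(p, alpha):
    """Return q with d/dt (q(t) e^{alpha t}) = p(t) e^{alpha t}, alpha != 0.

    Implements q = alpha^{-1} (I - D/alpha + D^2/alpha^2 - ...) p, which is a
    finite sum because D = d/dt is nilpotent on polynomials of bounded degree.
    """
    alpha = Fraction(alpha)
    if alpha == 0:
        raise ValueError("alpha must be nonzero (see the remark on alpha = 0)")
    q = [Fraction(0)] * len(p)
    term = [Fraction(c) for c in p]          # holds (-D/alpha)^k p
    for _ in range(len(p)):
        for i, c in enumerate(term):
            q[i] += c / alpha
        term = [-c / alpha for c in derivative(term)]
    return q

# p(t) = t^2, alpha = -1: the case n = 2 of (4.36), q(t) = -(t^2 + 2t + 2).
q = exp_antiderivative([0, 0, 1], -1)
```

Here `q` holds the coefficients $-2, -2, -1$, in agreement with (4.36) for $n = 2$.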

Now the implication for the solution to (4.30) is that all the components of y(t) 
are products of polynomials and exponentials. By Theorem 4.7, we can draw the 
same conclusion about the solution to dy/dt = Ay for any n x n matrix A. We 
can formally state the result as follows. 


Proposition 4.11. For any $n \times n$ matrix $A$,

(4.37) $e^{tA}v = \sum_j e^{\lambda_j t}v_j(t),$

where $\{\lambda_j\}$ is the set of eigenvalues of $A$ and $v_j(t)$ are $\mathbb{C}^n$-valued polynomials.
All the $v_j(t)$ are constant when $A$ is diagonalizable.


To see that the $\lambda_j$ are the eigenvalues of $A$, note that in the upper triangular case
only the exponentials $e^{a_{jj}t}$ arise, and in that case the eigenvalues are precisely the
diagonal elements.
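As a small worked illustration (our own example, not taken from the text), consider the simplest nondiagonalizable case, a $2 \times 2$ matrix with a single eigenvalue $\lambda$. Since $\lambda I$ and $N$ commute and $N^2 = 0$, the exponential can be computed directly, and a genuinely nonconstant polynomial factor appears:

```latex
\begin{align*}
A &= \begin{pmatrix} \lambda & 1 \\ 0 & \lambda \end{pmatrix} = \lambda I + N,
\qquad N = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad N^2 = 0, \\
e^{tA} &= e^{\lambda t} e^{tN} = e^{\lambda t}(I + tN)
        = e^{\lambda t}\begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}, \\
e^{tA} v &= e^{\lambda t}\begin{pmatrix} v_1 + t v_2 \\ v_2 \end{pmatrix},
\qquad v = \begin{pmatrix} v_1 \\ v_2 \end{pmatrix}.
\end{align*}
```

Thus (4.37) holds with a single term, and the polynomial $v(t) = (v_1 + tv_2, v_2)^t$ is constant only when $v_2 = 0$, that is, when $v$ is an eigenvector.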

If we let $E_\lambda$ denote the space of $\mathbb{C}^n$-valued functions of the form $V(t) =
e^{\lambda t}v(t)$, where $v(t)$ is a $\mathbb{C}^n$-valued polynomial, then $E_\lambda$ is invariant under the
action of both $d/dt$ and $A$, hence of $d/dt - A$. Hence, if a sum $V_1(t) + \cdots +
V_k(t)$, $V_j(t) \in E_{\lambda_j}$ (with $\lambda_j$s distinct), is annihilated by $d/dt - A$, so is each term
in this sum.

Therefore, if (4.37) is a sum over the distinct eigenvalues $\lambda_j$ of $A$, it follows
that each term $e^{\lambda_j t}v_j(t)$ is annihilated by $d/dt - A$ or, equivalently, is of the form
$e^{tA}w_j$, where $w_j = v_j(0)$. This leads to the following conclusion. Set

(4.38) $G_\lambda = \{v \in \mathbb{C}^n : e^{tA}v = e^{t\lambda}v(t),\ v(t) \text{ polynomial}\}.$

Then $\mathbb{C}^n$ has a direct-sum decomposition

(4.39) $\mathbb{C}^n = G_{\lambda_1} + \cdots + G_{\lambda_k},$

where $\lambda_1, \dots, \lambda_k$ are the distinct eigenvalues of $A$. Furthermore, each $G_{\lambda_j}$ is
invariant under $A$, and

(4.40) $A_j = A|_{G_{\lambda_j}}$ has exactly one eigenvalue, $\lambda_j$.
a 

This last statement holds because $e^{tA}v$ involves only the exponential $e^{\lambda_j t}$, when
$v \in G_{\lambda_j}$. We say that $G_{\lambda_j}$ is the generalized eigenspace of $A$, with eigenvalue
$\lambda_j$. Of course, $G_{\lambda_j}$ contains $\ker(A - \lambda_j I)$. Now $B_j = A_j - \lambda_j I$ has only 0 as an
eigenvalue. It is subject to the following result.


Lemma 4.12. If $N : \mathbb{C}^k \to \mathbb{C}^k$ has only 0 as an eigenvalue, then $N$ is nilpotent;
in fact,

(4.41) $N^m = 0$ for some $m \le k$.


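Proof. Let $W_j = N^j(\mathbb{C}^k)$; then $\mathbb{C}^k \supset W_1 \supset W_2 \supset \cdots$ is a sequence of finite-
dimensional vector spaces, each invariant under $N$. This sequence must stabilize,
so for some $m \le k$, $N : W_m \to W_m$ bijectively. If $W_m \neq 0$, $N$ has a nonzero
eigenvalue, contrary to hypothesis. Hence $W_m = 0$, that is, $N^m = 0$.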


We next discuss the famous Jordan normal form of a complex n x n matrix. 
The result is the following. 


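Theorem 4.13. If $A$ is an $n \times n$ matrix, then there is a basis of $\mathbb{C}^n$ with respect
to which $A$ becomes a direct sum of blocks of the form

(4.42) $\begin{pmatrix} \lambda_j & 1 & & \\ & \lambda_j & \ddots & \\ & & \ddots & 1 \\ & & & \lambda_j \end{pmatrix}.$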


These blocks are known as Jordan blocks. In light of the decomposition (4.39)
and Lemma 4.12, it suffices to establish the Jordan normal form for a $k \times k$ nilpotent
matrix $N$. (Then $\lambda_j = 0$.) We turn to this task.

Given $v_0 \in V = \mathbb{C}^k$, let $m$ be the smallest integer such that $N^m v_0 = 0$; $m \le
k$. If $m = k$, then $\{v_0, Nv_0, \dots, N^{m-1}v_0\}$ gives a basis of $V$, putting $N$ in Jordan
normal form. In any case, we call $\{v_0, \dots, N^{m-1}v_0\}$ a Jordan string (or string,
for short). To obtain a Jordan canonical form for $N$, it will suffice to find a basis
of $V$ consisting of a family of strings. We will show that this can be done by
induction on $\dim V$. This result is clear for $\dim V \le 1$.

So, given a nilpotent $N : V \to V$, we can assume inductively that $V_1 = N(V)$
has a basis that is a union of strings:

(4.43) $\{v_j, Nv_j, \dots, N^{\ell_j}v_j\}, \quad 1 \le j \le d.$

Furthermore, each $v_j$ has the form $v_j = Nw_j$ for some $w_j \in V$. Hence we have
the following strings in $V$:

(4.44) $\{w_j, v_j = Nw_j, Nv_j, \dots, N^{\ell_j}v_j\}, \quad 1 \le j \le d.$
We claim that the vectors in (4.44) are linearly independent. To see this, we apply
$N$ to a linear combination and invoke the independence of the vectors in (4.43).
In more detail, suppose there is a linear dependence relation

(4.45) $\sum_{j=1}^d b_j w_j + \sum_{j=1}^d \sum_{\ell=0}^{\ell_j} a_{j\ell}N^\ell v_j = 0.$

Applying $N$ yields

(4.46) $\sum_{j=1}^d b_j v_j + \sum_{j=1}^d \sum_{\ell=0}^{\ell_j - 1} a_{j\ell}N^{\ell+1}v_j = 0.$

This is a linear dependence relation among the vectors listed in (4.43), so

(4.47) $b_j = 0, \quad a_{j\ell} = 0, \quad \forall\, j \in \{1, \dots, d\},\ \ell \le \ell_j - 1.$

Hence (4.45) yields

(4.48) $\sum_{j=1}^d a_{j\ell_j}N^{\ell_j}v_j = 0,$

again a linear dependence relation among vectors listed in (4.43), so

(4.49) $a_{j\ell_j} = 0, \quad \forall\, j \in \{1, \dots, d\},$

and we have the linear independence of all the vectors listed in (4.44).
To proceed, note that the vectors in

(4.50) $\{N^{\ell_j}v_j : 1 \le j \le d\}$

all belong to $\ker N$ and are linearly independent. If this set does not span $\ker N$,
complete it to a basis of $\ker N$, by adding

(4.51) $\xi_1, \dots, \xi_\nu.$


We now claim that the vectors listed in (4.44) and (4.51) are linearly independent.
Indeed, suppose there is a linear dependence relation

(4.52) $\sum_{i=1}^\nu c_i\xi_i + \sum_{j=1}^d b_j w_j + \sum_{j=1}^d \sum_{\ell=0}^{\ell_j} a_{j\ell}N^\ell v_j = 0.$

Applying $N$ yields an identity of the form (4.46), which in turn yields identities
of the form (4.47). Hence (4.52) yields

(4.53) $\sum_{i=1}^\nu c_i\xi_i + \sum_{j=1}^d a_{j\ell_j}N^{\ell_j}v_j = 0,$

thus yielding

(4.54) $c_i = 0, \quad \forall\, i \in \{1, \dots, \nu\}, \qquad a_{j\ell_j} = 0, \quad \forall\, j \in \{1, \dots, d\},$

since (4.50)-(4.51) form a basis of $\ker N$. We have the asserted linear independence
of

(4.55) $\{w_j, v_j, \dots, N^{\ell_j}v_j\}, \quad 1 \le j \le d, \qquad \{\xi_1\}, \dots, \{\xi_\nu\}.$

Finally, we claim this is a basis of $V$.
To see this, note that the number of vectors in (4.44) is $\dim N(V) + d$, while
$\dim \ker N = d + \nu$. Hence the number of vectors in (4.55) is

(4.56) $\dim N(V) + d + \nu = \dim N(V) + \dim \ker N = \dim V.$

Thus (4.55) yields a basis of $V$, and the strings (4.44) together with $\{\xi_1\}, \dots, \{\xi_\nu\}$
form a string basis of $V$. This proves Theorem 4.13.


The argument above is part of an argument of Filippov. In fact, Filippov’s proof 
contains a further clever twist, enabling one to prove Theorem 4.13 without using 
the decomposition (4.39). However, since we got this decomposition almost for 
free as a byproduct of the ODE analysis in Proposition 4.11, this author decided 
to make use of it. See Strang [Str] for Filippov’s proof. 

We have seen how constructing $e^{tA}$ solves the equation (4.1). We can also use it to solve
a nonhomogeneous equation, of the form

(4.57) $y' = Ay + b(t), \quad y(0) = y_0.$

Indeed, if we set $y(t) = e^{tA}x(t)$, then $y'(t) = Ay(t) + e^{tA}x'(t)$, so (4.57) is
equivalent to

$x'(t) = e^{-tA}b(t), \quad x(0) = y_0,$

hence

$x(t) = y_0 + \int_0^t e^{-sA}b(s)\,ds.$

Applying $e^{tA}$ to both sides yields

(4.58) $y(t) = e^{tA}y_0 + \int_0^t e^{(t-s)A}b(s)\,ds.$
0 


Note how this partially generalizes the formula (2.21). This formula is a special 
case of Duhamel’s principle, which will be discussed further in §5. 
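Formula (4.58) is easy to test numerically. The sketch below is an illustration under our own assumptions: the helper names (`mat_mul`, `mat_vec`, `expm`, `duhamel`) are hypothetical, the exponential is computed by truncating the power series (4.3), which is adequate only for small matrices of modest norm, and the integral is approximated by Simpson's rule.

```python
def mat_mul(A, B):
    """Product of two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_vec(A, v):
    """Matrix-vector product."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def expm(A, terms=30):
    """Truncated power series I + A + A^2/2! + ... (small matrices only)."""
    n = len(A)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, A)]
        result = [[result[i][j] + term[i][j] for j in range(n)]
                  for i in range(n)]
    return result

def duhamel(A, b, y0, t, steps=200):
    """Evaluate (4.58): e^{tA} y0 + int_0^t e^{(t-s)A} b(s) ds (Simpson, steps even)."""
    def integrand(s):
        E = expm([[(t - s) * a for a in row] for row in A])
        return mat_vec(E, b(s))
    h = t / steps
    total = [0.0] * len(y0)
    for k in range(steps + 1):
        w = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total = [acc + w * f for acc, f in zip(total, integrand(k * h))]
    homogeneous = mat_vec(expm([[t * a for a in row] for row in A]), y0)
    return [y + h / 3 * s for y, s in zip(homogeneous, total)]

# y1' = y2, y2' = 1, y(0) = (1, 0); the exact solution is y = (1 + t^2/2, t).
y = duhamel([[0.0, 1.0], [0.0, 0.0]], lambda s: [0.0, 1.0], [1.0, 0.0], 2.0)
```

In this example $A$ is nilpotent, so $e^{tA} = I + tA$ and the integrand is linear in $s$; the computed `y` agrees with the exact value $(3, 2)$ at $t = 2$ up to rounding.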

We remark that the definition of $e^{tA}$ by power series (4.3) extends to the
case where $A$ is a bounded linear operator on a Banach space. In that case, $e^{tA}$
furnishes the simplest sort of example of a one-parameter group of operators.
Compare §9 in Appendix A, Functional Analysis, for a further discussion of semigroups
of operators. A number of problems in PDE amount to exponentiating
various unbounded operators. The discussion of eigenvalues, eigenvectors, and
normal forms above relies heavily on finite dimensionality, although a good deal
of it carries over to compact operators on infinite-dimensional Banach and Hilbert
spaces; see §6 of Appendix A. Also, there is a somewhat more subtle extension
of Theorem 4.5 for general self-adjoint operators on a Hilbert space, which is
discussed in §1 of Chap. 8.


Exercises 


1. In addition to the operator norm $\|A\|$ of an $n \times n$ matrix, defined by (4.5), we consider
the Hilbert-Schmidt norm $\|A\|_{HS}$, defined by

$\|A\|_{HS}^2 = \sum_{j,k} |a_{jk}|^2,$

if $A = (a_{jk})$. Show that

$\|A\| \le \|A\|_{HS}.$

(Hint: If $r_1, \dots, r_n$ are the rows of $A$, then for $u \in \mathbb{C}^n$, $Au$ has entries $r_j \cdot u$, $1 \le
j \le n$. Use Cauchy's inequality (4.9) to estimate $|r_j \cdot u|^2$.)

Show also that

$\sum_j |a_{jk}|^2 \le \|A\|^2$ for each $k$,

and hence

$\|A\|_{HS}^2 \le n\|A\|^2.$




(Hint: $\|A\| \ge \|Ae_k\|$ for each standard basis vector $e_k$.)
2. Show that, in analogy with (4.11), we have

$\|AB\|_{HS} \le \|A\|_{HS}\|B\|_{HS}.$

Indeed, show that

$\|AB\|_{HS} \le \|A\| \cdot \|B\|_{HS},$

where the first factor on the right is the operator norm $\|A\|$.

3. Let $X$ be an $n \times n$ matrix. Show that

$\det e^X = e^{\operatorname{Tr} X}.$


(Hint: Use a normal form.) 

Let $M_n$ denote the space of complex $n \times n$ matrices. If $A \in M_n$ and $\det A = 1$, we
say that $A \in SL(n, \mathbb{C})$. If $X \in M_n$ and $\operatorname{Tr} X = 0$, we say that $X \in sl(n, \mathbb{C})$.

4. Let $X \in sl(2, \mathbb{C})$. Suppose $X$ has eigenvalues $\{\lambda, -\lambda\}$, $\lambda \neq 0$. Such an $X$ can be
diagonalized, so we know that there exist matrices $Z_j \in M_2$ such that

$e^{tX} = Z_1e^{t\lambda} + Z_2e^{-t\lambda}.$

Evaluating both sides at $t = 0$, and the $t$-derivative at $t = 0$, show that $Z_1 + Z_2 =
I$, $\lambda Z_1 - \lambda Z_2 = X$, and solve for $Z_1$, $Z_2$. Deduce that

$e^{tX} = (\cosh t\lambda)I + \lambda^{-1}(\sinh t\lambda)X.$


5. Define holomorphic functions $C(z)$ and $S(z)$ by

$C(z) = \cosh\sqrt{z}, \quad S(z) = \frac{\sinh\sqrt{z}}{\sqrt{z}}.$

Deduce from Exercise 4 that, for $X \in sl(2, \mathbb{C})$,

$e^X = C(-\det X)I + S(-\det X)X.$

Show that this identity is also valid when 0 is an eigenvalue of $X$.
6. Rederive the formula above for $e^X$, $X \in sl(2, \mathbb{C})$, by using the power series for $e^X$
together with the identity

$X^2 = -(\det X)I, \quad X \in sl(2, \mathbb{C}).$

The next set of exercises examines the derivative of the map

$\operatorname{Exp} : M_n \to M_n, \quad \operatorname{Exp}(X) = e^X.$


7. Set $U(t, s) = e^{t(X + sY)}$, where $X$ and $Y$ are $n \times n$ matrices, and set $U_s = \partial U/\partial s$.
Show that $U_s$ satisfies

$\frac{\partial U_s}{\partial t} = (X + sY)U_s + YU, \quad U_s(0, s) = 0.$


8. Use Duhamel's principle, formula (4.58), to show that

$U_s(t, s) = \int_0^t e^{(t-\tau)(X+sY)}\,Y\,e^{\tau(X+sY)}\,d\tau.$

Deduce that

(4.59) $\frac{d}{ds}e^{X+sY}\Big|_{s=0} = e^X\int_0^1 e^{-\tau X}Ye^{\tau X}\,d\tau.$


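9. Given $X \in M_n$, define $\operatorname{ad} X \in \operatorname{End}(M_n)$, that is,

$\operatorname{ad} X : M_n \to M_n,$

by

$\operatorname{ad} X(Y) = XY - YX.$

Show that

$e^{-tX}Ye^{tX} = e^{-t\operatorname{ad} X}Y.$

(Hint: If $V(t)$ denotes either side, show that $dV/dt = -(\operatorname{ad} X)V$, $V(0) = Y$.)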
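10. Deduce from Exercise 8 that

(4.60) $\frac{d}{ds}e^{X+sY}\Big|_{s=0} = e^X\,\Xi(\operatorname{ad} X)Y,$

where $\Xi(z)$ is the entire holomorphic function

(4.61) $\Xi(z) = \int_0^1 e^{-\tau z}\,d\tau = \frac{1 - e^{-z}}{z}.$

The operator $\Xi(\operatorname{ad} X)$ is defined in the following manner. For any $L \in \operatorname{End}(\mathbb{C}^m) =
M_m$, any function $F(z)$ holomorphic on $|z| < a$, with $a > \|L\|$, define $F(L)$ by
power series:

(4.62) $F(L) = \sum_{n=0}^\infty f_nL^n$, where $F(z) = \sum_{n=0}^\infty f_nz^n.$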
n=0 n=0 


For further material on holomorphic functions of operators, see §6 in Appendix A. 

11. With $\operatorname{Exp} : M_n \to M_n$ as defined above, describe the set of matrices $X$ such that the
transformation $D\operatorname{Exp}(X)$ is not invertible.

12. Let $A : \mathbb{R}^n \to \mathbb{R}^n$ be symmetric, and let $Q(x) = (Ax, x)$. Let $v_1 \in S^{n-1} = \{x \in
\mathbb{R}^n : |x| = 1\}$ be a point where $Q|_{S^{n-1}}$ assumes a maximum. Show that $v_1$ is an
eigenvector of $A$.

(Hint: Show that $\nabla Q(v_1)$ is parallel to $\nabla E(v_1)$, where $E(x) = (x, x)$.)
Use this result to give an alternative proof of Corollary 4.6. Extend this argument to
establish Theorem 4.5.


In Exercises 13-15, let

$E_\lambda = \{e^{\lambda t}p(t) : p(t) \text{ polynomial}\}.$

13. Show that

(4.63) $\lambda \neq 0 \Longrightarrow D = \frac{d}{dt} : E_\lambda \to E_\lambda$ is injective.

(In fact, it is an isomorphism.)
14. Assume $K \in \mathbb{N}$ and $\{\lambda_1, \dots, \lambda_K\}$ are distinct. Let $p_j(t)$ be polynomials. Show that

(4.64) $e^{\lambda_1 t}p_1(t) + \cdots + e^{\lambda_K t}p_K(t) = 0 \Longrightarrow$ each $p_j = 0$.

Hint. Use induction on $K$. The case $K = 1$ is easy. Say $K \ge 2$ and the hypothesis in (4.64)
holds. We can arrange that $\lambda_K = 0$, so

$e^{\lambda_1 t}p_1(t) + \cdots + e^{\lambda_k t}p_k(t) = -p_K(t), \quad k = K - 1.$

Apply $D^N$ with $N > \deg p_K$. Use the inductive hypothesis in concert with (4.63) to
show that each $p_j = 0$.

15. Discuss the relevance of Exercises 13-14 to the argument establishing the direct sum 
decomposition (4.39). 


5. Variable-coefficient linear systems of ODE: Duhamel’s 
principle 


Let $A(t)$ be a continuous, $n \times n$ matrix-valued function of $t \in I$. We consider the
general linear, homogeneous ODE

(5.1) $\frac{dy}{dt} = A(t)y, \quad y(0) = y_0.$

The general theory of §2 gives local solutions. We claim that the solutions here
exist for all $t \in I$. This follows from the discussion after the proof of Theorem
2.1, together with the following estimate on the solution to (5.1).

Proposition 5.1. If $\|A(t)\| \le M$ for $t \in I$, then the solution to (5.1) satisfies

(5.2) $\|y(t)\| \le e^{M|t|}\|y_0\|.$

It suffices to prove this for $t \ge 0$. Then $z(t) = e^{-Mt}y(t)$ satisfies

(5.3) $z' = C(t)z, \quad z(0) = y_0,$

with $C(t) = A(t) - M$. Hence $C(t)$ satisfies

(5.4) $\operatorname{Re}(C(t)u, u) \le 0$, for all $u \in \mathbb{C}^n$.

Thus (5.2) is a consequence of the following energy estimate, which is of independent
interest.


Proposition 5.2. If $z$ solves (5.3) and if (5.4) holds for $C(t)$, then

$\|z(t)\| \le \|z(0)\|$, for $t \ge 0$.

Proof. We have

(5.5) $\frac{d}{dt}\|z(t)\|^2 = (z'(t), z(t)) + (z(t), z'(t)) = 2\operatorname{Re}(C(t)z(t), z(t)) \le 0.$

Thus we have global existence for (5.1). There is a matrix-valued function
$S(t, s)$ such that the unique solution to (5.1) satisfies

(5.6) $y(t) = S(t, s)y(s).$

Uniqueness results readily yield

(5.7) $S(t, s) = S(t, \sigma)S(\sigma, s),$

for all $t, s, \sigma \in I$. In particular $S(t, \sigma) = S(\sigma, t)^{-1}$. Hence, if $t_0 \in I$,

(5.8) $M(t) = S(t, t_0) \Longrightarrow S(t, s) = M(t)M(s)^{-1}.$
Using this solution operator, we can treat the nonhomogeneous equation

(5.9) $y' = A(t)y + b(t), \quad y(t_0) = y_0.$

Indeed, if we set $y(t) = M(t)x(t)$, then

(5.10) $y'(t) = M'(t)x(t) + M(t)x'(t) = A(t)y(t) + M(t)x'(t),$

so (5.9) is equivalent to $M(t)x'(t) = b(t)$, hence

(5.11) $x'(t) = M(t)^{-1}b(t), \quad x(t_0) = y_0.$

Integrating and applying $M(t)$ gives

(5.12) $y(t) = M(t)y_0 + M(t)\int_{t_0}^t M(s)^{-1}b(s)\,ds.$


We hence have the following identity, known as Duhamel’s principle. 


Proposition 5.3. Let $A : I \to M(n, \mathbb{C})$ and $b : I \to \mathbb{C}^n$ be continuous, and
assume $t_0 \in I$. Then the solution to (5.9) is given by

(5.13) $y(t) = S(t, t_0)y_0 + \int_{t_0}^t S(t, s)b(s)\,ds.$


Next we prove an identity that might be called the "noncommutative fundamental
theorem of calculus."


Proposition 5.4. Assume $0 \in I$. If $A(t)$ is a continuous matrix function and
$S(t, 0)$ is defined as above, then, for $t > 0$, $t \in I$,

(5.14) $S(t, 0) = \lim_{n \to \infty} e^{(t/n)A((n-1)t/n)} \cdots e^{(t/n)A(0)},$

where there are $n$ factors on the right.


Proof. To prove this at $t = T$, divide the interval $[0, T]$ into $n$ equal parts. Set
$y = S(t, 0)y_0$, and define $z_n(t)$ by $z_n(0) = y_0$ and

(5.15) $z_n' = A(jT/n)z_n$, for $t \in (jT/n, (j + 1)T/n)$,

requiring continuity across each endpoint of these intervals. We see that

(5.16) $z_n' = A(t)z_n + R_n(t),$

with

(5.17) $\|R_n(t)\| \le \delta_n\|z_n(t)\|, \quad \delta_n \to 0$ as $n \to \infty$.

Meanwhile we see that $\|z_n(t)\| \le C_T\|y_0\|$ on $[0, T]$. We want to compare $z_n(t)$
and $y(t)$. We have

(5.18) $\frac{d}{dt}(z_n - y) = A(t)(z_n - y) + R_n(t); \quad z_n(0) - y(0) = 0.$

Hence Duhamel's principle gives

(5.19) $z_n(t) - y(t) = \int_0^t S(t, s)R_n(s)\,ds,$

and since we have an a priori bound $\|S(t, s)\| \le K$ for $|s|, |t| \le T$, we get

(5.20) $\|z_n(t) - y(t)\| \le KTC_T\delta_n\|y_0\| \to 0$ as $n \to \infty$, $|t| \le T$.

In particular, $z_n(T) \to y(T)$ as $n \to \infty$. Since $z_n(T)$ is given by the right side of
(5.14) with $t = T$, this proves (5.14).
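The product formula (5.14) can be explored numerically. The following sketch is an illustration under our own assumptions: the helper names are hypothetical, matrix exponentials are computed by a truncated power series (reasonable here because each factor exponentiates a matrix of small norm), and a classical fourth-order Runge-Kutta integration of $S' = A(t)S$, $S(0) = I$, is used only as an independent reference value for $S(t, 0)$.

```python
def mat_mul(A, B):
    """Product of two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def axpy(X, Y, c):
    """Return X + c*Y entrywise."""
    return [[x + c * y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def identity(n):
    return [[float(i == j) for j in range(n)] for i in range(n)]

def expm(A, terms=20):
    """Truncated power series for e^A (intended for matrices of small norm)."""
    result, term = identity(len(A)), identity(len(A))
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, A)]
        result = axpy(result, term, 1.0)
    return result

def product_formula(A, t, n):
    """The n-factor product in (5.14), before taking the limit."""
    S = identity(len(A(0.0)))
    for j in range(n):
        step = expm([[(t / n) * a for a in row] for row in A(j * t / n)])
        S = mat_mul(step, S)          # later factors multiply on the left
    return S

def reference_solution_operator(A, t, steps=2000):
    """RK4 approximation to S(t, 0), solving S' = A(t) S, S(0) = I."""
    h = t / steps
    S = identity(len(A(0.0)))
    for k in range(steps):
        s = k * h
        k1 = mat_mul(A(s), S)
        k2 = mat_mul(A(s + h / 2), axpy(S, k1, h / 2))
        k3 = mat_mul(A(s + h / 2), axpy(S, k2, h / 2))
        k4 = mat_mul(A(s + h), axpy(S, k3, h))
        S = axpy(S, axpy(axpy(axpy(k1, k2, 2.0), k3, 2.0), k4, 1.0), h / 6)
    return S

# A(t) does not commute with A(s) for s != t, so order in the product matters.
A = lambda t: [[0.0, 1.0], [-(1.0 + t), 0.0]]
P = product_formula(A, 1.0, 2000)
R = reference_solution_operator(A, 1.0)
```

For this choice of $A(t)$ the estimate (5.20) applies with $\delta_n \le 1/n$, so the entries of `P` and `R` should agree to roughly two decimal places or better at $n = 2000$.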
Exercises


1. Let $A(t)$ and $X(t)$ be $n \times n$ matrices satisfying

$\frac{dX}{dt} = A(t)X.$

We form the Wronskian $W(t) = \det X(t)$. Show that $W$ satisfies the ODE

$\frac{dW}{dt} = a(t)W, \quad a(t) = \operatorname{Tr} A(t).$

(Hint: Use Exercise 2 of §1 to write $dW/dt = \operatorname{Tr}(\operatorname{Cof}(X)^t\,dX/dt)$, and use Cramer's
formula, $(\det X)X^{-1} = \operatorname{Cof}(X)^t$. Alternative: Write $X(t + h) = e^{hA(t)}X(t) +
O(h^2)$ and use Exercise 3 of §4 to write $\det e^{hA(t)} = e^{ha(t)}$, hence $W(t + h) =
e^{ha(t)}W(t) + O(h^2)$.)

2. Let $u(t) = \|y(t)\|^2$, for a solution $y$ to (5.1). Show that

(5.21) $u' \le M(t)u(t),$

provided $\|A(t)\| \le M(t)/2$. Such a differential inequality implies the integral
inequality

(5.22) $u(t) \le A + \int_0^t M(s)u(s)\,ds, \quad t \ge 0,$

with $A = u(0)$. The following is a Gronwall inequality; namely, if (5.22) holds for a
real-valued function $u$, then provided $M(s) \ge 0$, we have, for $t \ge 0$,

(5.23) $u(t) \le Ae^{N(t)}, \quad N(t) = \int_0^t M(s)\,ds.$

Prove this. Note that the quantity dominating $u(t)$ in (5.23) is equal to $U$, solving
$U(0) = A$, $dU/dt = M(t)U(t)$.


3. Generalize the Gronwall inequality of Exercise 2 as follows. Assume $F(t, u)$ and
$\partial_uF(t, u)$ are continuous, let $U$ be a real-valued solution to

(5.24) $U' = F(t, U), \quad U(0) = A,$

and let $u$ satisfy the integral inequality

(5.25) $u(t) \le A + \int_0^t F(s, u(s))\,ds.$

Then prove that

(5.26) $u(t) \le U(t)$, for $t \ge 0$,

provided $\partial F/\partial u \ge 0$. Show that this continues to hold if we replace (5.24) by

(5.19a) $U(t) \ge A + \int_0^t F(s, U(s))\,ds.$

(Hint: Set $v = u - U$. Then (5.19a) and (5.25) imply

$v(t) \le \int_0^t [F(s, u(s)) - F(s, U(s))]\,ds = \int_0^t M(s)v(s)\,ds,$

where

$M(s) = \int_0^1 F_u(s, \tau u(s) + (1 - \tau)U(s))\,d\tau.$

Thus (5.22) applies, with $A = 0$.)

4. Let $x(t)$ be a smooth curve in $\mathbb{R}^3$; assume it is parameterized by arc length, so $T(t) =
x'(t)$ has unit length; $T(t) \cdot T(t) = 1$. Differentiating, we have $T'(t) \perp T(t)$. The
curvature is defined to be $\kappa(t) = \|T'(t)\|$. If $\kappa(t) \neq 0$, we set $N(t) = T'/\|T'\|$, so

$T' = \kappa N,$

and $N$ is a unit vector orthogonal to $T$. We define $B(t)$ by

(5.22) $B = T \times N.$

Note that $(T, N, B)$ form an orthonormal basis of $\mathbb{R}^3$ for each $t$, and

(5.23) $T = N \times B, \quad N = B \times T.$

By (5.22) we have $B' = T \times N'$. Deduce that $B'$ is orthogonal to both $T$ and $B$, hence
parallel to $N$. We set

$B' = -\tau N,$

for smooth $\tau(t)$, called the torsion.
5. From $N' = B' \times T + B \times T'$ and the formulas for $T'$ and $B'$ given in Exercise 4,
deduce the following system, called the Frenet-Serret formula:

(5.24) $T' = \kappa N, \quad N' = -\kappa T + \tau B, \quad B' = -\tau N.$

Form the $3 \times 3$ matrix

(5.25) $A(t) = \begin{pmatrix} 0 & -\kappa & 0 \\ \kappa & 0 & -\tau \\ 0 & \tau & 0 \end{pmatrix},$

and deduce that the $3 \times 3$ matrix $F(t)$ whose columns are $T$, $N$, $B$,

$F = (T, N, B),$

satisfies the ODE

$F' = FA(t).$


6. Derive the following converse to the Frenet-Serret formula. Let $T(0)$, $N(0)$, and $B(0)$
be an orthonormal set in $\mathbb{R}^3$, such that $B(0) = T(0) \times N(0)$; let $\kappa(t)$ and $\tau(t)$ be given
smooth functions; and solve the system (5.24). Show that there is a unique curve $x(t)$
such that $x(0) = 0$ and $T(t)$, $N(t)$, and $B(t)$ are associated to $x(t)$ by the construction
in Exercise 4, so in particular the curve has curvature $\kappa(t)$ and torsion $\tau(t)$.
(Hint: To prove that (5.22) and (5.23) hold for all $t$, consider the next exercise.)

7. Let $A(t)$ be a smooth, $n \times n$ real matrix function that is skew-adjoint for all $t$ (of which
(5.25) is an example). Suppose $F(t)$ is a real $n \times n$ matrix function satisfying

$F' = FA(t).$

If $F(0)$ is an orthogonal matrix, show that $F(t)$ is orthogonal for all $t$.
(Hint: Set $J(t) = F(t)^*F(t)$. Show that $J(t)$ and $J_0(t) = I$ both solve the initial-value
problem

$J' = [J, A(t)], \quad J(0) = I.$)


8. Let $U_1 = T$, $U_2 = N$ and $U_3 = B$, and set $\omega(t) = \tau T + \kappa B$. Show that (5.24) is
equivalent to $U_j' = \omega \times U_j$, $1 \le j \le 3$.
9. Suppose $\tau$ and $\kappa$ are constant. Show that $\omega$ is constant, so $T(t)$ satisfies the
constant-coefficient ODE

$T'(t) = \omega \times T(t).$

Note that $\omega \cdot T(0) = \tau$. Show that after a translation and rotation, $x(t)$ takes the form

$\gamma(t) = (\lambda^{-2}\kappa\cos\lambda t,\ \lambda^{-2}\kappa\sin\lambda t,\ \lambda^{-1}\tau t), \quad \lambda^2 = \kappa^2 + \tau^2.$


6. Dependence of solutions on initial data and on other 
parameters 


We consider how a solution to an ODE depends on the initial conditions. Consider 
a nonlinear system 


(6.1) $y' = F(y), \quad y(0) = x.$

As noted in §2, we can consider an autonomous system, such as (6.1), without
loss of generality. Suppose $F : U \to \mathbb{R}^n$ is smooth, $U \subset \mathbb{R}^n$ open; for simplicity
we assume $U$ is convex. Say $y = y(t, x)$. We want to examine smoothness in $x$.
Note that formally differentiating (6.1) with respect to $x$ suggests that $W =
D_xy(t, x)$ satisfies an ODE called the linearization of (6.1):

(6.2) $W' = DF(y)W, \quad W(0) = I.$

In other words, $w(t, x) = D_xy(t, x)w_0$ satisfies

(6.3) $w' = DF(y)w, \quad w(0) = w_0.$

To justify this, we want to compare $w(t)$ and

(6.4) $z(t) = y_1(t) - y(t) = y(t, x + w_0) - y(t, x).$

In fact, $D_xy(t, x)$ is defined by

(6.5) $y(t, x + w_0) = y(t, x) + D_xy(t, x)w_0 + o(\|w_0\|),$

so to show that $D_xy(t, x)w_0$ exists and is equal to the solution to (6.3), we need
to show that

(6.6) $z(t) - w(t) = o(\|w_0\|).$


It will be convenient to show that $z$ satisfies an ODE similar to (6.3). Indeed,
$z(t)$ satisfies

(6.7) $z' = F(y_1) - F(y) = \Phi(y_1, y)z, \quad z(0) = w_0,$

where

(6.8) $\Phi(y_1, y) = \int_0^1 DF(\tau y_1 + (1 - \tau)y)\,d\tau.$


If we assume that

(6.9) $\|DF(u)\| \le M$, for $u \in U$,

then the solution operator $S(t, 0)$ of the linear ODE $d/dt - B(t)$, with $B(t) =
\Phi(y_1(t), y(t))$, satisfies a bound $\|S(t, 0)\| \le e^{|t|M}$ as long as $y(t)$ and $y_1(t)$
belong to $U$. Hence

(6.10) $\|y_1(t) - y(t)\| \le e^{|t|M}\|w_0\|.$

This establishes that $y(t, x)$ is Lipschitz in $x$.
To continue, since $\Phi(y, y) = DF(y)$, we rewrite (6.7) as

(6.11) $z' = \Phi(y + z, y)z = DF(y)z + R(y, z), \quad z(0) = w_0,$

where

(6.12) $F \in C^1(U) \Longrightarrow \|R(y, z)\| = o(\|z\|) = o(\|w_0\|).$


Now comparing (6.11) with (6.3), we have

(6.13) $\frac{d}{dt}(z - w) = DF(y)(z - w) + R(y, z), \quad z(0) - w(0) = 0.$

Then Duhamel's principle yields

(6.14) $z(t) - w(t) = \int_0^t S(t, s)R(y(s), z(s))\,ds,$

so by the bound $\|S(t, s)\| \le e^{|t-s|M}$ and (6.12), we have (6.6).

This is precisely what is required to show that $y(t, x)$ is differentiable with
respect to $x$, with derivative $W = D_xy(t, x)$ satisfying (6.2). We state our first
result.

Proposition 6.1. If $F \in C^1(U)$, and if solutions to (6.1) exist for $t \in (-T_0, T_1)$,
then for each such $t$, $y(t, x)$ is $C^1$ in $x$, with derivative $D_xy(t, x) = W(t, x)$
satisfying (6.2).


So far we have shown that $y(t, x)$ is both Lipschitz and differentiable in $x$, but
the continuity of $W(t, x)$ in $x$ follows easily by comparing the ODEs of the form
(6.2) for $W(t, x)$ and $W(t, x + w_0)$, in the spirit of the analysis of (6.13).
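The relation between the flow and its linearization can be checked on a scalar example. The sketch below is our own illustration, with hypothetical function names: it integrates $y' = F(y)$ together with the linearized equation $W' = DF(y)W$, $W(0) = 1$, by a classical Runge-Kutta scheme, and compares $W$ with a difference quotient of $y(t, x)$ in $x$. The test case $F(y) = y^2$ is chosen because $y(t, x) = x/(1 - xt)$ and $D_xy(t, x) = (1 - xt)^{-2}$ are known in closed form.

```python
def rk4(rhs, state, t_end, steps=1000):
    """Classical RK4 for an autonomous system state' = rhs(state)."""
    h = t_end / steps
    for _ in range(steps):
        k1 = rhs(state)
        k2 = rhs([s + 0.5 * h * k for s, k in zip(state, k1)])
        k3 = rhs([s + 0.5 * h * k for s, k in zip(state, k2)])
        k4 = rhs([s + h * k for s, k in zip(state, k3)])
        state = [s + h / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4)]
    return state

def flow_and_linearization(F, dF, x, t_end):
    """Solve y' = F(y), W' = dF(y) W with y(0) = x, W(0) = 1 (scalar case)."""
    def rhs(state):
        y, W = state
        return [F(y), dF(y) * W]
    return rk4(rhs, [x, 1.0], t_end)

F = lambda y: y * y
dF = lambda y: 2.0 * y

# Closed form at x = 0.5, t = 1: y = 1 and D_x y = 4.
y, W = flow_and_linearization(F, dF, 0.5, 1.0)

# Difference quotient in the initial condition, to compare with W.
h = 1e-6
y_shift, _ = flow_and_linearization(F, dF, 0.5 + h, 1.0)
difference_quotient = (y_shift - y) / h
```

Both `W` and `difference_quotient` approximate $D_xy(1, 0.5) = 4$, the former far more accurately, since it does not suffer from the $O(h)$ error of the one-sided difference.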

If $F$ possesses further smoothness, we can obtain higher differentiability of
$y(t, x)$ in $x$ by the following trick. Couple (6.1) and (6.2), to get an ODE for
$(y, W)$:

(6.15) $y' = F(y), \quad W' = DF(y)W,$

with initial conditions

(6.16) $y(0) = x, \quad W(0) = I.$

We can reiterate the preceding argument, getting results on $D_x(y, W)$, that is, on
$D_x^2y(t, x)$, and continue, proving:

Proposition 6.2. If $F \in C^k(U)$, then $y(t, x)$ is $C^k$ in $x$.
Similarly, we can consider dependence of the solution to a system of the form

(6.17) $\frac{dy}{dt} = F(\tau, y), \quad y(0) = x$

on a parameter $\tau$, assuming $F$ is smooth jointly in $\tau, y$. This result can be deduced
from the previous one by the following trick: Consider the ODE

(6.18) $y' = F(z, y), \quad z' = 0; \quad y(0) = x, \quad z(0) = \tau.$

Thus we get smoothness of $y(t, \tau, x)$ in $(\tau, x)$. As one special case, let $F(\tau, y) =
\tau F(y)$. In this case $y(t_0, \tau, x) = y(\tau t_0, 1, x)$, so we can improve the conclusion
of Proposition 6.2 to the following:

(6.19) $F \in C^k(U) \Longrightarrow y \in C^k$ jointly in $(t, x)$.

It is also true that if $F$ is analytic, then one has the analytic dependence of
solutions on parameters, especially on $t$, so that power-series techniques work in
that case. One approach to the proof of this is given in the exercises below, and
another at the end of §9.


Exercises 


1. Let $\Omega$ be open in $\mathbb{R}^{2n}$, identified with $\mathbb{C}^n$, via $z = x + iy$. Let $X : \Omega \to \mathbb{R}^{2n}$
have components $X = (a_1, \dots, a_n, b_1, \dots, b_n)$, where $a_j(x, y)$ and $b_j(x, y)$ are
real-valued. Denote the solution to $du/dt = X(u)$, $u(0) = z$ by $u(t, z)$. Assume $f_j(z) =
a_j(z) + ib_j(z)$ is holomorphic in $z$, that is, its derivative commutes with $J$, acting on
$\mathbb{R}^{2n} = \mathbb{C}^n$ as multiplication by $i$. Show that, for each $t$, $u(t, z)$ is holomorphic in $z$,
that is, $D_zu(t, z)$ commutes with $J$.

(Hint: Use the linearized equation (6.2) to show that $K(t) = [W(t), J]$ satisfies the
ODE

$K' = DX(z)K, \quad K(0) = 0.$)


2. If $O \subset \mathbb{R}^n$ is open and $F : O \to \mathbb{R}^n$ is real analytic, show that the solution $y(t, x)$ to
(6.1) is real analytic in $x$.
(Hint: With $F = (a_1, \dots, a_n)$, take holomorphic extensions $f_j(z)$ of $a_j(x)$ and use
Exercise 1.)
Using the trick leading to (6.19), show that $y(t, x)$ is real analytic jointly in $(t, x)$.


In the next set of problems, consider a linear ODE of the form

(6.20) $A(x)\frac{du}{dx} = B(x)u, \quad 0 < x < 1,$

where we assume that the $n \times n$ matrix functions $A$ and $B$ have holomorphic extensions
to $\Delta = \{z \in \mathbb{C} : |z| < 1\}$, such that $\det A(z) = 0$ at $z = 0$, but at no other
point of $\Delta$. We say $z = 0$ is a singular point. Let $u_1(x), \dots, u_n(x)$ be $n$ linearly
independent solutions to (6.20), obtained, for example, by specifying $u$ at $x = 1/2$.
3. Show that each $u_j$ has a unique holomorphic extension to the universal covering surface
$M$ of $\Delta \setminus 0$, and show that there are $c_{jk} \in \mathbb{C}$ such that

$u_j(e^{2\pi i}x) = \sum_k c_{jk}u_k(x), \quad 0 < x < 1.$
k 


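4. Suppose the matrix $C = (c_{jk})$ is diagonalizable, with eigenvalues $\lambda_\ell \in \mathbb{C}$, $1 \le \ell \le
n$. Show that there is a basis of solutions $v_\ell$ to (6.20) such that

$v_\ell(e^{2\pi i}x) = \lambda_\ell v_\ell(x),$

and hence, picking $\alpha_\ell \in \mathbb{C}$ such that $e^{2\pi i\alpha_\ell} = \lambda_\ell$,

$v_\ell(x) = x^{\alpha_\ell}w_\ell(x); \quad w_\ell$ holomorphic on $\Delta \setminus 0$.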


5. Suppose $\|A(z)^{-1}B(z)\| \le K|z|^{-1}$. Show that $\|v_\ell(z)\| \le C|z|^{-K}$. Deduce that each
$w_\ell(z)$ has at most a pole at $z = 0$; hence, shifting $\alpha_\ell$ by an integer, we can assume
that $w_\ell$ is holomorphic on $\Delta$. (Hint: Recall the statement of Gronwall's inequality, in
Exercises 2 and 3 of §5.)
6. Suppose that instead of $C$ being diagonalizable, it has the Jordan normal form

$\begin{pmatrix} \lambda & 1 \\ 0 & \lambda \end{pmatrix}$

(in case $n = 2$). What can you say? Generalize.

7. If $a(z)$ and $b(z)$ are holomorphic on $\Delta$, convert

$x^2u''(x) + xa(x)u'(x) + b(x)u(x) = 0$

to a first-order system to which Exercises 3-6 apply. (Hint. Take $v = xu'$ rather than
$v = u'$.)
The next set of exercises deals with certain small perturbations of the system $\dot{x} =
-y$, $\dot{y} = x$, whose solution curves are circles centered at the origin.
8. Let $x = x_\varepsilon(t)$, $y = y_\varepsilon(t)$ solve

$\dot{x} = -y + \varepsilon(x^2 + y^2), \quad \dot{y} = x,$

with initial data $x(0) = 1$, $y(0) = 0$. Knowing smooth dependence on $\varepsilon$, find ODEs
for the coefficients $x_j(t)$, $y_j(t)$ in power-series expansions

$x_\varepsilon(t) = x_0(t) + \varepsilon x_1(t) + \varepsilon^2x_2(t) + \cdots, \quad y_\varepsilon(t) = y_0(t) + \varepsilon y_1(t) + \varepsilon^2y_2(t) + \cdots.$

9. Making use of the substitution $\xi(t) = -x(-t)$, $\eta(t) = y(-t)$, show that, for fixed
initial data and $\varepsilon$ sufficiently small, the orbits of the ODE in Exercise 8 are periodic.
10. Show that, for $\varepsilon$ small, the period of the orbit in Exercise 8 is a smooth function of $\varepsilon$.
Compute the first three terms in its power-series expansion.


7. Flows and vector fields 

Let $U \subset \mathbb{R}^n$ be open. A vector field on $U$ is a smooth map

(7.1) $X : U \to \mathbb{R}^n.$

Consider the corresponding ODE:

(7.2) $y' = X(y), \quad y(0) = x,$

with $x \in U$. A curve $y(t)$ solving (7.2) is called an integral curve of the vector
field $X$. It is also called an orbit. For fixed $t$, write

(7.3) $y = y(t, x) = F_X^t(x).$

The locally defined $F_X^t$, mapping (a subdomain of) $U$ to $U$, is called the flow
generated by the vector field $X$.




The vector field $X$ defines a differential operator on scalar functions, as
follows:

(7.4) $L_Xf(x) = \lim_{h \to 0} h^{-1}\bigl[f(F_X^hx) - f(x)\bigr] = \frac{d}{dt}f(F_X^tx)\Big|_{t=0}.$

We also use the common notation

(7.5) $L_Xf(x) = Xf,$


that is, we apply X to f as a first-order differential operator. 
Note that if we apply the chain rule to (7.4) and use (7.2), we have

(7.6) $L_Xf(x) = X(x) \cdot \nabla f(x) = \sum_j a_j(x)\frac{\partial f}{\partial x_j},$

if $X = \sum_j a_j(x)e_j$, with $\{e_j\}$ the standard basis of $\mathbb{R}^n$. In particular, using the
notation (7.5), we have

(7.7) $a_j(x) = Xx_j.$

In the notation (7.5),

(7.8) $X = \sum_j a_j(x)\frac{\partial}{\partial x_j}.$
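The equality of the two expressions for $L_Xf$ in (7.4) and (7.6) can be checked numerically. In the sketch below, which is only an illustration, the function names, the rotation field $X = -y\,\partial/\partial x + x\,\partial/\partial y$, and the test function $f(x, y) = x^2y$ are our own choices; the flow $F_X^h$ is approximated by a classical Runge-Kutta integration of (7.2) over the short time $h$.

```python
def flow(X, p, t, steps=20):
    """Approximate F_X^t(p) by RK4 applied to y' = X(y), y(0) = p."""
    h = t / steps
    p = list(p)
    for _ in range(steps):
        k1 = X(p)
        k2 = X([a + 0.5 * h * b for a, b in zip(p, k1)])
        k3 = X([a + 0.5 * h * b for a, b in zip(p, k2)])
        k4 = X([a + h * b for a, b in zip(p, k3)])
        p = [a + h / 6 * (b1 + 2 * b2 + 2 * b3 + b4)
             for a, b1, b2, b3, b4 in zip(p, k1, k2, k3, k4)]
    return p

def lie_derivative_via_flow(X, f, p, h=1e-5):
    """Difference quotient from (7.4): h^{-1} [f(F_X^h p) - f(p)]."""
    return (f(flow(X, p, h)) - f(p)) / h

def lie_derivative_via_gradient(X, grad_f, p):
    """Formula (7.6): X(p) . grad f(p)."""
    return sum(a * g for a, g in zip(X(p), grad_f(p)))

X = lambda p: [-p[1], p[0]]                    # rotation field
f = lambda p: p[0] ** 2 * p[1]                 # f(x, y) = x^2 y
grad_f = lambda p: [2 * p[0] * p[1], p[0] ** 2]

point = [1.0, 2.0]
by_flow = lie_derivative_via_flow(X, f, point)
by_gradient = lie_derivative_via_gradient(X, grad_f, point)
```

At the point $(1, 2)$ formula (7.6) gives $Xf = -2xy^2 + x^3 = -7$, and the difference quotient agrees with this up to an error of order $h$.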
We note that $X$ is a derivation, that is, a map on $C^\infty(U)$, linear over $\mathbb{R}$, satisfying

(7.9) $X(fg) = (Xf)g + f(Xg).$

Conversely, any derivation on $C^\infty(U)$ defines a vector field, namely, has the form
(7.8), as we now show.


Proposition 7.1. If $X$ is a derivation on $C^\infty(U)$, then $X$ has the form (7.8).

Proof. Set $a_j(x) = Xx_j$, $X^\# = \sum_j a_j(x)\,\partial/\partial x_j$, and $Y = X - X^\#$. Then $Y$ is
a derivation satisfying $Yx_j = 0$ for each $j$; we aim to show that $Yf = 0$ for all
$f$. Note that whenever $Y$ is a derivation,

$1 \cdot 1 = 1 \Rightarrow Y \cdot 1 = 2Y \cdot 1 \Rightarrow Y \cdot 1 = 0,$

that is, $Y$ annihilates constants. Thus, in this case $Y$ annihilates all polynomials
of degree $\le 1$.

Now we show that $Yf(p) = 0$ for all $p \in U$. Without loss of generality, we can
suppose $p = 0$, the origin. Then, by (1.8), we can take $b_j(x) = \int_0^1 (\partial_jf)(tx)\,dt$,
and write

$f(x) = f(0) + \sum_j b_j(x)x_j.$

Since $Y$ annihilates constants and $Yx_j = 0$, the derivation property gives $Yf(x) = \sum_j (Yb_j)(x)x_j$.
It immediately follows that $Yf$ vanishes at 0, so the proposition is proved.


If U is a manifold, it is natural to regard a vector field X as a section of the 
tangent bundle of U, as explained in Appendix B. Of course, the characterization 
given in Proposition 7.1 makes good invariant sense on a manifold. 

A fundamental fact about vector fields is that they can be "straightened out"
near points where they do not vanish. To see this, suppose a smooth vector field
$X$ is given on $U$ such that, for a certain $p \in U$, $X(p) \neq 0$. Then near $p$ there is a
hypersurface $M$ that is nowhere tangent to $X$. We can choose coordinates near $p$
so that $p$ is the origin and $M$ is given by $\{x_n = 0\}$. Thus, we can identify a point
$x' \in \mathbb{R}^{n-1}$ near the origin with $x' \in M$. We can define a map

(7.10) $F : M \times (-t_0, t_0) \to U$

by

(7.11) $F(x', t) = F_X^t(x').$

This is $C^\infty$ and has surjective derivative and so by the inverse function theorem
is a local diffeomorphism. This defines a new coordinate system near $p$, in which
the flow generated by $X$ has the form

(7.12) $F_X^s(x', t) = (x', t + s).$


If we denote the new coordinates by $(u_1, \dots, u_n)$, we see that the following result
is established.

Theorem 7.2. If $X$ is a smooth vector field on $U$ with $X(p) \neq 0$, then there
exists a coordinate system $(u_1, \dots, u_n)$ centered at $p$ (so $u_j(p) = 0$) with respect
to which

(7.13) $X = \frac{\partial}{\partial u_n}.$


We now make some elementary comments on vector fields in the plane. Here
the object is to find the integral curves of

(7.14) $f(x, y)\frac{\partial}{\partial x} + g(x, y)\frac{\partial}{\partial y},$

that is, to solve

(7.15) $\dot{x} = f(x, y), \quad \dot{y} = g(x, y).$

This implies

(7.16) $\frac{dy}{dx} = \frac{g(x, y)}{f(x, y)},$

or, written in differential-form notation (which will be discussed more thoroughly
in §13),

(7.17) $g(x, y)\,dx - f(x, y)\,dy = 0.$
Suppose we manage to find an explicit solution to (7.16):

(7.18) $y = \varphi(x), \quad x = \psi(y).$

Often it is not feasible to do so, but ODE texts frequently give methods for doing
so in some cases. Then the original system becomes

(7.19) $\dot{x} = f(x, \varphi(x)), \quad \dot{y} = g(\psi(y), y).$

In other words, we have reduced ourselves to integrating vector fields on the line.
We have

(7.20) $\int \bigl[f(x, \varphi(x))\bigr]^{-1}\,dx = t + C_1, \qquad \int \bigl[g(\psi(y), y)\bigr]^{-1}\,dy = t + C_2.$


If (7.18) can be explicitly achieved, it may be that one integral or the other in 
(7.20) is easier to evaluate. With either x or y solved as a function of t, the other 
is determined by (7.18). 

One case when the planar vector field can be integrated explicitly (locally) is
when there is a smooth $u$, with nonvanishing gradient, explicitly given, such that

(7.21)  $Xu = 0$,

where $X$ is the vector field (7.14). One says $u$ is a conserved quantity. In such
a case, let $w$ be any smooth function such that $(u,w)$ form a local coordinate
system. In this coordinate system,

(7.22)  $X = b(u,w)\dfrac{\partial}{\partial w}$

by (7.7), so

(7.23)  $Xv = 1$,

with

(7.24)  $v(u,w) = \displaystyle\int_{w_0}^{w} b(u,s)^{-1}\,ds$,

and the local coordinate system $(u,v)$ linearizes $X$.
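
As a numerical illustration of (7.21), here is a minimal sketch that integrates the
rotation field $X = -y\,\partial/\partial x + x\,\partial/\partial y$ and monitors the
conserved quantity $u = x^2 + y^2$. The field, the Runge-Kutta integrator, and all
function names are assumptions chosen for the example; they are not taken from the text.

```python
def rk4_step(field, p, h):
    """One classical Runge-Kutta step for p' = field(p) in the plane."""
    def shifted(a, k, s):
        return (a[0] + s * k[0], a[1] + s * k[1])
    k1 = field(p)
    k2 = field(shifted(p, k1, h / 2))
    k3 = field(shifted(p, k2, h / 2))
    k4 = field(shifted(p, k3, h))
    return (p[0] + h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6,
            p[1] + h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6)


def X(p):
    """Illustrative planar field: X = -y d/dx + x d/dy (rotation)."""
    x, y = p
    return (-y, x)


def u(p):
    """Candidate conserved quantity; Xu = -y*2x + x*2y = 0."""
    return p[0] ** 2 + p[1] ** 2


def drift_of_u(p0, h=1e-3, steps=2000):
    """Largest deviation of u from its initial value along the computed orbit."""
    p, worst = p0, 0.0
    for _ in range(steps):
        p = rk4_step(X, p, h)
        worst = max(worst, abs(u(p) - u(p0)))
    return worst
```

In this example one may take $w = \theta$ (the polar angle), for which $X = \partial/\partial\theta$,
so $b \equiv 1$ in (7.22) and $v = \theta$ in (7.24).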


Exercises 


1. Suppose $h(x,y)$ is homogeneous of degree 0, that is, $h(rx, ry) = h(x,y)$, so
$h(x,y) = k(x/y)$. Show that the ODE

   $\dfrac{dy}{dx} = h(x,y)$

is changed to a separable ODE for $u = u(x)$, if $u = y/x$.


2. Using Exercise 1, discuss constructing the integral curves of a vector field

   $X = f(x,y)\dfrac{\partial}{\partial x} + g(x,y)\dfrac{\partial}{\partial y}$

when $f(x,y)$ and $g(x,y)$ are homogeneous of degree $a$, that is,

   $f(rx, ry) = r^a f(x,y)$ for $r > 0$,

and similarly for $g$.
3. Describe the integral curves of 
6) 


0 
2 2 


4. Describe the integral curves of

   $A(x,y)\dfrac{\partial}{\partial x} + B(x,y)\dfrac{\partial}{\partial y}$

when $A(x,y) = a_1 x + a_2 y + a_3$, $B(x,y) = b_1 x + b_2 y + b_3$.
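5. Let $X = f(x,y)(\partial/\partial x) + g(x,y)(\partial/\partial y)$ be a vector field on a
disc $\Omega \subset \mathbb{R}^2$. Suppose that $\operatorname{div} X = 0$, that is,
$\partial f/\partial x + \partial g/\partial y = 0$. Show that a function $u(x,y)$ such that

   $\dfrac{\partial u}{\partial x} = g, \qquad \dfrac{\partial u}{\partial y} = -f$

is given by a line integral. Show that $Xu = 0$, and hence integrate $X$.
Reconsider this problem after reading §13.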
6. Find the integral curves of the vector field

   $X = (2xy + y^2 + 1)\dfrac{\partial}{\partial x} + (x^2 + 1 - y^2)\dfrac{\partial}{\partial y}$.


7. Show that

   $\operatorname{div}(e^{v}X) = e^{v}(\operatorname{div} X + Xv)$.

Hence, if $X$ is a vector field on $\Omega \subset \mathbb{R}^2$, as in Exercise 5, show that
you can integrate $X$ if you can construct a function $v(x,y)$ such that
$Xv = -\operatorname{div} X$. Construct such $v$ if either

   $\dfrac{\operatorname{div} X}{f(x,y)} = \varphi(x)$ or $\dfrac{\operatorname{div} X}{g(x,y)} = \psi(y)$.

For now, we define $\operatorname{div} X = \partial X_1/\partial x_1 + \cdots + \partial X_n/\partial x_n$.
See Chap. 2, §2, for another definition.


8. Find the integral curves of the vector field 


a 
2 
D5," 


X = 2ny t (e? +y 


Let $X$ be a vector field on $\mathbb{R}^n$, with a critical point at 0, that is, $X(0) = 0$.
Suppose that for $x \in \mathbb{R}^n$ near 0,

(7.25)  $X(x) = Ax + R(x), \quad \|R(x)\| = O(\|x\|^2)$,

where $A$ is an $n \times n$ matrix. We call $Ax$ the linearization of $X$ at 0.

9. Suppose all the eigenvalues of $A$ have negative real part. Construct a quadratic
polynomial $Q : \mathbb{R}^n \to [0, \infty)$, such that $Q(0) = 0$,
$(\partial^2 Q/\partial x_j \partial x_k)$ is positive-definite, and for any integral curve
$x(t)$ of $X$ as in (7.25),

   $\dfrac{d}{dt} Q(x(t)) < 0$ if $t \ge 0$,

provided $x(0) = x_0\ (\neq 0)$ is close enough to 0. Deduce that for small enough $C$, if
$\|x_0\| \le C$, then $x(t)$ exists for all $t \ge 0$ and $x(t) \to 0$ as $t \to \infty$.
(Hint: Take $Q(x) = \langle x, x \rangle$, using Exercise 10 below.)

10. Let $A$ be an $n \times n$ matrix, all of whose eigenvalues $\lambda_j$ have negative real
part. Show that there exists a Hermitian inner product $\langle\ ,\ \rangle$ on $\mathbb{C}^n$
such that $\operatorname{Re}\langle Au, u\rangle < 0$ for nonzero $u \in \mathbb{C}^n$.
(Hint: Put $A$ in Jordan normal form, but with $\varepsilon$s instead of 1s above the
diagonal, where $\varepsilon$ is small compared with $|\operatorname{Re} \lambda_j|$.)


8. Lie brackets 


If $F : V \to W$ is a diffeomorphism between two open domains in $\mathbb{R}^n$, or between
two smooth manifolds, and $Y$ is a vector field on $W$, we define a vector field $F_\# Y$
on $V$ so that

(8.1)  $\mathcal{F}^t_{F_\# Y} = F^{-1} \circ \mathcal{F}^t_Y \circ F$,

or equivalently, by the chain rule,

(8.2)  $F_\# Y(x) = \big(DF^{-1}\big)\big(F(x)\big)\, Y\big(F(x)\big)$.

In particular, if $U \subset \mathbb{R}^n$ is open and $X$ is a vector field on $U$ defining a
flow $\mathcal{F}^t$, then for a vector field $Y$, $\mathcal{F}^t_\# Y$ is defined on most of $U$,
for $|t|$ small, and we can define the Lie derivative,

(8.3)  $\mathcal{L}_X Y = \lim\limits_{h \to 0} h^{-1}\big(\mathcal{F}^h_\# Y - Y\big) = \dfrac{d}{dt}\mathcal{F}^t_\# Y\Big|_{t=0}$,

as a vector field on $U$.
Another natural construction is the operator-theoretic bracket: 


(8.4) [X,Y] = XY -YX, 


where the vector fields $X$ and $Y$ are regarded as first-order differential operators
on $C^\infty(U)$. One verifies that (8.4) defines a derivation on $C^\infty(U)$, hence a vector
field on $U$. The basic elementary fact about the Lie bracket is the following.


Theorem 8.1. If $X$ and $Y$ are smooth vector fields, then

(8.5)  $\mathcal{L}_X Y = [X, Y]$.

Proof. Let us first verify the identity in the special case

   $X = \dfrac{\partial}{\partial x_1}, \qquad Y = \sum_j b_j(x)\dfrac{\partial}{\partial x_j}$.

Then $\mathcal{F}^t_\# Y = \sum_j b_j(x + t e_1)\,\partial/\partial x_j$, so
$\mathcal{L}_X Y = \sum_j (\partial b_j/\partial x_1)\,\partial/\partial x_j$, and a
straightforward calculation shows that this is also the formula for $[X, Y]$, in this
case.

Now we verify (8.5) in general, at any point $x_0 \in U$. First, if $X$ is nonvanishing
at $x_0$, we can choose a local coordinate system so the example above gives the
identity. By continuity, we get the identity (8.5) on the closure of the set of points
$x_0$ where $X(x_0) \neq 0$. Finally, if $x_0$ has a neighborhood where $X = 0$, clearly
$\mathcal{L}_X Y = 0$ and $[X, Y] = 0$ at $x_0$. This completes the proof.
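
The special case in the proof can be checked numerically. The sketch below computes the
components $[X,Y]^j = \sum_i (X^i\,\partial_i Y^j - Y^i\,\partial_i X^j)$ by central
differences; the function names and the sample fields are hypothetical choices made
for this illustration.

```python
import math


def lie_bracket(X, Y, p, h=1e-5):
    """Components of [X, Y] at p: [X,Y]^j = X^i d_i Y^j - Y^i d_i X^j."""
    n = len(p)

    def directional(F, v):
        # Central difference of the components of F along the vector v at p.
        plus = F([p[i] + h * v[i] for i in range(n)])
        minus = F([p[i] - h * v[i] for i in range(n)])
        return [(plus[j] - minus[j]) / (2 * h) for j in range(n)]

    Xp, Yp = X(p), Y(p)
    x_applied_to_y = directional(Y, Xp)   # X acting on the components of Y
    y_applied_to_x = directional(X, Yp)   # Y acting on the components of X
    return [x_applied_to_y[j] - y_applied_to_x[j] for j in range(n)]


def X(p):
    """X = d/dx_1, as in the special case of the proof."""
    return [1.0, 0.0]


def Y(p):
    """Sample Y with b_1 = x_1 x_2, b_2 = sin(x_1)."""
    return [p[0] * p[1], math.sin(p[0])]


def rot(p):
    """Rotation field -y d/dx + x d/dy."""
    return [-p[1], p[0]]


def dil(p):
    """Dilation field x d/dx + y d/dy."""
    return [p[0], p[1]]
```

For the first pair the bracket should agree with $(\partial b_1/\partial x_1, \partial b_2/\partial x_1) = (x_2, \cos x_1)$;
the rotation and dilation fields commute, so their bracket should vanish up to rounding.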


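Corollary 8.2. If $X$ and $Y$ are smooth vector fields on $U$, then

(8.6)  $\dfrac{d}{dt}\mathcal{F}^t_{X\#} Y = \mathcal{F}^t_{X\#}[X, Y]$,

for all $t$.

Proof. Since locally $\mathcal{F}^{t+s}_X = \mathcal{F}^s_X \mathcal{F}^t_X$, we have the same
identity for $\mathcal{F}^{t+s}_{X\#}$, which yields (8.6) upon taking the $s$-derivative.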


We make some further comments about cases when one can explicitly integrate 
a vector field X in the plane, exploiting “symmetries” that may be apparent. In 
fact, suppose one has in hand a vector field Y such that 


(8.7) [X,Y] =0. 


By (8.6), this implies $\mathcal{F}^t_{Y\#} X = X$ for all $t$; this connection will be
pursued further in the next section. Suppose that one has an explicit hold on the flow
generated by $Y$, so one can produce explicit local coordinates $(u,v)$ with respect to
which

(8.8)  $Y = \dfrac{\partial}{\partial u}$.

In this coordinate system, write $X = a(u,v)\,\partial/\partial u + b(u,v)\,\partial/\partial v$.
The condition (8.7) implies $\partial a/\partial u = 0 = \partial b/\partial u$, so in fact we have

(8.9)  $X = a(v)\dfrac{\partial}{\partial u} + b(v)\dfrac{\partial}{\partial v}$.

Integral curves of (8.9) satisfy

(8.10)  $u' = a(v), \quad v' = b(v)$

and can be found explicitly in terms of integrals; one has

(8.11)  $\displaystyle\int b(v)^{-1}\,dv = t + C_1$

and then

(8.12)  $u = \displaystyle\int a(v(t))\,dt + C_2$.

More generally than (8.7), we can suppose that, for some constant $c$,

(8.13)  $[X, Y] = cX$,

which by (8.6) is the same as

(8.14)  $\mathcal{F}^t_{Y\#} X = e^{-ct} X$.

An example would be

(8.15)  $X = f(x,y)\dfrac{\partial}{\partial x} + g(x,y)\dfrac{\partial}{\partial y}$,

where $f$ and $g$ satisfy “homogeneity” conditions of the form

(8.16)  $f(r^a x, r^b y) = r^{a-c} f(x,y), \quad g(r^a x, r^b y) = r^{b-c} g(x,y)$,

for $r > 0$; in such a case one can take explicitly

(8.17)  $\mathcal{F}^t_Y(x,y) = (e^{at}x, e^{bt}y)$.
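
The claim that (8.16) and (8.17) produce (8.13) can be checked directly. The following
worked computation is a sketch added for illustration; it assumes only that the flow
(8.17) is generated by $Y = ax\,\partial/\partial x + by\,\partial/\partial y$ and that
$f$, $g$ are differentiable.

```latex
% Differentiate (8.16) in r at r = 1 (Euler-type relations):
\[
  a x\, f_x + b y\, f_y = (a - c)\, f, \qquad
  a x\, g_x + b y\, g_y = (b - c)\, g .
\]
% Components of [X, Y] = XY - YX, with Y = a x \partial_x + b y \partial_y:
\[
  [X, Y]^{x} = X(a x) - Y(f) = a f - (a - c) f = c f, \qquad
  [X, Y]^{y} = X(b y) - Y(g) = b g - (b - c) g = c g .
\]
% Hence [X, Y] = c X, which is (8.13).
```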




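Now, if one again has (8.8) in a local coordinate system $(u,v)$, then $X$ must have
the form

(8.18)  $X = e^{-cu}\Big[a(v)\dfrac{\partial}{\partial u} + b(v)\dfrac{\partial}{\partial v}\Big]$,

so that its integral curves satisfy

(8.19)  $u' = e^{-cu}a(v), \quad v' = e^{-cu}b(v) \ \Longrightarrow\ \dfrac{du}{dv} = \dfrac{a(v)}{b(v)}$.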

The hypothesis (8.13) implies that the linear span (over R) of X and Y is a 
two-dimensional, solvable Lie algebra. Sophus Lie devoted a good deal of effort 
to examining when one could use constructions of solvable Lie algebras of vector 
fields to integrate vector fields explicitly; his investigations led to his foundation 
of what is now called the theory of Lie groups. 


Exercises 


1. Verify that the bracket (8.4) satisfies the “Jacobi identity”

   $[X, [Y, Z]] - [Y, [X, Z]] = [[X, Y], Z]$,

i.e.,

   $[\mathcal{L}_X, \mathcal{L}_Y] Z = \mathcal{L}_{[X,Y]} Z$.


2. Find the integral curves of 


6) 

X= — 

Ce rae 

using (8.16). 
3. Find the integral curves of

   $(x^2 y + y^5)\dfrac{\partial}{\partial x} + (x^2 + xy^2 + y^4)\dfrac{\partial}{\partial y}$.


9. Commuting flows; Frobenius’s theorem 
Let $G : U \to V$ be a diffeomorphism. Recall from §8 the action on vector fields:

(9.1)  $G_\# Y(x) = DG(y)^{-1}\, Y(y), \quad y = G(x)$.

As noted there, an alternative characterization of $G_\# Y$ is given in terms of the
flow it generates. One has

(9.2)  $\mathcal{F}^t_Y \circ G = G \circ \mathcal{F}^t_{G_\# Y}$.

The proof of this is a direct consequence of the chain rule. As a special case, we
have the following.

Proposition 9.1. If $G_\# Y = Y$, then $\mathcal{F}^t_Y \circ G = G \circ \mathcal{F}^t_Y$.


From this, we derive the following condition for a pair of flows to commute. 
Let X and Y be vector fields on U. 


Proposition 9.2. If $X$ and $Y$ commute as differential operators, that is,

(9.3)  $[X, Y] = 0$,

then locally $\mathcal{F}^s_X$ and $\mathcal{F}^t_Y$ commute; in other words, for any
$p_0 \in U$, there exists a $\delta > 0$ such that for $|s|, |t| < \delta$,

(9.4)  $\mathcal{F}^s_X \mathcal{F}^t_Y p_0 = \mathcal{F}^t_Y \mathcal{F}^s_X p_0$.

Proof. By Proposition 9.1, it suffices to show that $\mathcal{F}^s_{X\#} Y = Y$. This clearly
holds at $s = 0$. But by (8.6), we have

   $\dfrac{d}{ds}\mathcal{F}^s_{X\#} Y = \mathcal{F}^s_{X\#}[X, Y]$,

which vanishes if (9.3) holds. This finishes the proof.
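
Proposition 9.2 can be illustrated with flows that are known in closed form. In the
sketch below, the rotation and dilation fields commute, while $\partial/\partial x$ and
$x\,\partial/\partial y$ have bracket $\partial/\partial y \neq 0$; the four flows and the
helper `commutator_gap` are assumptions made for the example.

```python
import math


def flow_rotation(s, p):
    """Flow of -y d/dx + x d/dy for time s."""
    x, y = p
    return (x * math.cos(s) - y * math.sin(s), x * math.sin(s) + y * math.cos(s))


def flow_dilation(t, p):
    """Flow of x d/dx + y d/dy for time t."""
    return (math.exp(t) * p[0], math.exp(t) * p[1])


def flow_translation(s, p):
    """Flow of d/dx for time s."""
    return (p[0] + s, p[1])


def flow_shear(t, p):
    """Flow of x d/dy for time t (x is constant along orbits)."""
    return (p[0], p[1] + t * p[0])


def commutator_gap(flow_a, flow_b, s, t, p):
    """Largest coordinate difference between the two orders of composition."""
    ab = flow_a(s, flow_b(t, p))
    ba = flow_b(t, flow_a(s, p))
    return max(abs(ab[0] - ba[0]), abs(ab[1] - ba[1]))
```

For the translation and shear pair the two compositions differ by exactly $st$ in the
$y$-coordinate, which is the first-order signature of the nonzero bracket.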


We have stated that given (9.3), the identity (9.4) holds locally. If the flows
generated by $X$ and $Y$ are not complete, this can break down globally. For example,
consider $X = \partial/\partial x_1$, $Y = \partial/\partial x_2$ on $\mathbb{R}^2$, which satisfy
(9.3) and generate commuting flows. These vector fields lift to vector fields on the
universal covering surface $M$ of $\mathbb{R}^2 \setminus (0,0)$, which continue to satisfy
(9.3). The flows on $M$ do not commute globally. This phenomenon does not arise, for
example, for vector fields on a compact manifold.

We now consider when a family of vector fields has a multidimensional integral
manifold. Suppose $X_1, \dots, X_k$ are smooth vector fields on $U$ which are linearly
independent at each point of a $k$-dimensional surface $\Sigma \subset U$. If each $X_j$ is
tangent to $\Sigma$ at each point, $\Sigma$ is said to be an integral manifold of
$(X_1, \dots, X_k)$.


Proposition 9.3. Suppose $X_1, \dots, X_k$ are linearly independent at each point of
$U$ and $[X_j, X_\ell] = 0$ for all $j, \ell$. Then, for each $x_0 \in U$, there is a
$k$-dimensional integral manifold of $(X_1, \dots, X_k)$ containing $x_0$.

Proof. We define a map $F : V \to U$, $V$ a neighborhood of 0 in $\mathbb{R}^k$, by

(9.5)  $F(t_1, \dots, t_k) = \mathcal{F}^{t_1}_{X_1} \cdots \mathcal{F}^{t_k}_{X_k} x_0$.

Clearly, $(\partial/\partial t_1)F = X_1(F)$. Similarly, since the $\mathcal{F}^{t_j}_{X_j}$ all
commute, we can put any $\mathcal{F}^{t_j}_{X_j}$ first and get
$(\partial/\partial t_j)F = X_j(F)$. This shows that the image of $V$ under $F$
is an integral manifold containing $x_0$.

We now derive a more general condition guaranteeing the existence of integral
submanifolds. This important result is due to Frobenius. We say $(X_1, \dots, X_k)$ is
involutive provided that, for each $j, \ell$, there are smooth $b^{j\ell}_m(x)$ such that

(9.6)  $[X_j, X_\ell] = \displaystyle\sum_{m=1}^{k} b^{j\ell}_m(x)\, X_m$.

The following is Frobenius’s theorem. 


Theorem 9.4. If $(X_1, \dots, X_k)$ are $C^\infty$ vector fields on $U$, linearly independent
at each point, and the involutivity condition (9.6) holds, then through each $x_0$
there is, locally, a unique integral manifold $\Sigma$, of dimension $k$.
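
To see that the involutivity condition (9.6) cannot simply be dropped, it may help to
look at a pair of fields that fails it. The example below is a standard one supplied
here for illustration; it does not come from the text.

```latex
% On R^3 with coordinates (x, y, z), take
\[
  X_1 = \frac{\partial}{\partial x} + y\,\frac{\partial}{\partial z}, \qquad
  X_2 = \frac{\partial}{\partial y} .
\]
% The components of X_2 are constant, so
\[
  [X_1, X_2] = X_1 X_2 - X_2 X_1
             = -\frac{\partial}{\partial y}(y)\,\frac{\partial}{\partial z}
             = -\frac{\partial}{\partial z} .
\]
% A combination \alpha X_1 + \beta X_2 has components (\alpha, \beta, \alpha y);
% matching (0, 0, -1) forces \alpha = 0 and then the z-component is 0, not -1.
% So [X_1, X_2] is not in span(X_1, X_2): (9.6) fails, and no surface is
% tangent to both X_1 and X_2 at each of its points.
```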


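We will give two proofs of this result. First, let us restate the conclusion as
follows. There exist local coordinates $(y_1, \dots, y_n)$ centered at $x_0$ such that

(9.7)  $\operatorname{span}(X_1, \dots, X_k) = \operatorname{span}\Big(\dfrac{\partial}{\partial y_1}, \dots, \dfrac{\partial}{\partial y_k}\Big)$.

First proof. The result is clear for $k = 1$. We will use induction on $k$. So let
the set of vector fields $X_1, \dots, X_{k+1}$ be linearly independent at each point and
involutive. Choose a local coordinate system so that $X_{k+1} = \partial/\partial u_1$. Now let

(9.8)  $Y_j = X_j - (X_j u_1)\dfrac{\partial}{\partial u_1}$ for $1 \le j \le k$, $\quad Y_{k+1} = \dfrac{\partial}{\partial u_1}$.

Since in $(u_1, \dots, u_n)$ coordinates, no $Y_1, \dots, Y_k$ involves $\partial/\partial u_1$,
neither does any Lie bracket, so

   $[Y_j, Y_\ell] \in \operatorname{span}(Y_1, \dots, Y_k), \quad j, \ell \le k$.

Thus $(Y_1, \dots, Y_k)$ is involutive. The induction hypothesis implies that there exist
local coordinates $(y_1, \dots, y_n)$ such that

   $\operatorname{span}(Y_1, \dots, Y_k) = \operatorname{span}\Big(\dfrac{\partial}{\partial y_1}, \dots, \dfrac{\partial}{\partial y_k}\Big)$.

Now let

(9.9)  $Z = Y_{k+1} - \displaystyle\sum_{\ell=1}^{k} (Y_{k+1} y_\ell)\dfrac{\partial}{\partial y_\ell} = \sum_{\ell > k} (Y_{k+1} y_\ell)\dfrac{\partial}{\partial y_\ell}$.

Since, in the $(u_1, \dots, u_n)$ coordinates, $Y_1, \dots, Y_k$ do not involve
$\partial/\partial u_1$, we have

   $[Y_{k+1}, Y_j] \in \operatorname{span}(Y_1, \dots, Y_k)$.

Thus $[Z, Y_j] \in \operatorname{span}(Y_1, \dots, Y_k)$ for $j \le k$, while (9.9) implies that
$[Z, \partial/\partial y_j]$ belongs to the span of
$(\partial/\partial y_{k+1}, \dots, \partial/\partial y_n)$, for $j \le k$. Thus we have

   $\Big[Z, \dfrac{\partial}{\partial y_j}\Big] = 0, \quad j \le k$.

Proposition 9.3 implies $\operatorname{span}(\partial/\partial y_1, \dots, \partial/\partial y_k, Z)$
has an integral manifold through each point, and since this span is equal to the span of
$X_1, \dots, X_{k+1}$, the first proof is complete.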


Second proof. Let $X_1, \dots, X_k$ be $C^\infty$ vector fields, linearly independent at each
point and satisfying the condition (9.6). Choose an $(n-k)$-dimensional surface
$O \subset U$, transverse to $X_1, \dots, X_k$. For $V$ a neighborhood of the origin in
$\mathbb{R}^k$, define $\Phi : V \times O \to U$ by

(9.10)  $\Phi(t_1, \dots, t_k, x) = \mathcal{F}^{t_1}_{X_1} \cdots \mathcal{F}^{t_k}_{X_k} x$.

We claim that, for $x$ fixed, the image of $V$ in $U$ is a $k$-dimensional surface $\Sigma$
tangent to each $X_j$, at each point of $\Sigma$. Note that since
$\Phi(0, \dots, t_j, \dots, 0, x) = \mathcal{F}^{t_j}_{X_j} x$, we have

(9.11)  $\dfrac{\partial}{\partial t_j}\Phi(0, \dots, 0, x) = X_j(x), \quad x \in O$.

To establish the claim, it suffices to show that $\mathcal{F}^t_{X_j\#} X_\ell$ is a linear
combination with coefficients in $C^\infty(U)$ of $X_1, \dots, X_k$. This is accomplished by
the following:

Lemma 9.5. Suppose $[Y, X_j] = \sum_\ell \lambda_{j\ell}(x) X_\ell$, with smooth coefficients
$\lambda_{j\ell}(x)$. Then $\mathcal{F}^t_{Y\#} X_j$ is a linear combination of $X_1, \dots, X_k$
with coefficients in $C^\infty(U)$.

Proof. Denote by $\Lambda$ the matrix $(\lambda_{j\ell})$, and let
$\Lambda(t) = \Lambda(t,x) = (\lambda_{j\ell}(\mathcal{F}^t_Y x))$.
Now let $A(t) = A(t,x)$ be the unique solution to the ODE

(9.12)  $A'(t) = \Lambda(t)A(t), \quad A(0) = I$.

Write $A = (a_{j\ell})$. We claim that

(9.13)  $\mathcal{F}^t_{Y\#} X_j = \displaystyle\sum_\ell a_{j\ell}(t,x)\, X_\ell$.

This formula will prove the lemma. Indeed, we have

   $\dfrac{d}{dt}(\mathcal{F}^t_Y)_\# X_j = (\mathcal{F}^t_Y)_\# [Y, X_j] = (\mathcal{F}^t_Y)_\# \displaystyle\sum_\ell \lambda_{j\ell} X_\ell = \sum_\ell (\lambda_{j\ell} \circ \mathcal{F}^t_Y)(\mathcal{F}^t_{Y\#} X_\ell)$.

Uniqueness of the solution to (9.12) gives (9.13), and we are done.

This completes the second proof of Frobenius’s theorem.


Exercises 


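1. Let $\Omega$ be open in $\mathbb{R}^{2n}$, identified with $\mathbb{C}^n$ via $z = x + iy$. Let

   $X = \displaystyle\sum_j \Big[a_j(x,y)\dfrac{\partial}{\partial x_j} + b_j(x,y)\dfrac{\partial}{\partial y_j}\Big]$

be a vector field on $\Omega$, where $a_j(x,y)$ and $b_j(x,y)$ are real-valued. Form
$f_j(z) = a_j(z) + i b_j(z)$. Consider the vector field

   $Y = JX = \displaystyle\sum_j \Big[-b_j(x,y)\dfrac{\partial}{\partial x_j} + a_j(x,y)\dfrac{\partial}{\partial y_j}\Big]$.

Show that $X$ and $Y$ commute, that is, $[X, Y] = 0$, provided $f(z)$ is holomorphic,
namely if the Cauchy-Riemann equations hold:

   $\dfrac{\partial a_j}{\partial x_k} = \dfrac{\partial b_j}{\partial y_k}, \qquad \dfrac{\partial a_j}{\partial y_k} = -\dfrac{\partial b_j}{\partial x_k}$.

2. Assuming $f_j(z) = a_j(z) + i b_j(z)$ are holomorphic, show that, for $z \in \Omega$,

   $z(t,s) = \mathcal{F}^t_X \mathcal{F}^s_Y z$

satisfies $\partial z/\partial s = J\,\partial z/\partial t$, and hence that $z(t,s)$ is
holomorphic in $t + is$.

3. Suppose $a_j(x)$ are real analytic (and real-valued) on $O \subset \mathbb{R}^n$. Let
$X = \sum_j a_j(x)\,\partial/\partial x_j$. Show that, for $x \in O$, $x(t) = \mathcal{F}^t_X x$
is real analytic in $t$ (for $t$ near 0), by applying Exercises 1 and 2.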

Compare the proof of this indicated in Exercise 2 of §6. 

4. Discuss the uniqueness of integral manifolds arising in Theorem 9.4. 

5. Let $A_j$ be smooth $m \times m$ matrix-valued functions on $O \subset \mathbb{R}^n$. Suppose
the operators $L_j = \partial/\partial x_j + A_j(x)$, acting on functions with values in
$\mathbb{R}^m$, all commute, $1 \le j \le n$. If $p \in O$, show that there is a solution in a
neighborhood of $p$ to

   $L_j u = 0, \quad 1 \le j \le n$,

with $u(p) \in \mathbb{R}^m$ prescribed.


10. Hamiltonian systems 


Hamiltonian systems arise from classical mechanics. As a most basic example,
consider the equations of motion that arise from Newton’s law $F = ma$, where
the force $F$ is given by

(10.1)  $F = -\operatorname{grad} V(x)$,

with $V$ the potential energy. We get the ODE

(10.2)  $m\dfrac{d^2 x}{dt^2} = -\dfrac{\partial V}{\partial x}$.

We can convert this into a first-order system for $(x, \xi)$, where

(10.3)  $\xi = m\dfrac{dx}{dt}$

is the momentum. We have

(10.4)  $\dfrac{dx}{dt} = \dfrac{\xi}{m}, \qquad \dfrac{d\xi}{dt} = -\dfrac{\partial V}{\partial x}$.

Now consider the total energy

(10.5)  $f(x, \xi) = \dfrac{1}{2m}|\xi|^2 + V(x)$.

Note that $\partial f/\partial \xi = \xi/m$ and $\partial f/\partial x = \partial V/\partial x$.
Thus (10.4) is of the form

(10.6)  $\dfrac{dx_j}{dt} = \dfrac{\partial f}{\partial \xi_j}, \qquad \dfrac{d\xi_j}{dt} = -\dfrac{\partial f}{\partial x_j}$.

Hence we’re looking for the integral curves of the vector field

(10.7)  $H_f = \displaystyle\sum_{j=1}^{n}\Big[\dfrac{\partial f}{\partial \xi_j}\dfrac{\partial}{\partial x_j} - \dfrac{\partial f}{\partial x_j}\dfrac{\partial}{\partial \xi_j}\Big]$.

For smooth $f(x, \xi)$, we call $H_f$, defined by (10.7), a Hamiltonian vector field.
Note that, directly from (10.7),

(10.8)  $H_f f = 0$.

A useful notation is the Poisson bracket, defined by

(10.9)  $\{f, g\} = H_f g$.

One verifies directly from (10.7) that

(10.10)  $\{f, g\} = -\{g, f\}$,

generalizing (10.8). Also, a routine calculation verifies that

(10.11)  $[H_f, H_g] = H_{\{f,g\}}$.
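
The identities (10.8)-(10.10) are easy to test numerically for one degree of freedom.
The sketch below uses central differences and the sign convention of (10.7) and (10.9),
namely $\{f,g\} = f_\xi g_x - f_x g_\xi$; the function names and the pendulum-type sample
energy are assumptions made for the illustration.

```python
import math


def partial(F, x, xi, wrt, h=1e-5):
    """Central-difference partial derivative of F(x, xi) in x or in xi."""
    if wrt == "x":
        return (F(x + h, xi) - F(x - h, xi)) / (2 * h)
    return (F(x, xi + h) - F(x, xi - h)) / (2 * h)


def poisson(f, g, x, xi):
    """{f, g} = H_f g = f_xi * g_x - f_x * g_xi, following (10.7) and (10.9)."""
    return (partial(f, x, xi, "xi") * partial(g, x, xi, "x")
            - partial(f, x, xi, "x") * partial(g, x, xi, "xi"))


def energy(x, xi):
    """Sample Hamiltonian: xi^2 / 2 - cos x."""
    return 0.5 * xi ** 2 - math.cos(x)


def position(x, xi):
    return x


def momentum(x, xi):
    return xi
```

With these conventions $\{f, x\} = \partial f/\partial\xi$ and $\{f, \xi\} = -\partial f/\partial x$,
which are the right sides of (10.6).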


As noted at the end of §7, if $X$ is a vector field in the plane and we explicitly
have a function $u$ with nonvanishing gradient such that $Xu = 0$, then $X$ can be
explicitly integrated. These comments apply to $X = H_f$, $u = f$, when $H_f$ is
a planar Hamiltonian vector field. We can rephrase this description as follows. If
$x \in \mathbb{R}$, $\xi \in \mathbb{R}$, then integral curves of

(10.12)  $x' = \dfrac{\partial f}{\partial \xi}, \qquad \xi' = -\dfrac{\partial f}{\partial x}$

lie on a level set

(10.13)  $f(x, \xi) = E$.

Suppose that locally this set is described by

(10.14)  $x = \varphi(\xi)$ or $\xi = \psi(x)$.

Then we have one of the following ODEs:

(10.15)  $x' = f_\xi(x, \psi(x))$ or $\xi' = -f_x(\varphi(\xi), \xi)$,

and hence we have

(10.16)  $\displaystyle\int f_\xi(x, \psi(x))^{-1}\,dx = t + C$

or

(10.17)  $-\displaystyle\int f_x(\varphi(\xi), \xi)^{-1}\,d\xi = t + C'$.

Thus, solving (10.12) is reduced to a quadrature, that is, a calculation of an explicit
integral, (10.16) or (10.17).

If the planar Hamiltonian vector field $H_f$ arises from describing motion in a
force field on a line, via Newton’s laws given in (10.2), so that

(10.18)  $f(x, \xi) = \dfrac{1}{2m}\xi^2 + V(x)$,

then the second curve in (10.14) is

(10.19)  $\xi = \pm\big[(2m)(E - V(x))\big]^{1/2}$,

and the formula (10.16) becomes

(10.20)  $\pm\Big(\dfrac{m}{2}\Big)^{1/2}\displaystyle\int \big[E - V(x)\big]^{-1/2}\,dx = t + C$,

defining $x$ implicitly as a function of $t$.

In some cases, the integral in (10.20) can be evaluated by elementary means.
This includes the trivial case of a constant force, where $V(x) = cx$, and also
the case of the “harmonic oscillator” or linearized spring, where $V(x) = cx^2$. It
also includes the case of the motion of a rocket in space, along a line through the
center of a planet, where $V(x) = -K/|x|$. This gravitational attraction problem
for motion in several-dimensional space will be studied further in §§16 and 17.
The case $V(x) = -K\cos x$ arises in the analysis of the pendulum (see (12.38)).
In that case, (10.20) is an elliptic integral, rather than one that arises in first-year
calculus.

For Hamiltonian vector fields in higher dimensions, more effort is required to
understand the resulting flows. The notion of complete integrability provides a
method of constructing explicit solutions in some cases, as will be discussed in
§§16 and 17.

Hamiltonian vector fields arise in the treatment of many problems in addition
to those derived from Newton’s laws in Cartesian coordinates. In §11 we study the
equations of geodesics and then show how they can be transformed to Hamiltonian
systems. In §12 this is seen to be a special case of a broad class of variational
problems, which lead to Hamiltonian systems, and which also encompass classical
mechanics. This variational approach has many convenient features, such as
allowing an easy formulation of the equations of motion in arbitrary coordinate
systems, a theme that will be developed in a number of subsequent sections.
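
The quadrature (10.20) can also be evaluated numerically. The sketch below computes
the travel time from $x = 0$ to a turning point $x_{\text{turn}}$ (where $V = E$) for a
potential symmetric enough that this is a quarter period. The substitution
$x = x_{\text{turn}}\sin\theta$ and the midpoint rule are choices made for this
illustration, and the function name is hypothetical.

```python
import math


def quarter_period(V, E, x_turn, m=1.0, n=20000):
    """Time to move from x = 0 to the turning point x_turn, using (10.20).

    The substitution x = x_turn * sin(theta) tames the inverse-square-root
    singularity at a simple turning point, and the midpoint rule never
    evaluates the integrand at theta = pi/2 itself.
    """
    total = 0.0
    dtheta = (math.pi / 2) / n
    for i in range(n):
        theta = (i + 0.5) * dtheta
        x = x_turn * math.sin(theta)
        dx = x_turn * math.cos(theta) * dtheta
        total += math.sqrt(m / 2) * dx / math.sqrt(E - V(x))
    return total
```

For the harmonic oscillator $V(x) = cx^2$ the exact value is $(\pi/2)\sqrt{m/(2c)}$, since
the motion has angular frequency $\sqrt{2c/m}$; this gives a check on the routine.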


Exercises 


1. Verify that $[H_f, H_g] = H_{\{f,g\}}$.
2. Demonstrate that the Poisson bracket satisfies the Jacobi identity

   $\{f, \{g, h\}\} - \{g, \{f, h\}\} = \{\{f, g\}, h\}$.

(Hint: Use Exercise 1 above and Exercise 1 of §8.)
3. Identifying $y$ and $\xi$, show that a planar vector field
$X = f(x,y)(\partial/\partial x) + g(x,y)(\partial/\partial y)$ is Hamiltonian if and only if
$\operatorname{div} X = 0$.
Reconsider Exercise 5 in §7.
4. Show that

   $\dfrac{d}{dt} g(x, \xi) = \{f, g\}$

on an orbit of $H_f$.
5. If $X = \sum_j X_j(x)\,\partial/\partial x_j$ is a vector field on $U \subset \mathbb{R}^n$,
associate to $X$ a function on $U \times \mathbb{R}^n \approx T^*U$:

(10.22)  $s_X(x, \xi) = \langle X, \xi\rangle = \displaystyle\sum_j \xi_j X_j(x)$.

Show that

(10.23)  $s_{[X,Y]} = \{s_X, s_Y\}$.


11. Geodesics 


Here we define the concept of a geodesic on a region with a Riemannian metric
(more generally, a Riemannian manifold). A Riemannian metric on $\Omega \subset \mathbb{R}^n$ is
specified by $g_{jk}(x)$, where $(g_{jk})$ is a positive-definite, smooth, $n \times n$
matrix-valued function on $\Omega$. If $U = \sum u^j(x)\,\partial/\partial x_j$ and
$V = \sum v^j(x)\,\partial/\partial x_j$ are two vector fields on $\Omega$, their inner product
is the smooth scalar function

(11.1)  $\langle U, V\rangle = g_{jk}(x)\,u^j(x)\,v^k(x)$,

using the summation convention (i.e., summing over repeated indices). If $\Omega$ is a
manifold, a Riemannian metric is an inner product on each tangent space $T_x\Omega$,
given in local coordinates by (11.1). Thus, $(g_{jk})$ gives rise to a tensor field of type
(0, 2), that is, a section of the bundle $\otimes^2 T^*\Omega$.

If $\gamma(t)$, $a \le t \le b$, is a smooth curve on $\Omega$, its length is

(11.2)  $L = \displaystyle\int_a^b \|\gamma'(t)\|\,dt = \int_a^b \big[g_{jk}(\gamma(t))\,\gamma_j'(t)\,\gamma_k'(t)\big]^{1/2}\,dt$.

A curve $\gamma$ is said to be a geodesic if, for $|t_1 - t_2|$ sufficiently small,
$t_j \in [a, b]$, the curve $\gamma(t)$, $t_1 \le t \le t_2$, has the shortest length of all
smooth curves in $\Omega$ from $\gamma(t_1)$ to $\gamma(t_2)$.

We derive the ODE for a geodesic. We start with the case where $\Omega$ has the
metric induced from a diffeomorphism $\Omega \to S$, $S$ a hypersurface in $\mathbb{R}^{n+1}$; we
will identify $\Omega$ and $S$ here. (See the exercises for a parallel treatment of
$S \subset \mathbb{R}^k$, $k > n$.) This short computation will serve as a guide for the general
case.

So let $\gamma_0(t)$ be a smooth curve in $S$ ($a \le t \le b$), joining $p$ and $q$. Suppose
$\gamma_s(t)$ is a smooth family of such curves. We look for a condition guaranteeing
that $\gamma_0(t)$ has minimum length. Since the length of a curve is independent of its
parameterization, we may additionally suppose that

(11.3)  $\|\gamma_0'(t)\| = c_0$, constant, for $a \le t \le b$.

Let $N$ denote a field of normal vectors to $S$. Note that

(11.4)  $V = \dfrac{\partial}{\partial s}\gamma_s(t) \perp N$.

Also, any vector field $V \perp N$ over the image of $\gamma_0$ can be obtained by some
variation $\gamma_s$ of $\gamma_0$, provided $V = 0$ at $p$ and $q$. Recall that we are assuming
$\gamma_s(a) = p$, $\gamma_s(b) = q$. If $L(s)$ denotes the length of $\gamma_s$, we have

(11.5)  $L(s) = \displaystyle\int_a^b \|\gamma_s'(t)\|\,dt$,

and hence

(11.6)  $L'(s) = \dfrac{1}{2}\displaystyle\int_a^b \|\gamma_s'(t)\|^{-1}\dfrac{\partial}{\partial s}\big(\gamma_s'(t), \gamma_s'(t)\big)\,dt = \dfrac{1}{c_0}\int_a^b \Big(\dfrac{\partial}{\partial s}\gamma_s'(t), \gamma_s'(t)\Big)\,dt$, at $s = 0$.

Using the identity

(11.7)  $\dfrac{d}{dt}\Big(\dfrac{\partial}{\partial s}\gamma_s(t), \gamma_s'(t)\Big) = \Big(\dfrac{\partial}{\partial s}\gamma_s'(t), \gamma_s'(t)\Big) + \Big(\dfrac{\partial}{\partial s}\gamma_s(t), \gamma_s''(t)\Big)$,

together with the fundamental theorem of calculus, in view of the fact that

(11.8)  $\dfrac{\partial}{\partial s}\gamma_s(t) = 0$, at $t = a$ and $b$,

we have

(11.9)  $L'(s) = -\dfrac{1}{c_0}\displaystyle\int_a^b \big(V(t), \gamma_s''(t)\big)\,dt$, at $s = 0$.

Now, if $\gamma_0$ were a geodesic, we would have

(11.10)  $L'(0) = 0$,

for all such variations. In other words, we must have $\gamma_0''(t) \perp V$ for all vector
fields $V$ tangent to $S$ (and vanishing at $p$ and $q$), and hence

(11.11)  $\gamma_0''(t) \parallel N$.

This vanishing of the tangential curvature of $\gamma_0$ is the usual geodesic equation for
a hypersurface in $\mathbb{R}^{n+1}$.

We proceed to derive from (11.11) an ODE in standard form. Suppose $S$ is
defined locally by $u(x) = C$, $\nabla u \neq 0$. Then (11.11) is equivalent to

(11.12)  $\gamma_0''(t) = K\,\nabla u(\gamma_0(t))$,

for a scalar $K$ that remains to be determined. But the condition that $u(\gamma_0(t)) = C$
implies

   $\gamma_0'(t) \cdot \nabla u(\gamma_0(t)) = 0$,

and differentiating this gives

(11.13)  $\gamma_0''(t) \cdot \nabla u(\gamma_0(t)) = -\gamma_0'(t) \cdot D^2u(\gamma_0(t)) \cdot \gamma_0'(t)$,

where $D^2u$ is the matrix of second-order partial derivatives of $u$. Comparing
(11.12) and (11.13) gives $K$, and we obtain the ODE

(11.14)  $\gamma_0''(t) = -\big|\nabla u(\gamma_0(t))\big|^{-2}\big[\gamma_0'(t) \cdot D^2u(\gamma_0(t)) \cdot \gamma_0'(t)\big]\,\nabla u(\gamma_0(t))$

for a geodesic $\gamma_0$ lying in $S$.
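
As a sanity check on (11.14), one can integrate it on the unit sphere, taking
$u(x) = |x|^2$, so that $\nabla u = 2x$ and $D^2u = 2I$; the solution starting at
$(1,0,0)$ with velocity $(0,1,0)$ should trace the great circle $(\cos t, \sin t, 0)$.
The code below is a minimal sketch; the Runge-Kutta integrator and every name in it
are assumptions made for the example.

```python
import math


def geodesic_rhs(grad_u, hess_u, x, v):
    """Acceleration from (11.14): -|grad u|^(-2) (v . D^2u v) grad u."""
    g = grad_u(x)
    H = hess_u(x)
    n = len(x)
    quad = sum(v[i] * H[i][j] * v[j] for i in range(n) for j in range(n))
    norm2 = sum(gi * gi for gi in g)
    return [-quad / norm2 * gi for gi in g]


def integrate(grad_u, hess_u, x, v, h, steps):
    """RK4 on the first-order system (x, v)' = (v, geodesic_rhs)."""
    n = len(x)

    def deriv(state):
        xs, vs = state[:n], state[n:]
        return vs + geodesic_rhs(grad_u, hess_u, xs, vs)  # list concatenation

    state = list(x) + list(v)
    for _ in range(steps):
        k1 = deriv(state)
        k2 = deriv([s + h / 2 * k for s, k in zip(state, k1)])
        k3 = deriv([s + h / 2 * k for s, k in zip(state, k2)])
        k4 = deriv([s + h * k for s, k in zip(state, k3)])
        state = [s + h * (a + 2 * b + 2 * c + d) / 6
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4)]
    return state[:n], state[n:]


def sphere_grad(x):
    """Gradient of u(x) = |x|^2."""
    return [2 * xi for xi in x]


def sphere_hess(x):
    """Hessian of u(x) = |x|^2, i.e. 2I."""
    return [[2.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
```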

We now want to parallel (11.6)-(11.11), to provide the ODE for a geodesic
on $\Omega$ with a general Riemannian metric. As before, let $\gamma_s(t)$ be a one-parameter
family of curves satisfying $\gamma_s(a) = p$, $\gamma_s(b) = q$, and (11.3). Then

(11.15)  $V = \dfrac{\partial}{\partial s}\gamma_s(t)\Big|_{s=0}$

is a vector field defined on the curve $\gamma_0(t)$, vanishing at $p$ and $q$, and a general
vector field of this sort could be obtained by a variation $\gamma_s(t)$. Let

(11.16)  $T = \gamma_s'(t)$.

With the notation of (11.1), we have, parallel to (11.6),

(11.17)  $L'(s) = \displaystyle\int_a^b V\langle T, T\rangle^{1/2}\,dt = \dfrac{1}{2c_0}\int_a^b V\langle T, T\rangle\,dt$, at $s = 0$.

Now we need a generalization of $(\partial/\partial s)\gamma_s'(t)$ and of the formula (11.7). One
natural approach involves the notion of a covariant derivative.

If $X$ and $Y$ are vector fields on $\Omega$, the covariant derivative $\nabla_X Y$ is a vector
field on $\Omega$. The following properties are to hold: We assume that $\nabla_X Y$ is additive
in both $X$ and $Y$, that

(11.18)  $\nabla_{fX} Y = f\,\nabla_X Y$,

for $f \in C^\infty(\Omega)$, and that

(11.19)  $\nabla_X(fY) = f\,\nabla_X Y + (Xf)\,Y$

(i.e., $\nabla_X$ acts as a derivation). The operator $\nabla_X$ is required to have the
following relation to the Riemannian metric:

(11.20)  $X\langle Y, Z\rangle = \langle\nabla_X Y, Z\rangle + \langle Y, \nabla_X Z\rangle$.

One further property, called the “zero torsion condition,” will uniquely specify $\nabla$:

(11.21)  $\nabla_X Y - \nabla_Y X = [X, Y]$.

If these properties hold, one says that $\nabla$ is a “Levi-Civita connection.” We have
the following existence result.

Proposition 11.1. Associated with a Riemannian metric is a unique Levi-Civita
connection, given by

(11.22)  $2\langle\nabla_X Y, Z\rangle = X\langle Y, Z\rangle + Y\langle X, Z\rangle - Z\langle X, Y\rangle + \langle[X, Y], Z\rangle - \langle[X, Z], Y\rangle - \langle[Y, Z], X\rangle$.

Proof. To obtain the formula (11.22), cyclically permute $X$, $Y$, and $Z$ in (11.20)
and take the appropriate alternating sum, using (11.21) to cancel out all terms
involving $\nabla$ but two copies of $\langle\nabla_X Y, Z\rangle$. This derives the formula and
establishes uniqueness. On the other hand, if (11.22) is taken as the definition of
$\nabla_X Y$, then verification of the properties (11.18)-(11.21) is a routine exercise.


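We can resume our analysis of (11.17), which becomes

(11.23)  $L'(s) = \dfrac{1}{c_0}\displaystyle\int_a^b \langle\nabla_V T, T\rangle\,dt$, at $s = 0$.

Since $\partial/\partial s$ and $\partial/\partial t$ commute, we have $[V, T] = 0$ on $\gamma_0$, and
(11.21) implies

(11.24)  $L'(s) = \dfrac{1}{c_0}\displaystyle\int_a^b \langle\nabla_T V, T\rangle\,dt$, at $s = 0$.

The replacement for (11.7) is

(11.25)  $T\langle V, T\rangle = \langle\nabla_T V, T\rangle + \langle V, \nabla_T T\rangle$,

so, by the fundamental theorem of calculus,

(11.26)  $L'(0) = -\dfrac{1}{c_0}\displaystyle\int_a^b \langle V, \nabla_T T\rangle\,dt$.

If this is to vanish for all smooth vector fields over $\gamma_0$, vanishing at $p$ and $q$, we
must have

(11.27)  $\nabla_T T = 0$.

This is the geodesic equation for a general Riemannian metric.
If $\Omega \subset \mathbb{R}^n$ carries a Riemannian metric $g_{jk}(x)$ and a corresponding
Levi-Civita connection, the Christoffel symbols $\Gamma^k{}_{ij}$ are defined by

(11.28)  $\nabla_{D_i} D_j = \displaystyle\sum_k \Gamma^k{}_{ij}\, D_k$,

where $D_k = \partial/\partial x_k$. The formula (11.22) implies

(11.29)  $g_{k\ell}\,\Gamma^\ell{}_{ij} = \dfrac{1}{2}\Big[\dfrac{\partial g_{jk}}{\partial x_i} + \dfrac{\partial g_{ik}}{\partial x_j} - \dfrac{\partial g_{ij}}{\partial x_k}\Big]$.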
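
For the reader who wants the intermediate step, here is a short worked derivation of
(11.29) from (11.22) and (11.28). It is a sketch using only the facts that the
coordinate fields $D_i$ commute and that $\langle D_j, D_k\rangle = g_{jk}$.

```latex
% Put X = D_i, Y = D_j, Z = D_k in (11.22). All brackets [D_a, D_b] vanish, so
\[
  2\langle \nabla_{D_i} D_j, D_k \rangle
    = D_i\, g_{jk} + D_j\, g_{ik} - D_k\, g_{ij} .
\]
% By (11.28), the left side is
\[
  2\Big\langle \sum_\ell \Gamma^\ell{}_{ij} D_\ell,\; D_k \Big\rangle
    = 2\, g_{k\ell}\, \Gamma^\ell{}_{ij} ,
\]
% and dividing by 2 gives (11.29).
```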


We can rewrite the geodesic equation (11.27) for $\gamma_0(t) = x(t)$ as follows. With
$x = (x_1, \dots, x_n)$ and $T = (\dot{x}_1, \dots, \dot{x}_n)$, we have

(11.30)  $0 = \displaystyle\sum_\ell \nabla_T(\dot{x}^\ell D_\ell) = \sum_\ell \big[\ddot{x}^\ell D_\ell + \dot{x}^\ell\,\nabla_T D_\ell\big]$.

In view of (11.28), this becomes

(11.31)  $\ddot{x}^\ell + \dot{x}^j \dot{x}^k\,\Gamma^\ell{}_{jk} = 0$

(with the summation convention). The standard existence and uniqueness theory
applies to this system of second-order ODE. We will call any smooth curve
satisfying (11.27), or equivalently (11.31), a geodesic. Shortly we will verify
that such a curve is indeed locally length-minimizing. Note that if $T = \gamma'(t)$,
then $T\langle T, T\rangle = 2\langle\nabla_T T, T\rangle$; so if (11.27) holds, $\gamma(t)$
automatically has constant speed.
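
To make (11.29) and (11.31) concrete, the following sketch computes Christoffel symbols
of a two-dimensional metric by central differences and evaluates the geodesic
acceleration. It is tried on the Euclidean metric in polar coordinates,
$g = \operatorname{diag}(1, r^2)$, where $\Gamma^r{}_{\theta\theta} = -r$ and
$\Gamma^\theta{}_{r\theta} = 1/r$. The restriction to two dimensions, the finite-difference
step, and all names are assumptions made for the example.

```python
def christoffel(metric, x, h=1e-5):
    """gamma[l][i][j] = Gamma^l_ij of a 2D metric, from (11.29)."""
    def dg(k):
        # Matrix of partial derivatives d g_ab / d x_k by central differences.
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        gp, gm = metric(xp), metric(xm)
        return [[(gp[a][b] - gm[a][b]) / (2 * h) for b in range(2)]
                for a in range(2)]

    g = metric(x)
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    ginv = [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]
    d = [dg(0), dg(1)]  # d[k][a][b] = d g_ab / d x_k
    gamma = [[[0.0] * 2 for _ in range(2)] for _ in range(2)]
    for l in range(2):
        for i in range(2):
            for j in range(2):
                gamma[l][i][j] = sum(
                    0.5 * ginv[l][k] * (d[i][j][k] + d[j][i][k] - d[k][i][j])
                    for k in range(2))
    return gamma


def geodesic_acceleration(metric, x, v):
    """Second derivative from (11.31): -Gamma^l_jk v^j v^k."""
    gamma = christoffel(metric, x)
    return [-sum(gamma[l][j][k] * v[j] * v[k]
                 for j in range(2) for k in range(2))
            for l in range(2)]


def polar_metric(x):
    """Euclidean metric in polar coordinates x = (r, theta)."""
    r = x[0]
    return [[1.0, 0.0], [0.0, r * r]]
```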

For a given $p \in \Omega$, the exponential map

(11.32)  $\operatorname{Exp}_p : U \longrightarrow \Omega$

is defined on a neighborhood $U$ of $0 \in \mathbb{R}^n = T_p\Omega$ by

(11.33)  $\operatorname{Exp}_p(v) = \gamma_v(1)$,

where $\gamma_v(t)$ is the unique constant-speed geodesic satisfying

(11.34)  $\gamma_v(0) = p, \quad \gamma_v'(0) = v$.

Note that $\operatorname{Exp}_p(tv) = \gamma_v(t)$. It is clear that $\operatorname{Exp}_p$ is well
defined and $C^\infty$ on a sufficiently small neighborhood $U$ of $0 \in \mathbb{R}^n$, and its
derivative at 0 is the identity. Thus, perhaps shrinking $U$, we have that
$\operatorname{Exp}_p$ is a diffeomorphism of $U$ onto a neighborhood $O$ of $p$ in $\Omega$. This
provides what is called an exponential coordinate system, or a normal coordinate system.
Clearly, the geodesics through $p$ are the lines through the origin in this coordinate
system. We claim that in this coordinate system

(11.35)  $\Gamma^\ell{}_{jk}(p) = 0$.

Indeed, since the line through the origin in any direction $aD_j + bD_k$ is a geodesic,
we have

(11.36)  $\nabla_{(aD_j + bD_k)}(aD_j + bD_k) = 0$, at $p$,

for all $a, b \in \mathbb{R}$ and all $j, k$. This implies

(11.37)  $\nabla_{D_j} D_k = 0$, at $p$ for all $j, k$,

which implies (11.35). We note that (11.35) implies $\partial g_{jk}/\partial x_\ell = 0$ at $p$, in
this exponential coordinate system. In fact, a simple manipulation of (11.29) gives

(11.38)  $\dfrac{\partial g_{jk}}{\partial x_\ell} = g_{mk}\,\Gamma^m{}_{j\ell} + g_{mj}\,\Gamma^m{}_{k\ell}$.

As a consequence, a number of calculations in differential geometry can be
simplified by working in exponential coordinate systems.

We now establish a result, known as the Gauss lemma, which implies that a
geodesic is locally length-minimizing. For $a$ small, let
$\Sigma_a = \{v \in \mathbb{R}^n : \|v\| = a\}$, and let $S_a = \operatorname{Exp}_p(\Sigma_a)$.

Proposition 11.2. Any unit-speed geodesic through $p$ hitting $S_a$ at $t = a$ is
orthogonal to $S_a$.

Proof. If $\gamma_0(t)$ is a unit-speed geodesic, $\gamma_0(0) = p$, $\gamma_0(a) = q \in S_a$, and
$V \in T_q\Omega$ is tangent to $S_a$, there is a smooth family of unit-speed geodesics,
$\gamma_s(t)$, such that $\gamma_s(0) = p$ and $(\partial/\partial s)\gamma_s(a)\big|_{s=0} = V$. Using
(11.24) and (11.25) for this family, with $0 \le t \le a$, since $L(s)$ is constant, we have

   $0 = \displaystyle\int_0^a T\langle V, T\rangle\,dt = \langle V, \gamma_0'(a)\rangle$,

which proves the proposition.

Though a geodesic is locally length-minimizing, it need not be globally
length-minimizing. There are many simple examples of this, some of which are discussed
in the exercises.
We next consider a “naive” alternative to the calculations (11.17)-(11.31), not
bringing in the notion of covariant derivative, in order to compute $L'(0)$ when
$L(s)$ is given by

(11.39)  $L(s) = \displaystyle\int_a^b \big[g_{jk}(x_s(t))\,\dot{x}_s^j(t)\,\dot{x}_s^k(t)\big]^{1/2}\,dt$.

We use the notation $T^j = \dot{x}_0^j(t)$, $V^j = (\partial/\partial s)x_s^j(t)\big|_{s=0}$.
Calculating in a spirit similar to that of (11.6), we have (with $x = x_0$)

(11.40)  $L'(0) = \dfrac{1}{c_0}\displaystyle\int_a^b \Big[g_{jk}(x)\,\dfrac{\partial}{\partial s}\dot{x}_s^j\Big|_{s=0} T^k + \dfrac{1}{2}\dfrac{\partial g_{jk}}{\partial x_\ell}\,V^\ell T^j T^k\Big]\,dt$.

Now, in analogy with (11.7), and in place of (11.25), we can write

(11.41)  $\dfrac{d}{dt}\big(g_{jk}(x(t))\,V^j T^k\big) = g_{jk}\,\dfrac{\partial}{\partial s}\dot{x}_s^j(t)\Big|_{s=0} T^k + g_{jk}\,V^j\,\ddot{x}^k(t) + T^\ell\,\dfrac{\partial g_{jk}}{\partial x_\ell}\,V^j T^k$.

Thus, by the fundamental theorem of calculus,

(11.42)  $L'(0) = -\dfrac{1}{c_0}\displaystyle\int_a^b \Big[g_{jk}\,V^j\,\ddot{x}^k + \dfrac{\partial g_{jk}}{\partial x_\ell}\,V^j T^k T^\ell - \dfrac{1}{2}\dfrac{\partial g_{k\ell}}{\partial x_j}\,V^j T^k T^\ell\Big]\,dt$,

and the stationary condition $L'(0) = 0$ for all variations of the form described
before implies

(11.43)  $g_{jk}\,\ddot{x}^k(t) = -\Big(\dfrac{\partial g_{jk}}{\partial x_\ell} - \dfrac{1}{2}\dfrac{\partial g_{k\ell}}{\partial x_j}\Big) T^k T^\ell$.

Symmetrizing the quantity in parentheses with respect to $k$ and $\ell$ yields the ODE
(11.31), with $\Gamma^\ell{}_{jk}$ given by (11.29).

Of the two derivations for the equations of (constant-speed) geodesics given
in this section, the latter is a bit shorter and more direct. On the other hand, the
slight additional complication of the first derivation paid for the introduction of
the notion of covariant derivative, a fundamental object in differential geometry.
As we will see in the next section, the methods of the second derivation are very
flexible; there we consider a class of extremal problems, containing the problem
of geodesics, and also containing problems giving rise to the equations of classical
physics, via the stationary action principle.

We now show that the geodesic flow equations can be transformed to a Hamiltonian
system. Let $(g^{jk})$ denote the matrix inverse of $(g_{jk})$, and relate $v \in \mathbb{R}^n$ to
$\xi \in \mathbb{R}^n$ by

(11.44)  $\xi_j = g_{jk}(x)\,v_k$, i.e., $v_j = g^{jk}(x)\,\xi_k$.

Define $f(x, \xi)$ on $\Omega \times \mathbb{R}^n$ by

(11.45)  $f(x, \xi) = \dfrac{1}{2}\,g^{jk}(x)\,\xi_j\,\xi_k$,

as before using the summation convention. For a manifold $M$, (11.44) is a local
coordinate expression of the Riemannian metric tensor, providing an isomorphism
of $TM$ with $T^*M$, and (11.45) defines half the square norm on $T^*M$. Then the
integral curves $(x(t), \xi(t))$ of $H_f$ satisfy

(11.46)  $\dot{x}_j = g^{jk}(x)\,\xi_k, \qquad \dot{\xi}_\ell = -\dfrac{1}{2}\dfrac{\partial g^{jk}}{\partial x_\ell}\,\xi_j\,\xi_k$.

If we differentiate the first equation and plug in the second one for $\dot{\xi}_k$, we get

(11.47)  $\ddot{x}_\ell = \displaystyle\sum\Big[-\dfrac{1}{2}\,g^{k\ell}\,\dfrac{\partial g^{ij}}{\partial x_k} + g^{ik}\,\dfrac{\partial g^{\ell j}}{\partial x_k}\Big]\xi_i\,\xi_j$,

and using $\xi_j = \sum g_{jk}(x)\,\dot{x}_k$, straightforward manipulations yield the geodesic
equation (11.31), with $\Gamma^\ell{}_{jk}$ given by (11.29).

We now describe a relatively noncomputational approach to the result just
obtained. Identifying $(x, v)$-space and $(x, \xi)$-space via (11.44), let $Y$ be the
resulting vector field on $(x, \xi)$-space defined by the geodesic flow. The result we want
to reestablish is that $Y$ and $H_f$ coincide at an arbitrary point
$(x_0, \xi_0) \in \Omega \times \mathbb{R}^n$.
We will make use of an exponential coordinate system centered at $x_0$; recall that
in this coordinate system the geodesics through $x_0$ become precisely the lines
through the origin. (Of course, geodesics through nearby points are not generally
straight lines in this coordinate system.) In such a coordinate system, we
can arrange $g^{jk}(x_0) = \delta^{jk}$ and, by (11.35), $(\partial g^{jk}/\partial x_\ell)(x_0) = 0$.
Thus, if $\xi_0 = (a_1, \dots, a_n)$, using (11.46) we have

(11.48)  $H_f(x_0, \xi_0) = \displaystyle\sum_j a_j\,\dfrac{\partial}{\partial x_j} = Y(x_0, \xi_0)$

in this coordinate system. The identity of $H_f$ and $Y$ at $(x_0, \xi_0)$ is independent
of the coordinate system used, so our result is again established. Actually, there
is a little cheat here. We have not shown that $H_f$ is defined independently of the
choice of coordinates on $\Omega$. This will be established in §14; see (14.15)-(14.19).

In the next section there will be a systematic approach to converting variational
problems to Hamiltonian systems.


Exercises 


1. We compare the length functional (11.2) to the energy functional,


$E(\gamma) = \frac{1}{2}\int_a^b \|\gamma'(t)\|^2\,dt$.

Show that, if $\|\gamma_0'(t)\| = c_0$ is constant, then


$\frac{d}{ds}L(\gamma_s)\big|_{s=0} = \frac{1}{c_0}\frac{d}{ds}E(\gamma_s)\big|_{s=0}$.


Hence a constant-speed geodesic is also a critical point of the energy functional.


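In Exercises 2-4, we extend the setting of the first approach to geodesics, in (11.3)-(11.14),
from an $n$-dimensional surface $S \subset \mathbb{R}^{n+1}$ to an $n$-dimensional surface
$S \subset \mathbb{R}^k$, $k > n$.

2. Suppose $\gamma_s : [a,b] \to S$, $\gamma_s(a) = p$, $\gamma_s(b) = q$. Show that


$\frac{d}{ds}E(\gamma_s)\big|_{s=0} = -\int_a^b V(t)\cdot\gamma''(t)\,dt$,


where


$V(t) = \frac{\partial}{\partial s}\gamma_s(t)\big|_{s=0} \in T_{\gamma(t)}S$.


Deduce that $\gamma = \gamma_0$ is a critical point of the energy functional if and only if, for each
$t \in (a,b)$,


(11.49) $\gamma''(t) \perp V(t)$, for each $V(t) \in T_{\gamma(t)}S$.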


3. Show that, whenever (11.49) holds, for all $t \in (a,b)$, it follows that $\|\gamma'(t)\|$ is constant.
Hence each critical point of the energy functional is a constant-speed geodesic.


Hint. $\partial_t\langle\gamma'(t),\gamma'(t)\rangle = 2\langle\gamma''(t),\gamma'(t)\rangle$.
4. Define $P : S \to M(k,\mathbb{R})$ by


(11.50) $P(x)$ = orthogonal projection of $\mathbb{R}^k$ onto $T_xS$.
Then the criticality condition of Exercise 2 is

(11.51) $P(\gamma(t))\gamma''(t) = 0$, $\forall\, t \in (a,b)$.
Bringing in $P^\perp(x) = I - P(x)$ and noting that

(11.52) $P^\perp(\gamma(t))\gamma'(t) = 0$,
derive the geodesic equation

(11.53) $\gamma''(t) + \bigl[DP^\perp(\gamma(t))\gamma'(t)\bigr]\gamma'(t) = 0$,


where
$DP^\perp(x) : T_xS \to M(k,\mathbb{R})$.


Hint. To start, apply $d/dt$ to (11.52). Then add the result to (11.51).
5. Suppose $S$ is a smooth $n$-dimensional surface in $\mathbb{R}^k$, with the induced Riemannian
metric, and $\nabla^S$ is its Levi-Civita connection. Show that, for $X$ and $Y$ tangent to $S$,


(11.54) $\nabla^S_X Y = P(x)D_X Y$, at $x \in S$,
where $D_X$ acts componentwise on $Y$ and $P(x)$ is the orthogonal projection (11.50).
Hint. Verify that the right side of (11.54) satisfies the conditions (11.18)-(11.21) that
define the Levi-Civita connection.

6. Suppose $\mathrm{Exp}_p : B_a \to M$ is a diffeomorphism of $B_a = \{v \in T_pM : \|v\| < a\}$ onto
its image, $\mathcal{B}$. Use the Gauss lemma to show that, for each $q \in \mathcal{B}$, $q = \mathrm{Exp}_p(w)$, the
curve $\gamma(t) = \mathrm{Exp}_p(tw)$, $0 \le t \le 1$, is the unique shortest path from $p$ to $q$. If $\mathrm{Exp}_p$ is
defined on $B_a$ but is not a diffeomorphism, show that this conclusion does not hold.

7. Let $M$ be a connected Riemannian manifold. Define $d(p,q)$ to be the infimum of
lengths of smooth curves from $p$ to $q$. Show that this makes $M$ a metric space.

8. Let $p,q \in M$, and suppose there exists a Lipschitz curve $\gamma : [a,b] \to M$, $\gamma(a) = p$,
$\gamma(b) = q$, parameterized by arc length, of length equal to $d(p,q)$. Show that $\gamma$ is a
$C^\infty$-curve. (Hint: Make use of Exercise 6.)

9. Let $M$ be a connected Riemannian manifold that, with the metric of Exercise 7, is
compact. Show that any $p,q \in M$ can be joined by a geodesic of length $d(p,q)$.

(Hint: Let $\gamma_k : [0,1] \to M$, $\gamma_k(0) = p$, $\gamma_k(1) = q$ be constant-speed curves of lengths
$L_k \to d(p,q)$. Use Ascoli's theorem to produce a Lipschitz curve of length $d(p,q)$ as a
uniform limit of a subsequence of these.)

10. Try to extend the result of Exercise 9 to the case where M is assumed to be complete, 
rather than compact. 

11. Verify that the definition of $\nabla_X$ given by (11.22) does indeed provide a Levi-Civita
connection, having properties (11.18)-(11.21).
(Hint: For example, if you interchange the roles of $Y$ and $Z$ in (11.22), and add it to the
resulting formula for $2\langle Y, \nabla_X Z\rangle$, you can cancel all the terms on the right side except
$X\langle Y,Z\rangle + X\langle Z,Y\rangle$; this gives (11.20).)

The calculus of variations consists of the study of stationary points (e.g., maxima
and minima) of a real-valued function that is defined on some space of functions.
Here, we let $M$ be a region in $\mathbb{R}^n$, or more generally an $n$-dimensional
manifold, fix two points $p,q \in M$ and an interval $[a,b] \subset \mathbb{R}$, and consider a
space of functions $P$ consisting of smooth curves $u : [a,b] \to M$ satisfying
$u(a) = p$, $u(b) = q$. We consider functions $I : P \to \mathbb{R}$ of the form


(12.1) $I(u) = \int_a^b F(u(t),\dot{u}(t))\,dt$.


Here $F(x,v)$ is a smooth function on the tangent bundle $TM$, or perhaps on some
open subset of $TM$. By definition, the condition for $I$ to be stationary at $u$ is that


(12.2) $\frac{d}{ds}I(u_s)\big|_{s=0} = 0$


for any smooth family $u_s$ of elements of $P$ with $u_0 = u$. Note that

(12.3) $\frac{d}{ds}u_s(t)\big|_{s=0} = w(t)$
defines a tangent vector to $M$ at $u(t)$, and precisely those tangent vectors $w(t)$
vanishing at $t = a$ and at $t = b$ arise from making some variation of $u$ within $P$.

As in the last section, we can compute the left side of (12.2) by differentiating
under the integral, and obtaining a formula for this involves considering
$t$-derivatives of $w$. Recall the two approaches to this taken in §11. Here we will
emphasize the second approach, since the data at hand do not generally pick out
some distinguished covariant derivative on $M$. Thus we work in local coordinates
on $M$. Since any smooth curve on $M$ can be enclosed by a single coordinate
patch, this involves no loss of generality. Then, given (12.3), we have


(12.4) $\frac{d}{ds}I(u_s)\big|_{s=0} = \int_a^b \bigl[F_x(u,\dot{u})w + F_v(u,\dot{u})\dot{w}\bigr]\,dt$.


Integrating the last term by parts and recalling that $w(a)$ and $w(b)$ vanish, we see
that this is equal to


(12.5) $\int_a^b \bigl[F_x(u,\dot{u}) - \frac{d}{dt}F_v(u,\dot{u})\bigr]w\,dt$.


It follows that the condition for $u$ to be stationary is precisely that $u$ satisfy the
equation


(12.6) $\frac{d}{dt}F_v(u,\dot{u}) - F_x(u,\dot{u}) = 0$,


a second-order ODE, called Lagrange's equation. Written more fully, it is

(12.7) $F_{vv}(u,\dot{u})\ddot{u} + F_{vx}(u,\dot{u})\dot{u} - F_x(u,\dot{u}) = 0$,

where $F_{vv}$ is the $n \times n$ matrix of second-order $v$-derivatives of $F(x,v)$, acting
on the vector $\ddot{u}$, etc. This is a nonsingular system as long as $F(x,v)$ satisfies the
condition


(12.8) $F_{vv}(x,v)$ is invertible,


as an $n \times n$ matrix, for each $(x,v) = (u(t),\dot{u}(t))$, $t \in [a,b]$.
The ODE (12.6) suggests a particularly important role for


(12.9) $\xi = F_v(x,v)$.


Then, for $(x,v) = (u,\dot{u})$, we have
(12.10) $\dot{\xi} = F_x(x,v)$, $\dot{x} = v$.
We claim that this system, in $(x,\xi)$-coordinates, is in Hamiltonian form. Note that
$(x,\xi)$ gives a local coordinate system under the hypothesis (12.8), by the inverse
function theorem. In other words, we will produce a function $E(x,\xi)$ such that
(12.10) is the same as
(12.11) $\dot{x} = E_\xi$, $\dot{\xi} = -E_x$,
so the goal is to construct $E(x,\xi)$ such that


(12.12) $E_x(x,\xi) = -F_x(x,v)$, $E_\xi(x,\xi) = v$,


when $v = v(x,\xi)$ is defined by inverting the transformation


(12.13) $(x,\xi) = (x, F_v(x,v)) = \lambda(x,v)$.
If we set
(12.14) $E^b(x,v) = E(\lambda(x,v))$,


then (12.12) is equivalent to
(12.15) $E^b_x(x,v) = -F_x + vF_{vx}$, $E^b_v(x,v) = vF_{vv}$,
as follows from the chain rule. This calculation is most easily performed using
differential forms, details on which can be found in the next section; in the differential
form notation, our task is to find $E^b(x,v)$ such that


(12.16) $dE^b = (-F_x + vF_{vx})\,dx + vF_{vv}\,dv$.

It can be seen by inspection that this identity is satisfied by

(12.17) $E^b(x,v) = F_v(x,v)v - F(x,v)$.

Thus the ODE (12.7) describing a stationary point for (12.1) has been converted to
a first-order Hamiltonian system, in the $(x,\xi)$-coordinates, given the hypothesis
(12.8) on $F_{vv}$. In view of (12.13), one often writes (12.17) informally as


$E(x,\xi) = \xi\cdot v - F(x,v)$.
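
To make (12.9) and (12.17) concrete, here is a small Python sketch for one degree of freedom. The function name `legendre_energy`, the finite-difference step, and the sample integrand (a mass on a spring, with hypothetical constants `m` and `k`) are assumptions introduced only for this illustration.

```python
def legendre_energy(F, x, v, h=1e-5):
    """Return (xi, E) where xi = F_v(x, v), as in (12.9), and
    E = xi * v - F(x, v), as in (12.17); F_v is a central difference."""
    xi = (F(x, v + h) - F(x, v - h)) / (2 * h)
    return xi, xi * v - F(x, v)

m, k = 2.0, 3.0   # hypothetical mass and spring constant

def F(x, v):
    """Sample integrand: kinetic minus potential energy."""
    return 0.5 * m * v * v - 0.5 * k * x * x

xi, E = legendre_energy(F, x=0.4, v=1.5)
# Here xi = m * v, and E agrees with xi**2 / (2 * m) + 0.5 * k * x**2.
```

In this example $F_{vv} = m \neq 0$, so hypothesis (12.8) holds and $v = \xi/m$ can be recovered from $\xi$, which is what allows $E$ to be regarded as a function of $(x,\xi)$.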
We make some observations about the transformation $\lambda$ of (12.13). If $v \in T_xM$,
then $F_v(x,v)$ acts naturally as a linear functional on $T_xM$. In other words,
$\xi = F_v(x,v)$ is naturally regarded as an element of $T_x^*M$, in the cotangent bundle
of $M$; it makes invariant sense to regard
(12.18) $\lambda : TM \to T^*M$


(if $F$ is defined on all of $TM$). This map is called the Legendre transformation.
As we have already noted, the hypothesis (12.8) is equivalent to the statement that
$\lambda$ is a local diffeomorphism.

As an example, suppose $M$ has a Riemannian metric $g$ and


$F(x,v) = \frac{1}{2}g(v,v)$.


Then the map (12.18) is the identification of $TM$ and $T^*M$ associated with "lowering
indices," using the metric tensor $g_{jk}$. A straightforward calculation gives, in
this case, $E(x,\xi)$ equal to half the natural square norm on cotangent vectors. On
the other hand, the function $F(x,v) = \sqrt{g(v,v)}$ fails to satisfy the hypothesis
(12.8). Since this is the integrand for arc length, it is important to incorporate this
case into our analysis. Recall from the previous section that obtaining equations
for a geodesic involves parameterizing a curve by arc length. We now look at the
following more general situation.

We say $F(x,v)$ is homogeneous of degree $r$ in $v$ if $F(x,cv) = c^rF(x,v)$ for
$c > 0$. Thus $\sqrt{g(v,v)}$ above is homogeneous of degree 1. When $F$ is homogeneous
of degree 1, hypothesis (12.8) is never satisfied. Furthermore, $I(u)$ is
independent of the parameterization of a curve in this case; if $\sigma : [a,b] \to [a,b]$ is
a diffeomorphism (fixing $a$ and $b$), then $I(u) = I(\tilde{u})$ for $\tilde{u}(t) = u(\sigma(t))$. Let us
look at a function $f(x,v)$ related to $F(x,v)$ by


(12.19) $f(x,v) = \psi(F(x,v))$, $F(x,v) = \varphi(f(x,v))$.


Given a family $u_s$ of curves as before, we can write


(12.20) $\frac{d}{ds}I(u_s)\big|_{s=0} = \int_a^b \varphi'(f(u,\dot{u}))\bigl[f_x(u,\dot{u})w + f_v(u,\dot{u})\dot{w}\bigr]\,dt$.

If $u$ satisfies the condition


(12.21) $f(u,\dot{u}) = c$,


with $c$ constant, this is equal to


(12.22) $c'\int_a^b \bigl[f_x(u,\dot{u}) - (d/dt)f_v(u,\dot{u})\bigr]w\,dt$,
with $c' = \varphi'(c)$. Of course, setting


(12.23) $J(u) = \int_a^b f(u,\dot{u})\,dt$,
we have

(12.24) $\frac{d}{ds}J(u_s)\big|_{s=0} = \int_a^b \bigl[f_x(u,\dot{u}) - \frac{d}{dt}f_v(u,\dot{u})\bigr]w\,dt$.


Consequently, if $u$ satisfies (12.21), then $u$ is stationary for $I$ if and only if $u$ is
stationary for $J$ (provided $\varphi'(c) \neq 0$).

It is possible that $f(x,v)$ satisfies (12.8) even though $F(x,v)$ does not, as the
case $F(x,v) = \sqrt{g(v,v)}$ illustrates. Note that


$f_{v_jv_k} = \psi'(F)F_{v_jv_k} + \psi''(F)F_{v_j}F_{v_k}$.


Let us specialize to the case $\psi(F) = F^2$, so $f(x,v) = F(x,v)^2$ is homogeneous
of degree 2. If $F$ is convex in $v$ and $(F_{v_jv_k})$, a positive-semidefinite matrix,
annihilates only radial vectors, and if $F > 0$, then $f(x,v)$ is strictly convex (i.e.,
$f_{vv}$ is positive-definite), and hence (12.8) holds for $f(x,v)$. This is the case when
$F(x,v) = \sqrt{g(v,v)}$ is the arc length integrand.

If $f(x,v) = F(x,v)^2$ satisfies (12.8), then the stationary condition for (12.23)
is that $u$ satisfy the ODE


$f_{vv}(u,\dot{u})\ddot{u} + f_{vx}(u,\dot{u})\dot{u} - f_x(u,\dot{u}) = 0$,


a nonsingular ODE for which we know there is a unique local solution, with
$u(a) = p$, $\dot{u}(a)$ given. We will be able to say that such a solution is also stationary
for (12.1) once we know that (12.21) holds, that is, $f(u,\dot{u})$ is constant.
Indeed, if $f(x,v)$ is homogeneous of degree 2, then $f_v(x,v)v = 2f(x,v)$, and
hence


(12.25) $e^b(x,v) = f_v(x,v)v - f(x,v) = f(x,v)$.


But since the equations for $u$ take Hamiltonian form in the coordinates $(x,\xi) =
(x, f_v(x,v))$, it follows that $e^b(u(t),\dot{u}(t))$ is constant for $u$ stationary, so (12.21)
does hold in this case.

There is a general principle, known as the stationary action principle, or
Hamilton's principle, for producing equations of mathematical physics. In this
set-up, the state of a physical system at a given time is described by a pair $(x,v)$,
position and velocity. One has a kinetic energy function $T(x,v)$ and a potential
energy function $V(x,v)$, determining the dynamics, as follows. Form the
difference
(12.26) $L(x,v) = T(x,v) - V(x,v)$,


known as the Lagrangian. Hamilton's principle states that a path $u(t)$ describing
the evolution of the state in this system is a stationary path for the action integral


(12.27) $I(u) = \int_a^b L(u,\dot{u})\,dt$.


In many important cases, the potential $V = V(x)$ is velocity independent and
$T(x,v)$ is a quadratic form in $v$; say $T(x,v) = (1/2)v\cdot G(x)v$ for a symmetric
matrix $G(x)$. In that case, we consider


(12.28) $L(x,v) = \frac{1}{2}v\cdot G(x)v - V(x)$.
Thus we have
(12.29) $\xi = L_v(x,v) = G(x)v$,


and the conserved quantity (12.17) becomes


(12.30) $E^b(x,v) = v\cdot G(x)v - \bigl[\frac{1}{2}v\cdot G(x)v - V(x)\bigr] = \frac{1}{2}v\cdot G(x)v + V(x)$,

which is the total energy $T(x,v) + V(x)$. Note that the nondegeneracy condition is
that $G(x)$ be invertible (in physical problems, $G(x)$ is typically positive-definite,
but see (19.20)); assuming this, we have


(12.31) $E(x,\xi) = \frac{1}{2}\xi\cdot G(x)^{-1}\xi + V(x)$,


whose Hamiltonian vector field defines the dynamics. Note that, in this case,
Lagrange's equation (12.6) takes the form


(12.32) $\frac{d}{dt}\bigl[G(u)\dot{u}\bigr] = \frac{1}{2}\dot{u}\cdot G_x(u)\dot{u} - V_x(u)$,

which can be rewritten as


(12.33) $\ddot{u} + \Gamma\dot{u}\dot{u} + G(u)^{-1}V_x(u) = 0$,


where $\Gamma\dot{u}\dot{u}$ is a vector whose $\ell$th component is $\Gamma^\ell{}_{jk}\dot{u}^j\dot{u}^k$, with $\Gamma^\ell{}_{jk}$ the connection
coefficients defined by (11.29) with $(g_{jk}) = G(x)$. In other words, (12.33)
generalizes the geodesic equation for the Riemannian metric $(g_{jk}) = G(x)$, which
is what would arise in the case $V = 0$.

We refer to [Ar] and [Go] for a discussion of the relation of Hamilton’s princi- 
ple to other formulations of the laws of Newtonian mechanics, but we will briefly 
illustrate it here with a couple of examples. 

Consider the basic case of motion of a particle in Euclidean space $\mathbb{R}^n$, in the
presence of a force field of potential type $F(x) = -\operatorname{grad} V(x)$, as in the beginning
of §10. Then


(12.34) $T(x,v) = \frac{1}{2}m|v|^2$, $V(x,v) = V(x)$.


This is of course the special case of (12.28) with $G(x) = mI$, and the ODE
satisfied by stationary paths for (12.27) hence has the form


(12.35) $m\ddot{u} + V_x(u) = 0$,
precisely the equation (10.2) expressing Newton's law $F = ma$.

Next we consider one example where Cartesian coordinates are not used,
namely the motion of a pendulum (Fig. 12.1). We suppose a mass $m$ is at the
end of a (massless) rod of length $\ell$, swinging under the influence of gravity. In
this case, we can express the potential energy as
(12.36) $V(\theta) = -mg\ell\cos\theta$,
where $\theta$ is the angle the rod makes with the downward vertical ray, and $g$ denotes
the strength of gravity. The speed of the mass at the end of the pendulum is $\ell|\dot{\theta}|$,
so the kinetic energy is


(12.37) $T(\theta,\dot{\theta}) = \frac{1}{2}m\ell^2|\dot{\theta}|^2$.

In this case we see that Hamilton's principle leads to the ODE
(12.38) $\ell\ddot{\theta} + g\sin\theta = 0$,


describing the motion of a pendulum.
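
The following Python sketch integrates (12.38) numerically and evaluates the total energy $T + V$ from (12.36)-(12.37), which by (12.30) should stay constant along solutions. The parameter values, the function names, and the choice of a fourth-order Runge-Kutta stepper are assumptions made for the illustration.

```python
import math

G_GRAVITY = 9.81   # assumed strength of gravity
LENGTH = 1.0       # assumed rod length
MASS = 1.0         # assumed mass

def pendulum_rhs(state):
    """First-order form of (12.38): theta' = omega, omega' = -(g / l) sin(theta)."""
    theta, omega = state
    return (omega, -(G_GRAVITY / LENGTH) * math.sin(theta))

def rk4_step(state, dt):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(base, slope, scale):
        return tuple(b + scale * s for b, s in zip(base, slope))
    k1 = pendulum_rhs(state)
    k2 = pendulum_rhs(shifted(state, k1, 0.5 * dt))
    k3 = pendulum_rhs(shifted(state, k2, 0.5 * dt))
    k4 = pendulum_rhs(shifted(state, k3, dt))
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def integrate(state, t_final, steps):
    """Advance (theta, omega) from time 0 to t_final."""
    dt = t_final / steps
    for _ in range(steps):
        state = rk4_step(state, dt)
    return state

def energy(state):
    """Total energy T + V, with T from (12.37) and V from (12.36)."""
    theta, omega = state
    return (0.5 * MASS * LENGTH ** 2 * omega ** 2
            - MASS * G_GRAVITY * LENGTH * math.cos(theta))

start = (1.0, 0.0)                       # released from rest at 1 radian
end = integrate(start, 10.0, 10000)
energy_drift = abs(energy(end) - energy(start))
```

For small amplitudes the motion is close to that of the linearized equation $\ell\ddot{\theta} + g\theta = 0$, whose period is $2\pi\sqrt{\ell/g}$; integrating over that time from a small initial angle returns nearly to the starting state.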


FIGURE 12.1 The Pendulum 
Next we consider a very important physical problem that involves a velocity-dependent
force, leading to a Lagrangian of a form different from (12.28), namely
the (nonrelativistic) motion of a charged particle (with charge $e$) in an electromagnetic
field $(E,B)$. One has Newton's law


(12.39) $m\frac{dv}{dt} = F$,
where $v = dx/dt$ and $F$ is the Lorentz force, given by


(12.40) $F = e(E + v\times B)$.


Certainly $F$ here is not of the form $-\nabla V(x)$. To construct a replacement for the
potential $V$, one makes use of two of Maxwell's equations for $E$ and $B$:


(12.41) $\operatorname{curl} E = -\frac{\partial B}{\partial t}$, $\operatorname{div} B = 0$,


in units where the speed of light is 1. We will return to Maxwell's equations later
on. As we will show in §19, these equations imply the existence of a real-valued
$\varphi(t,x)$ and a vector-valued $A(t,x)$ such that


(12.42) $B = \operatorname{curl} A$, $E = -\operatorname{grad}\varphi - \frac{\partial A}{\partial t}$.


Given these quantities, we set

(12.43) $V(x,v) = e(\varphi - A\cdot v)$,

and use the Lagrangian $L = T - V$, with $T = (1/2)m|v|^2$. We have
$L_v = mv + eA$, $L_x = -e\varphi_x + e\operatorname{grad}(A\cdot v)$.


Consequently, $(d/dt)L_v = m\,dv/dt + e\,\partial A/\partial t + eA_xv$. Using (12.42), we can
obtain


(12.44) $\frac{d}{dt}L_v - L_x = m\frac{dv}{dt} - e(E + v\times\operatorname{curl} A)$,


showing that Lagrange's equation


(12.45) $\frac{d}{dt}L_v - L_x = 0$


is indeed equivalent to (12.39)-(12.40).
If the electromagnetic field varies with $t$, then the Lagrangian $L$ produced by
(12.43) has explicit $t$-dependence:


(12.46) $L = L(t,x,v)$.


The equation (12.45) is still the stationary condition for the integral


(12.47) $I(u) = \int_a^b L(t,u(t),\dot{u}(t))\,dt$,


as in (12.6). Of course, instead of (12.7), we have
(12.48) $L_{vv}(t,u,\dot{u})\ddot{u} + L_{vx}(t,u,\dot{u})\dot{u} - L_x(t,u,\dot{u}) + L_{tv}(t,u,\dot{u}) = 0$.


Finally, we note that for this Lorentz force the Legendre transformation (12.13)
is given by


(12.49) $(x,\xi) = (x, mv + eA)$,


and hence the Hamiltonian function $E(x,\xi)$ as in (12.11) is given by

(12.50) $E(x,\xi) = \frac{1}{2m}|\xi - eA|^2 + e\varphi$.
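
As a check on (12.49)-(12.50), here is a Python sketch that integrates Hamilton's equations (12.11) for a charge moving in the plane under a uniform magnetic field $B$ perpendicular to the plane, with $\varphi = 0$. The vector potential $A = (-By/2, Bx/2)$, the unit values of $e$, $m$, $B$, and all function names are assumptions chosen for the example. With these choices the Lorentz force (12.40) produces circular motion with period $2\pi m/(eB)$.

```python
import math

E_CHARGE = 1.0   # assumed charge e
MASS = 1.0       # assumed mass m
B_FIELD = 1.0    # assumed uniform field strength, normal to the plane

def velocity(state):
    """v = (xi - e A) / m, inverting (12.49), with A = (-B y / 2, B x / 2)."""
    x, y, px, py = state
    vx = (px + 0.5 * E_CHARGE * B_FIELD * y) / MASS
    vy = (py - 0.5 * E_CHARGE * B_FIELD * x) / MASS
    return vx, vy

def hamiltonian(state):
    """E(x, xi) = |xi - e A|^2 / (2 m), i.e. (12.50) with phi = 0."""
    vx, vy = velocity(state)
    return 0.5 * MASS * (vx * vx + vy * vy)

def rhs(state):
    """Hamilton's equations (12.11): x' = E_xi, xi' = -E_x, worked out by hand."""
    vx, vy = velocity(state)
    return [vx, vy,
            0.5 * E_CHARGE * B_FIELD * vy,
            -0.5 * E_CHARGE * B_FIELD * vx]

def rk4_step(state, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = rhs(state)
    k2 = rhs([s + 0.5 * dt * k for s, k in zip(state, k1)])
    k3 = rhs([s + 0.5 * dt * k for s, k in zip(state, k2)])
    k4 = rhs([s + dt * k for s, k in zip(state, k3)])
    return [s + dt * (a + 2 * b + 2 * c + d) / 6.0
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)]

def integrate(state, t_final, steps):
    dt = t_final / steps
    for _ in range(steps):
        state = rk4_step(state, dt)
    return state

# Start at x = (1, 0) with velocity (0, -1); then xi = m v + e A = (0, -0.5).
start = [1.0, 0.0, 0.0, -0.5]
period = 2 * math.pi * MASS / (E_CHARGE * B_FIELD)
end = integrate(start, period, 4000)
```

Note that the conserved quantity is expressed through $\xi - eA = mv$, not through $\xi$ alone; the canonical momentum $\xi$ depends on the choice of $A$, while the velocity does not.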


A treatment of the relativistic motion of a charged particle in an electro- 
magnetic field (which in an important sense is cleaner than the nonrelativistic 
treatment) is given in §19. 

Hamilton’s principle can readily be extended to produce partial differential 
equations, describing the motion of continua, such as vibrating strings, moving 
fluids, and numerous other important phenomena. Some of these results will be 
discussed in the beginning of Chap. 2, and others in various subsequent chapters. 

We end this section by noting that Lagrange's equation (12.6) depends on the
choice of a coordinate system. We can write down an analogue of (12.6), which
depends on a choice of Riemannian metric on $M$, but not on a coordinate system.

Thus, let $M$ be a Riemannian manifold, and denote by $\nabla$ the Levi-Civita connection
constructed in §11. If we have a family of curves in $M$, that is, a map


(12.51) $u : I\times I \to M$, $u = u(t,s)$,


with velocity $u_t : I\times I \to TM$, we can write


(12.52) $I(s) = \int_a^b F(u_t(t,s))\,dt$,
for a given $F : TM \to \mathbb{R}$. We have

(12.53) $I'(s) = \int_a^b DF(u_t(t,s))\,\partial_su_t\,dt$.


Note that $DF(u_t)$ acts on $\partial_su_t \in T_{u_t}(TM)$. Now, given $v \in TM$, we can write
(12.54) $T_v(TM) = V_v(TM) \oplus H_v(TM)$.


Here the "vertical" space $V_v(TM)$ is simply $T_v(T_{\pi(v)}M)$, where $\pi : TM \to M$
is the usual projection. The "horizontal" space $H_v(TM)$ is a complementary
space, isomorphic to $T_{\pi(v)}M$, defined as follows.

For any smooth curve $\gamma$ on $M$, such that $\gamma(0) = x = \pi(v)$, let $V(t) \in T_{\gamma(t)}M$
be given by parallel translation of $v$ along $\gamma$, that is, if $T = \gamma'(t)$, $V$ solves
$\nabla_TV = 0$, $V(0) = v$. Thus $V(t)$ is a curve in $TM$, and $V(0) = v$. The map
$\gamma'(0) \mapsto V'(0)$ is an injective linear map of $T_{\pi(v)}M$ into $T_v(TM)$, whose range
we call $H_v(TM)$. One might compare the construction in §6 of Appendix C,
Connections and Curvature. Thus we have both the decomposition (12.54) and
the isomorphisms


(12.55) $V_v(TM) \approx T_{\pi(v)}M$, $H_v(TM) \approx T_{\pi(v)}M$.
The first isomorphism is canonical. The second isomorphism is simply the restriction
of $D\pi : T_v(TM) \to T_{\pi(v)}M$ to the subspace $H_v(TM)$.
The splitting (12.54) gives


(12.56) $DF(v)(\partial_su_t) = \langle F_v(v), (\partial_su_t)_{\rm vert}\rangle + \langle F_x(v), (\partial_su_t)_{\rm horiz}\rangle$,
where we use this to define
(12.57) $F_v(v) \in T_{\pi(v)}M \approx V_v(TM)$, $F_x(v) \in T_{\pi(v)}M \approx H_v(TM)$.


If we set $v = u_t$, $w = u_s$, we have


(12.58) $I'(s) = \int_a^b \bigl[\langle F_v(v), \nabla_vw\rangle + \langle F_x(v), w\rangle\bigr]\,dt$.


Parallel to (11.24)-(11.26), we have


(12.59) $\int_a^b \langle F_v(u_t), \nabla_vw\rangle\,dt = -\int_a^b \langle\nabla_vF_v(u_t), w\rangle\,dt$,


where to apply $\nabla_v$ we regard $F_v(u_t)$ as a vector field defined over the curve
$t \mapsto u(t,s)$ in $M$. Hence the stationary condition that $I'(0) = 0$ for all variations
of $u(t) = u(t,0)$ takes the form
(12.60) $\nabla_{\dot{u}}F_v(\dot{u}) - F_x(\dot{u}) = 0$.


Note that if $v(s)$ is a smooth curve in $TM$, with $\pi(v(s)) = u(s)$ and $u'(s) =
w(s)$, then, under the identification in (12.55),


(12.61) $v'(s)_{\rm vert} = \nabla_wv$, $v'(s)_{\rm horiz} = w$.
Then, for smooth $F : TM \to \mathbb{R}$,


(12.62) $\frac{d}{ds}F(v(s)) = \langle F_v(v), \nabla_wv\rangle + \langle F_x(v), w\rangle$.
In particular,
(12.63) $F(v) = \langle v,v\rangle \Longrightarrow F_v(v) = 2v$ and $F_x(v) = 0$.


Thus, for this function $F(v)$, the Lagrange equation (12.60) becomes the
geodesic equation $\nabla_vv = 0$, as expected. If, parallel to (12.28), we take
$L(v) = (1/2)\langle v,v\rangle - V(x)$, $x = \pi(v)$, then

(12.64) $L_v(v) = v$, $L_x(v) = -\operatorname{grad} V(x)$,


where $\operatorname{grad} V(x)$ is the vector field on $M$ defined by $\langle\operatorname{grad} V(x), W\rangle =
\mathcal{L}_WV(x)$. The Lagrange equation becomes


(12.65) $\nabla_{\dot{u}}\dot{u} + \operatorname{grad} V(u) = 0$,


in agreement with (12.33).


Exercises 
1. Suppose that, more generally than (12.28), we have a Lagrangian of the form

$L(x,v) = \frac{1}{2}v\cdot G(x)v + A(x)\cdot v - V(x)$.
Show that (12.30) continues to hold, that is,

$E^b(x,v) = \frac{1}{2}v\cdot G(x)v + V(x)$,
and that the Hamiltonian function becomes, in place of (12.31),

$E(x,\xi) = \frac{1}{2}(\xi - A(x))\cdot G(x)^{-1}(\xi - A(x)) + V(x)$.


Work out the modification to (12.33) when the extra term $A(x)\cdot v$ is included. Relate
this to the discussion of the motion in an electromagnetic field in (12.39)-(12.50).


FIGURE 12.2 The Double Pendulum (masses $m_1$ and $m_2$)


2. Work out the differential equations for a planar double pendulum, in the spirit of
(12.36)-(12.38). See Fig. 12.2. (Hint: To compute kinetic and potential energy, think
of the plane as the complex plane, with the real axis pointing down. The position of
particle 1 is $\ell_1e^{i\theta_1}$ and that of particle 2 is $\ell_1e^{i\theta_1} + \ell_2e^{i\theta_2}$.)


3. After reading §19, show that the identity $F = dA$ in (19.19) implies the identity
(12.42), with $A = \varphi\,dx_0 + \sum_{j=1}^3 A_j\,dx_j$.


4. If $A(x)$ is a vector field on $\mathbb{R}^3$ and $v$ is a constant vector, show that


$\operatorname{grad}(v\cdot A) = \nabla_vA + v\times\operatorname{curl} A$.


Use this to verify (12.44). How is the formula above modified if $v = v(x)$ is a function
of $x$? Reconsider this last question after looking at the exercises following §8 of Chap. 5.


5. The statement before (12.4), that any smooth curve $u(s)$ on $M$ can be enclosed by a
single coordinate patch, is not strictly accurate, as the curve may have self-intersections.
Give a more precise statement.


13. Differential forms 


It is very desirable to be able to make constructions that depend as little as possible 
on a particular choice of coordinate system. The calculus of differential forms, 
whose study we now take up, is one convenient set of tools for this purpose. 


We start with the notion of a 1-form. It is an object that is integrated over a
curve; formally, a 1-form on $\Omega \subset \mathbb{R}^n$ is written


(13.1) $\alpha = \sum_j a_j(x)\,dx_j$.


If $\gamma : [a,b] \to \Omega$ is a smooth curve, we set


(13.2) $\int_\gamma \alpha = \int_a^b \sum_j a_j(\gamma(t))\gamma_j'(t)\,dt$.
In other words,
(13.3) $\int_\gamma \alpha = \int_I \gamma^*\alpha$,


where $I = [a,b]$ and $\gamma^*\alpha = \sum_j a_j(\gamma(t))\gamma_j'(t)\,dt$ is the pull-back of $\alpha$ under the
map $\gamma$. More generally, if $F : O \to \Omega$ is a smooth map ($O \subset \mathbb{R}^m$ open), the
pull-back $F^*\alpha$ is a 1-form on $O$ defined by


(13.4) $F^*\alpha = \sum_{j,k} a_j(F(y))\frac{\partial F_j}{\partial y_k}\,dy_k$.


The usual change of variable for integrals gives


(13.5) $\int_\gamma \alpha = \int_\sigma F^*\alpha$


if $\gamma$ is the curve $F\circ\sigma$.
If $F : O \to \Omega$ is a diffeomorphism, and


(13.6) $X = \sum_j b^j(x)\frac{\partial}{\partial x_j}$


is a vector field on $\Omega$, recall that we have the vector field on $O$:
(13.7) $F_\#X(y) = \bigl(DF^{-1}(p)\bigr)X(p)$, $p = F(y)$.


If we define a pairing between 1-forms and vector fields on $\Omega$ by


(13.8) $\langle X,\alpha\rangle = \sum_j b^j(x)a_j(x) = b\cdot a$,


a simple calculation gives
(13.9) $\langle F_\#X, F^*\alpha\rangle = \langle X,\alpha\rangle\circ F$.
Thus, a 1-form on $\Omega$ is characterized at each point $p \in \Omega$ as a linear transformation
of vectors at $p$ to $\mathbb{R}$.

More generally, we can regard a $k$-form $\alpha$ on $\Omega$ as a $k$-multilinear map on
vector fields:


(13.10) $\alpha(X_1,\dots,X_k) \in C^\infty(\Omega)$;


we impose the further condition of antisymmetry:
(13.11) $\alpha(X_1,\dots,X_j,\dots,X_\ell,\dots,X_k) = -\alpha(X_1,\dots,X_\ell,\dots,X_j,\dots,X_k)$.
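We use a special notation for $k$-forms: If $1 \le j_1 < \cdots < j_k \le n$, $j =
(j_1,\dots,j_k)$, we set

(13.12) $\alpha = \sum_j a_j(x)\,dx_{j_1}\wedge\cdots\wedge dx_{j_k}$,

where

(13.13) $a_j(x) = \alpha(D_{j_1},\dots,D_{j_k})$, $D_j = \partial/\partial x_j$.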

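More generally, we assign meaning to (13.12) summed over all $k$-indices $(j_1,\dots,
j_k)$, where we identify


(13.14) $dx_{j_1}\wedge\cdots\wedge dx_{j_k} = (\operatorname{sgn}\sigma)\,dx_{j_{\sigma(1)}}\wedge\cdots\wedge dx_{j_{\sigma(k)}}$,


$\sigma$ being a permutation of $\{1,\dots,k\}$. If any $j_m = j_\ell$ ($m \neq \ell$), then (13.14)
vanishes. A common notation for the statement that $\alpha$ is a $k$-form on $\Omega$ is


(13.15) $\alpha \in \Lambda^k(\Omega)$.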
In particular, we can write a 2-form $\beta$ as
(13.16) $\beta = \sum_{j,k} b_{jk}(x)\,dx_j\wedge dx_k$


and pick coefficients satisfying $b_{jk}(x) = -b_{kj}(x)$. According to (13.12) and
(13.13), if we set $U = \sum u^j(x)\,\partial/\partial x_j$ and $V = \sum v^j(x)\,\partial/\partial x_j$, then


(13.17) $\beta(U,V) = 2\sum_{j,k} b_{jk}(x)u^j(x)v^k(x)$.
If $b_{jk}$ is not required to be antisymmetric, one gets $\beta(U,V) = \sum(b_{jk} - b_{kj})u^jv^k$.
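
The following Python sketch evaluates a constant-coefficient 2-form on a pair of vectors, using the rule $(dx_j\wedge dx_k)(U,V) = u^jv^k - u^kv^j$ that underlies (13.17). The function name `two_form` and the sample coefficients are assumptions made for the illustration.

```python
def two_form(b, u, v):
    """beta(U, V) for beta = sum_{j,k} b[j][k] dx_j ^ dx_k, using
    (dx_j ^ dx_k)(U, V) = u[j] * v[k] - u[k] * v[j]."""
    n = len(u)
    return sum(b[j][k] * (u[j] * v[k] - u[k] * v[j])
               for j in range(n) for k in range(n))

def bilinear(b, u, v):
    """The plain double sum: sum_{j,k} b[j][k] u[j] v[k]."""
    n = len(u)
    return sum(b[j][k] * u[j] * v[k] for j in range(n) for k in range(n))

# Antisymmetric coefficients, as assumed in (13.17).
b = [[0, 1, -2],
     [-1, 0, 3],
     [2, -3, 0]]
u = [1, 2, 3]
v = [1, 0, 2]

value = two_form(b, u, v)          # equals 2 * bilinear(b, u, v) here
swapped = two_form(b, v, u)        # antisymmetry (13.11): equals -value

# Coefficients that are not antisymmetric: only b_jk - b_kj matters.
c = [[0, 1], [0, 0]]
general = two_form(c, [1, 0], [0, 1])
```

The last call illustrates the remark following (13.17): the symmetric part of the coefficient matrix contributes nothing.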


If $F : O \to \Omega$ is a smooth map as above, we define the pull-back $F^*\alpha$ of a
$k$-form $\alpha$, given by (13.12), to be


(13.18) $F^*\alpha = \sum_j a_j(F(y))\,(F^*dx_{j_1})\wedge\cdots\wedge(F^*dx_{j_k})$,


where


(13.19) $F^*dx_j = \sum_\ell \frac{\partial F_j}{\partial y_\ell}\,dy_\ell$,
the algebraic computation in (13.18) being performed using the rule (13.14).
Extending (13.9), if $F$ is a diffeomorphism, we have


(13.20) $(F^*\alpha)(F_\#X_1,\dots,F_\#X_k) = \alpha(X_1,\dots,X_k)\circ F$.


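If $B = (b_{jk})$ is an $n\times n$ matrix, then, by (13.14),


(13.21) $\bigl(\sum_k b_{1k}\,dx_k\bigr)\wedge\bigl(\sum_k b_{2k}\,dx_k\bigr)\wedge\cdots\wedge\bigl(\sum_k b_{nk}\,dx_k\bigr)$
$= \bigl(\sum_\sigma (\operatorname{sgn}\sigma)\,b_{1\sigma(1)}b_{2\sigma(2)}\cdots b_{n\sigma(n)}\bigr)\,dx_1\wedge\cdots\wedge dx_n$
$= (\det B)\,dx_1\wedge\cdots\wedge dx_n$.


Hence, if $F : O \to \Omega$ is a $C^1$-map between two domains of dimension $n$, and
$\alpha = A(x)\,dx_1\wedge\cdots\wedge dx_n$ is an $n$-form on $\Omega$, then


(13.22) $F^*\alpha = \det DF(y)\,A(F(y))\,dy_1\wedge\cdots\wedge dy_n$.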
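
The permutation sum in (13.21) can be computed directly. The Python sketch below does so and applies it to the Jacobian matrix of the polar coordinate map $F(r,\theta) = (r\cos\theta, r\sin\theta)$, for which (13.22) gives the familiar factor $r$. The function names and the sample matrices are assumptions made for the illustration.

```python
import math
from itertools import permutations

def sign(perm):
    """sgn of a permutation of 0..n-1, computed by counting inversions."""
    inversions = sum(1 for i in range(len(perm))
                     for j in range(i + 1, len(perm)) if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def wedge_coefficient(B):
    """Coefficient of dx_1 ^ ... ^ dx_n in the wedge product of the rows of B,
    i.e. the sum over permutations appearing in (13.21)."""
    n = len(B)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for row in range(n):
            term *= B[row][perm[row]]
        total += term
    return total

B = [[2, 0, 1],
     [1, 3, 2],
     [1, 1, 4]]
coefficient = wedge_coefficient(B)     # this is det B

def polar_jacobian(r, theta):
    """DF for F(r, theta) = (r cos(theta), r sin(theta))."""
    return [[math.cos(theta), -r * math.sin(theta)],
            [math.sin(theta), r * math.cos(theta)]]

jacobian_factor = wedge_coefficient(polar_jacobian(2.0, 0.3))   # det DF = r
```

The sum has $n!$ terms, so this is useful for checking the identity in small dimensions rather than as a practical way to compute determinants.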


Comparison with the change-of-variable formula for multiple integrals suggests
that one has an intrinsic definition of $\int_\Omega \alpha$ when $\alpha$ is an $n$-form on $\Omega$, $n =
\dim\Omega$. To implement this, we need to take into account that $\det DF(y)$ rather than
$|\det DF(y)|$ appears in (13.21). We say that a smooth map $F : O \to \Omega$ between
two open subsets of $\mathbb{R}^n$ preserves orientation if $\det DF(y)$ is everywhere positive.
The object called an "orientation" on $\Omega$ can be identified as an equivalence
class of nowhere-vanishing $n$-forms on $\Omega$, where two such forms are equivalent if
one is a multiple of another by a positive function in $C^\infty(\Omega)$; the standard orientation
on $\mathbb{R}^n$ is determined by $dx_1\wedge\cdots\wedge dx_n$. If $S$ is an $n$-dimensional surface
in $\mathbb{R}^{n+k}$, an orientation on $S$ can also be specified by a nowhere-vanishing form
$\omega \in \Lambda^n(S)$. If such a form exists, $S$ is said to be orientable. The equivalence
class of positive multiples $a(x)\omega$ is said to consist of "positive" forms. A smooth
map $\psi : S \to M$ between oriented $n$-dimensional surfaces preserves orientation
provided $\psi^*\sigma$ is positive on $S$ whenever $\sigma \in \Lambda^n(M)$ is positive. If $S$ is oriented,
one can choose coordinate charts that are all orientation-preserving. Surfaces that
cannot be oriented also exist.

If $O, \Omega$ are open in $\mathbb{R}^n$ and $F : O \to \Omega$ is an orientation-preserving diffeomorphism,
we have


(13.23) $\int_O F^*\alpha = \int_\Omega \alpha$.


More generally, if $S$ is an $n$-dimensional manifold with an orientation, say the
image of an open set $O \subset \mathbb{R}^n$ by $\varphi : O \to S$, carrying the natural orientation of
$O$, we can set
(13.24) $\int_S \alpha = \int_O \varphi^*\alpha$


for an $n$-form $\alpha$ on $S$. If it takes several coordinate patches to cover $S$, define $\int_S \alpha$
by writing $\alpha$ as a sum of forms, each supported on one patch.

We need to show that this definition of $\int_S \alpha$ is independent of the choice of
coordinate system on $S$ (as long as the orientation of $S$ is respected). Thus, suppose
$\varphi : O \to U \subset S$ and $\psi : \Omega \to U \subset S$ are both coordinate patches, so that
$F = \psi^{-1}\circ\varphi : O \to \Omega$ is an orientation-preserving diffeomorphism. We need to
check that if $\alpha$ is an $n$-form on $S$, supported on $U$, then


(13.25) $\int_O \varphi^*\alpha = \int_\Omega \psi^*\alpha$.


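To see this, first note that, for any form $\alpha$ of any degree,
(13.26) $\psi\circ F = \varphi \Longrightarrow \varphi^*\alpha = F^*\psi^*\alpha$.


It suffices to check this for $\alpha = dx_j$. Then $\psi^*dx_j = \sum_\ell (\partial\psi_j/\partial x_\ell)\,dx_\ell$, by
(13.14), so


(13.27) $F^*\psi^*dx_j = \sum_{\ell,m} \frac{\partial F_\ell}{\partial x_m}\frac{\partial\psi_j}{\partial x_\ell}\,dx_m$, $\varphi^*dx_j = \sum_m \frac{\partial\varphi_j}{\partial x_m}\,dx_m$;


but the identity of these forms follows from the chain rule:


(13.28) $D\varphi = (D\psi)(DF) \Longrightarrow \frac{\partial\varphi_j}{\partial x_m} = \sum_\ell \frac{\partial\psi_j}{\partial x_\ell}\frac{\partial F_\ell}{\partial x_m}$.
Now that we have (13.26), we see that the left side of (13.25) is equal to
(13.29) $\int_O F^*(\psi^*\alpha)$,


which is equal to the right side of (13.25), by (13.23). Thus the integral of an
$n$-form over an oriented $n$-dimensional surface is well defined.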

Having discussed the notion of a differential form as something to be integrated,
we now consider some operations on forms. There is a wedge product, or
exterior product, characterized as follows. If $\alpha \in \Lambda^k(\Omega)$ has the form (13.12),
and if


(13.30) $\beta = \sum_i b_i(x)\,dx_{i_1}\wedge\cdots\wedge dx_{i_\ell} \in \Lambda^\ell(\Omega)$,
define
(13.31) $\alpha\wedge\beta = \sum_{j,i} a_j(x)b_i(x)\,dx_{j_1}\wedge\cdots\wedge dx_{j_k}\wedge dx_{i_1}\wedge\cdots\wedge dx_{i_\ell}$

in $\Lambda^{k+\ell}(\Omega)$. A special case of this arose in (13.18)-(13.21). We retain the equivalence
(13.14). It follows easily that
(13.32) $\alpha\wedge\beta = (-1)^{k\ell}\,\beta\wedge\alpha$.

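In addition, there is an interior product of $\alpha \in \Lambda^k(\Omega)$ with a vector field $X$ on
$\Omega$, producing $\iota_X\alpha = \alpha\rfloor X \in \Lambda^{k-1}(\Omega)$, defined by


(13.33) $(\alpha\rfloor X)(X_1,\dots,X_{k-1}) = \alpha(X, X_1,\dots,X_{k-1})$.
Consequently, if $\alpha = dx_{j_1}\wedge\cdots\wedge dx_{j_k}$, $D_i = \partial/\partial x_i$, then
(13.34) $\alpha\rfloor D_{j_\ell} = (-1)^{\ell-1}\,dx_{j_1}\wedge\cdots\wedge\widehat{dx_{j_\ell}}\wedge\cdots\wedge dx_{j_k}$,
where $\widehat{dx_{j_\ell}}$ denotes removing the factor $dx_{j_\ell}$. Furthermore,

$i \notin \{j_1,\dots,j_k\} \Longrightarrow \alpha\rfloor D_i = 0$.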


If $F : O \to \Omega$ is a diffeomorphism and $\alpha, \beta$ are forms and $X$ a vector field on
$\Omega$, it is readily verified that


(13.35) $F^*(\alpha\wedge\beta) = (F^*\alpha)\wedge(F^*\beta)$, $F^*(\alpha\rfloor X) = (F^*\alpha)\rfloor(F_\#X)$.
We make use of the operators $\wedge_k$ and $\iota_k$ on forms:

(13.36) $\wedge_k\alpha = dx_k\wedge\alpha$, $\iota_k\alpha = \alpha\rfloor D_k$.

There is the following useful anticommutation relation:

(13.37) $\wedge_k\iota_\ell + \iota_\ell\wedge_k = \delta_{k\ell}$,


where $\delta_{k\ell}$ is 1 if $k = \ell$, 0 otherwise. This is a fairly straightforward consequence
of (13.34). We also have


(13.38) $\wedge_j\wedge_k + \wedge_k\wedge_j = 0$, $\iota_j\iota_k + \iota_k\iota_j = 0$.
From (13.37) and (13.38) one says that the operators $\{\iota_j, \wedge_j : 1 \le j \le n\}$
generate a "Clifford algebra." For more on this, see Chap. 10.
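Another important operator on forms is the exterior derivative:


(13.39) $d : \Lambda^k(\Omega) \to \Lambda^{k+1}(\Omega)$,
defined as follows. If $\alpha \in \Lambda^k(\Omega)$ is given by (13.12), then


(13.40) $d\alpha = \sum_{j,\ell} \frac{\partial a_j}{\partial x_\ell}\,dx_\ell\wedge dx_{j_1}\wedge\cdots\wedge dx_{j_k}$.

Equivalently,
(13.41) $d\alpha = \sum_{\ell=1}^n \partial_\ell\wedge_\ell\alpha$,


where $\partial_\ell = \partial/\partial x_\ell$ and $\wedge_\ell$ is given by (13.36). The antisymmetry $dx_m\wedge dx_\ell =
-dx_\ell\wedge dx_m$, together with the identity $\partial^2a_j/\partial x_\ell\partial x_m = \partial^2a_j/\partial x_m\partial x_\ell$, implies


(13.42) $d(d\alpha) = 0$

for any differential form $\alpha$. We also have a product rule:

(13.43) $d(\alpha\wedge\beta) = (d\alpha)\wedge\beta + (-1)^k\alpha\wedge(d\beta)$, $\alpha \in \Lambda^k(\Omega)$, $\beta \in \Lambda^j(\Omega)$.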
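The exterior derivative has the following important property under pull-backs:

(13.44) $F^*(d\alpha) = dF^*\alpha$,


if $\alpha \in \Lambda^k(\Omega)$ and $F : O \to \Omega$ is a smooth map. To see this, extending (13.43) to
a formula for $d(\alpha\wedge\beta_1\wedge\cdots\wedge\beta_\ell)$ and using this to apply $d$ to $F^*\alpha$, we have


(13.45) $dF^*\alpha = \sum_{j,\ell} \frac{\partial}{\partial x_\ell}\bigl(a_j\circ F(x)\bigr)\,dx_\ell\wedge(F^*dx_{j_1})\wedge\cdots\wedge(F^*dx_{j_k})$
$\quad + \sum_{j,\nu} (\pm)\,a_j(F(x))\,(F^*dx_{j_1})\wedge\cdots\wedge d(F^*dx_{j_\nu})\wedge\cdots\wedge(F^*dx_{j_k})$.

Now


$d(F^*dx_i) = \sum_{j,\ell} \frac{\partial^2F_i}{\partial x_j\partial x_\ell}\,dx_j\wedge dx_\ell = 0$,

so only the first sum in (13.45) contributes to $dF^*\alpha$. Meanwhile,


(13.46) $F^*d\alpha = \sum_{j,m} \frac{\partial a_j}{\partial x_m}(F(x))\,(F^*dx_m)\wedge(F^*dx_{j_1})\wedge\cdots\wedge(F^*dx_{j_k})$,


so (13.44) follows from the identity
(13.47) $\sum_\ell \frac{\partial}{\partial x_\ell}\bigl(a_j\circ F(x)\bigr)\,dx_\ell = \sum_m \frac{\partial a_j}{\partial x_m}(F(x))\,F^*dx_m$,
which in turn follows from the chain rule.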

If $d\alpha = 0$, we say $\alpha$ is closed; if $\alpha = d\beta$ for some $\beta \in \Lambda^{k-1}(\Omega)$, we say $\alpha$
is exact. Formula (13.42) implies that every exact form is closed. The converse is
not always true globally. Consider the multivalued angular coordinate $\theta$ on $\mathbb{R}^2 \setminus
(0,0)$; $d\theta$ is a single-valued, closed form on $\mathbb{R}^2 \setminus (0,0)$ that is not globally exact.
As we will see shortly, every closed form is locally exact.
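
One way to see that $d\theta$ is not globally exact is to integrate it over closed curves using (13.2): an exact form would integrate to zero over every closed curve. The Python sketch below does this with a midpoint rule, writing $d\theta = (-y\,dx + x\,dy)/(x^2+y^2)$. The function names, the quadrature rule, and the two sample circles are assumptions made for the illustration.

```python
import math

def dtheta(x, y):
    """Coefficients (a_1, a_2) of d(theta) = (-y dx + x dy) / (x^2 + y^2)."""
    r2 = x * x + y * y
    return (-y / r2, x / r2)

def line_integral(form, curve, velocity, t0, t1, n=20000):
    """Midpoint-rule approximation of (13.2) for a 1-form on the plane."""
    h = (t1 - t0) / n
    total = 0.0
    for i in range(n):
        t = t0 + (i + 0.5) * h
        x, y = curve(t)
        dx, dy = velocity(t)
        a1, a2 = form(x, y)
        total += (a1 * dx + a2 * dy) * h
    return total

def unit_circle(t):
    return (math.cos(t), math.sin(t))

def shifted_circle(t):
    """Circle of radius 1 centered at (3, 0); it does not enclose the origin."""
    return (3.0 + math.cos(t), math.sin(t))

def circle_velocity(t):
    return (-math.sin(t), math.cos(t))

around_origin = line_integral(dtheta, unit_circle, circle_velocity, 0.0, 2 * math.pi)
away_from_origin = line_integral(dtheta, shifted_circle, circle_velocity, 0.0, 2 * math.pi)
```

The first integral is $2\pi$, the net change of the angle around the origin, while the second is zero, consistent with $d\theta$ being exact on a region where a single-valued branch of $\theta$ exists.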

First we introduce another important construction. If $\alpha \in \Lambda^k(\Omega)$ and $X$ is a
vector field on $\Omega$, generating a flow $\mathcal{F}^t_X$, the Lie derivative $\mathcal{L}_X\alpha$ is defined to be


(13.48) $\mathcal{L}_X\alpha = \frac{d}{dt}\bigl(\mathcal{F}^t_X\bigr)^*\alpha\big|_{t=0}$.
Note the formal similarity to the definition (8.2) of $\mathcal{L}_XY$ for a vector field $Y$.
Recall the formula (8.4) for $\mathcal{L}_XY$. The following is not only a computationally
convenient formula for $\mathcal{L}_X\alpha$, but also an identity of fundamental importance.


Proposition 13.1. We have
(13.49) $\mathcal{L}_X\alpha = d(\alpha\rfloor X) + (d\alpha)\rfloor X$.

Proof. First we compare both sides in the special case X = 0/Oxy = Dy. Note 
that 
(F3,) a= S- a;(x+ tee) dx;, \--- A dx;,, 
j 
so 


(13.50) Lp,a= > a dj, N+++ A da;, = Ova. 
j 


To evaluate the right side of (13.49) with X = Dz, use (13.41) to write this 
quantity as 


(13.51) d(tga) + eda = S°(9; Aj Le + Ue0j;Aj)o% 
j=l 


Using the commutativity of 0; with A; and with vz, and the anticommutation 
relations (13.37), we see that the right side of (13.51) is Oga, which coincides 
with (13.50). Thus the proposition holds for X = 0/0z». 

Now we can prove the proposition in general, for a smooth vector field $X$ on $\Omega$. It is to be verified at each point $x_0 \in \Omega$. If $X(x_0) \neq 0$, choose a coordinate system about $x_0$ so that $X = \partial/\partial x_1$, and use the calculation above. This shows that the desired identity holds on the set of points $\{x_0 \in \Omega : X(x_0) \neq 0\}$, and by continuity it holds on the closure of this set. However, if $x_0 \in \Omega$ has a neighborhood on which $X$ vanishes, it is clear that $\mathcal{L}_X \alpha = 0$ near $x_0$ and also $\alpha \rfloor X$ and $d\alpha \rfloor X$ vanish near $x_0$. This completes the proof.
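As a quick sanity check of (13.49), here is a worked instance on $\mathbb{R}^2$. The particular vector field and 1-form are chosen here for illustration and do not come from the text; the interior product is taken in the sense $(\alpha \rfloor X)(\cdot) = \alpha(X, \cdot)$.

```latex
% Illustrative data: X = x \partial_x + y \partial_y,  \alpha = x\,dy.
% The flow of X is F_X^t(x,y) = (e^t x, e^t y), so (F_X^t)^*\alpha = e^{2t} x\,dy, and
\mathcal{L}_X \alpha = \frac{d}{dt}\, e^{2t} x\, dy \,\Big|_{t=0} = 2x\, dy .
% Right side of (13.49):
\alpha \rfloor X = xy, \qquad d(\alpha \rfloor X) = y\, dx + x\, dy ,
d\alpha = dx \wedge dy, \qquad (d\alpha) \rfloor X = x\, dy - y\, dx ,
d(\alpha \rfloor X) + (d\alpha) \rfloor X = 2x\, dy .
```

Both sides agree, as the proposition asserts.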


The identity (13.49) can furnish a formula for the exterior derivative in terms of Lie brackets, as follows. By (8.4) and (13.49), we have, for a $k$-form $\omega$,

(13.52) $(\mathcal{L}_X \omega)(X_1, \dots, X_k) = X \cdot \omega(X_1, \dots, X_k) - \sum_j \omega(X_1, \dots, [X, X_j], \dots, X_k).$


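Now (13.49) can be rewritten as

(13.53) $\iota_X\, d\omega = \mathcal{L}_X \omega - d \iota_X \omega.$

This implies

(13.54) $(d\omega)(X_0, X_1, \dots, X_k) = (\mathcal{L}_{X_0} \omega)(X_1, \dots, X_k) - (d \iota_{X_0} \omega)(X_1, \dots, X_k).$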


We can substitute (13.52) into the first term on the right in (13.54). In case $\omega$ is a 1-form, the last term is easily evaluated; we get

(13.55) $(d\omega)(X_0, X_1) = X_0 \cdot \omega(X_1) - X_1 \cdot \omega(X_0) - \omega([X_0, X_1]).$

More generally, we can tackle the last term on the right side of (13.54) by the same method, using (13.53) with $\omega$ replaced by the $(k-1)$-form $\iota_{X_0}\omega$. In this way we inductively obtain the formula

(13.56) $(d\omega)(X_0, \dots, X_k) = \sum_{\ell=0}^k (-1)^\ell X_\ell \cdot \omega(X_0, \dots, \widehat{X}_\ell, \dots, X_k) + \sum_{0 \le \ell < j \le k} (-1)^{j+\ell} \omega([X_\ell, X_j], X_0, \dots, \widehat{X}_\ell, \dots, \widehat{X}_j, \dots, X_k).$


Note that from (13.48) and the property $F_X^{s+t} = F_X^s F_X^t$ it easily follows that

(13.57) $\frac{d}{dt}(F_X^t)^*\alpha = \mathcal{L}_X (F_X^t)^*\alpha = (F_X^t)^* \mathcal{L}_X \alpha.$


It is useful to generalize this. Let $F_t$ be any smooth family of diffeomorphisms from $M$ to $F_t(M) \subset M$. Define vector fields $X_t$ on $F_t(M)$ by

(13.58) $\frac{d}{dt} F_t(x) = X_t(F_t(x)).$

Then it easily follows that, for $\alpha \in \Lambda^k M$,

(13.59) $\frac{d}{dt} F_t^*\alpha = F_t^* \mathcal{L}_{X_t} \alpha = F_t^*\bigl[d(\alpha \rfloor X_t) + (d\alpha) \rfloor X_t\bigr].$

In particular, if $\alpha$ is closed, then if $F_t$ are diffeomorphisms for $0 \le t \le 1$,

(13.60) $F_1^*\alpha - F_0^*\alpha = d\beta, \quad \beta = \int_0^1 F_t^*(\alpha \rfloor X_t)\, dt.$


Using this, we can prove the celebrated Poincaré lemma. 


Theorem 13.2. If $B$ is the unit ball in $\mathbb{R}^n$, centered at 0, $\alpha \in \Lambda^k(B)$, $k > 0$, and $d\alpha = 0$, then $\alpha = d\beta$ for some $\beta \in \Lambda^{k-1}(B)$.


Proof. Consider the family of maps $F_t : B \to B$ given by $F_t(x) = tx$. For $0 < t \le 1$, these are diffeomorphisms, and the formula (13.59) applies. Note that

$F_1^*\alpha = \alpha, \quad F_0^*\alpha = 0.$

Now a simple limiting argument shows that (13.60) remains valid, so $\alpha = d\beta$ with

(13.61) $\beta = \int_0^1 F_t^*(\alpha \rfloor V)\, t^{-1}\, dt,$

where $V = r\, \partial/\partial r = \sum_j x_j\, \partial/\partial x_j$. Since $F_0^* = 0$, the apparent singularity in the integrand is removable.
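For a closed 1-form $\alpha = a\,dx + b\,dy$ on the unit disk, formula (13.61) reduces to $\beta(x, y) = \int_0^1 [a(tx, ty)\,x + b(tx, ty)\,y]\,dt$. The sketch below evaluates this integral by the midpoint rule and checks $d\beta = \alpha$ by finite differences. The function name `poincare_potential` and the sample form $\alpha = d(e^{xy})$ are assumptions made for illustration only.

```python
import math

def poincare_potential(a, b, x, y, n=2000):
    """beta(x, y) = int_0^1 [a(tx, ty) x + b(tx, ty) y] dt, by the midpoint rule."""
    total = 0.0
    for i in range(n):
        t = (i + 0.5) / n
        total += a(t * x, t * y) * x + b(t * x, t * y) * y
    return total / n

# Sample closed 1-form: alpha = y e^{xy} dx + x e^{xy} dy, which equals d(e^{xy}).
a = lambda x, y: y * math.exp(x * y)
b = lambda x, y: x * math.exp(x * y)

x0, y0, h = 0.3, 0.5, 1e-5
beta = poincare_potential(a, b, x0, y0)
# Central differences approximate the coefficients of d(beta).
beta_x = (poincare_potential(a, b, x0 + h, y0) - poincare_potential(a, b, x0 - h, y0)) / (2 * h)
beta_y = (poincare_potential(a, b, x0, y0 + h) - poincare_potential(a, b, x0, y0 - h)) / (2 * h)
```

Here $\beta$ comes out as $e^{xy} - 1$, the potential normalized to vanish at the origin.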


Since in the proof of the theorem we dealt with $F_t$ such that $F_0$ was not a diffeomorphism, we are motivated to generalize (13.60) to the case where $F_t : M \to N$ is a smooth family of maps, not necessarily diffeomorphisms. Then (13.58) does not work to define $X_t$ as a vector field, but we do have

(13.62) $\frac{d}{dt} F_t(x) = Z(t, x); \quad Z(t, x) \in T_{F_t(x)} N.$

Now in (13.60) we see that

$F_t^*(\alpha \rfloor X_t)(Y_1, \dots, Y_{k-1}) = \alpha(F_t(x))\bigl(X_t, DF_t(x)Y_1, \dots, DF_t(x)Y_{k-1}\bigr),$

and we can replace $X_t$ by $Z(t, x)$. Hence, in this more general case, if $\alpha$ is closed, we can write

(13.63) $F_1^*\alpha - F_0^*\alpha = d\beta, \quad \beta = \int_0^1 \gamma_t\, dt,$

where, at $x \in M$,

(13.64) $\gamma_t(Y_1, \dots, Y_{k-1}) = \alpha(F_t(x))\bigl(Z(t, x), DF_t(x)Y_1, \dots, DF_t(x)Y_{k-1}\bigr).$


For an alternative approach to this homotopy invariance, see Exercise 7. 
A basic result in the theory of differential forms is the generalized Stokes 
formula: 


Proposition 13.3. Given a compactly supported $(k-1)$-form $\beta$ of class $C^1$ on an oriented $k$-dimensional manifold $M$ (of class $C^2$) with boundary $\partial M$, with its natural orientation,

(13.65) $\int_M d\beta = \int_{\partial M} \beta.$


The orientation induced on $\partial M$ is uniquely determined by the following requirement. If

(13.66) $M = \mathbb{R}^k_- = \{x \in \mathbb{R}^k : x_1 \le 0\},$

then $\partial M = \{(x_2, \dots, x_k)\}$ has the orientation determined by $dx_2 \wedge \cdots \wedge dx_k$.


Proof. Using a partition of unity and invariance of the integral and the exterior derivative under coordinate transformations, it suffices to prove this when $M$ has the form (13.66). In that case, we will be able to deduce (13.65) from the fundamental theorem of calculus. Indeed, if

(13.67) $\beta = b_j(x)\, dx_1 \wedge \cdots \wedge \widehat{dx_j} \wedge \cdots \wedge dx_k,$

with $b_j(x)$ of bounded support, we have

(13.68) $d\beta = (-1)^{j-1} \frac{\partial b_j}{\partial x_j}\, dx_1 \wedge \cdots \wedge dx_k.$

If $j > 1$, we have

(13.69) $\int_M d\beta = (-1)^{j-1} \int \Bigl\{ \int_{-\infty}^{\infty} \frac{\partial b_j}{\partial x_j}\, dx_j \Bigr\}\, dx' = 0,$

and also $\kappa^*\beta = 0$, where $\kappa : \partial M \to M$ is the inclusion. On the other hand, for $j = 1$, we have

(13.70) $\int_M d\beta = \int \Bigl\{ \int_{-\infty}^{0} \frac{\partial b_1}{\partial x_1}\, dx_1 \Bigr\}\, dx_2 \cdots dx_k = \int b_1(0, x')\, dx' = \int_{\partial M} \beta.$


This proves Stokes’ formula (13.65). 
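Formula (13.65) is easy to test numerically in a simple case. The sketch below uses the unit disk in $\mathbb{R}^2$ and the 1-form $\beta = x y^2\, dy$, for which $d\beta = y^2\, dx \wedge dy$. Both the domain and the form are illustrative choices, and the boundary circle is traversed counterclockwise, which is assumed here to be the induced orientation.

```python
import math

def boundary_integral(n=4000):
    """Integrate beta = x y^2 dy over the unit circle, counterclockwise."""
    ds = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * ds
        x, y = math.cos(s), math.sin(s)
        total += x * y * y * math.cos(s) * ds   # on the circle, dy = cos(s) ds
    return total

def interior_integral(nr=400, nt=400):
    """Integrate d(beta) = y^2 dx ^ dy over the unit disk, in polar coordinates."""
    dr, dt = 1.0 / nr, 2.0 * math.pi / nt
    total = 0.0
    for i in range(nr):
        r = (i + 0.5) * dr
        for j in range(nt):
            y = r * math.sin((j + 0.5) * dt)
            total += y * y * r * dr * dt        # area element is r dr dt
    return total

lhs = interior_integral()   # integral of d(beta) over M
rhs = boundary_integral()   # integral of beta over the boundary of M
```

Both quantities approximate $\pi/4$, up to quadrature error in the interior integral.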


It is useful to allow singularities in $\partial M$. We say a point $p \in M$ is a corner of dimension $\nu$ if there is a neighborhood $U$ of $p$ in $M$ and a $C^2$-diffeomorphism of $U$ onto a neighborhood of 0 in

(13.71) $K = \{x \in \mathbb{R}^k : x_j \le 0, \text{ for } 1 \le j \le k - \nu\},$

where $k$ is the dimension of $M$. If $M$ is a $C^2$-manifold and every point $p \in \partial M$ is a corner (of some dimension), we say $M$ is a $C^2$-manifold with corners. In such a case, $\partial M$ is a locally finite union of $C^2$-manifolds with corners. The following result extends Proposition 13.3.


Proposition 13.4. If $M$ is a $C^2$-manifold of dimension $k$, with corners, and $\beta$ is a compactly supported $(k-1)$-form of class $C^1$ on $M$, then (13.65) holds.

Proof. It suffices to establish this when $\beta$ is supported on a small neighborhood of a corner $p \in \partial M$, of the form $U$ described above. Hence it suffices to show that (13.65) holds whenever $\beta$ is a $(k-1)$-form of class $C^1$, with compact support on $K$ in (13.71); and we can take $\beta$ to have the form (13.67). Then, for $j > k - \nu$, (13.69) still holds, while for $j \le k - \nu$, we have, as in (13.70),

(13.72) $\int_K d\beta = (-1)^{j-1} \int \Bigl\{ \int_{-\infty}^{0} \frac{\partial b_j}{\partial x_j}\, dx_j \Bigr\}\, dx_1 \cdots \widehat{dx_j} \cdots dx_k = (-1)^{j-1} \int b_j(x_1, \dots, x_{j-1}, 0, x_{j+1}, \dots, x_k)\, dx_1 \cdots \widehat{dx_j} \cdots dx_k = \int_{\partial K} \beta.$


The reason we required $M$ to be a manifold of class $C^2$ (with corners) in Propositions 13.3 and 13.4 is the following. Due to the formulas (13.18)–(13.19) for a pull-back, if $\beta$ is of class $C^j$ and $F$ is of class $C^\ell$, then $F^*\beta$ is generally of class $C^\mu$, with $\mu = \min(j, \ell - 1)$. Thus, if $j = \ell = 1$, $F^*\beta$ might be only of class $C^0$, so there is not a well-defined notion of a differential form of class $C^1$ on a $C^1$-manifold, though such a notion is well defined on a $C^2$-manifold. This problem can be overcome, and one can extend Propositions 13.3 and 13.4 to the case where $M$ is a $C^1$-manifold (with corners) and $\beta$ is a $(k-1)$-form with the property that both $\beta$ and $d\beta$ are continuous. We will not go into the details. Substantially more sophisticated generalizations are given in [Fed].


Exercises 


1. If $F : U_0 \to U_1$ and $G : U_1 \to U_2$ are smooth maps and $\alpha \in \Lambda^k(U_2)$, (13.26) implies
$(G \circ F)^*\alpha = F^*(G^*\alpha)$ in $\Lambda^k(U_0)$.
In the special case that $U_j = \mathbb{R}^n$, $F$ and $G$ are linear maps, and $k = n$, show that this identity implies
$\det(GF) = (\det F)(\det G)$.
2. If $\alpha$ is a closed form and $\beta$ is exact, show that $\alpha \wedge \beta$ is exact. (Hint: Use (13.43).)


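Let $\Lambda^k(\mathbb{R}^n)$ denote the space of $k$-forms (13.12) with constant coefficients. If $T : \mathbb{R}^m \to \mathbb{R}^n$ is linear, then $T^*$ preserves this class of spaces; we denote the map

$\Lambda^k T^* : \Lambda^k \mathbb{R}^n \to \Lambda^k \mathbb{R}^m.$

Similarly, replacing $T$ by $T^*$ yields

$\Lambda^k T : \Lambda^k \mathbb{R}^m \to \Lambda^k \mathbb{R}^n.$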


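3. Show that $\Lambda^k T$ is uniquely characterized as a linear map from $\Lambda^k \mathbb{R}^m$ to $\Lambda^k \mathbb{R}^n$ that satisfies

$(\Lambda^k T)(v_1 \wedge \cdots \wedge v_k) = (Tv_1) \wedge \cdots \wedge (Tv_k), \quad v_j \in \mathbb{R}^m.$

4. If $\{e_1, \dots, e_n\}$ is the standard orthonormal basis of $\mathbb{R}^n$, define an inner product on $\Lambda^k \mathbb{R}^n$ by declaring an orthonormal basis to be

$\{e_{j_1} \wedge \cdots \wedge e_{j_k} : 1 \le j_1 < \cdots < j_k \le n\}.$

Show that if $\{u_1, \dots, u_n\}$ is any other orthonormal basis of $\mathbb{R}^n$, then the set

$\{u_{j_1} \wedge \cdots \wedge u_{j_k} : 1 \le j_1 < \cdots < j_k \le n\}$

is an orthonormal basis of $\Lambda^k \mathbb{R}^n$.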
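5. Let $F$ be a vector field on $U$, open in $\mathbb{R}^3$, $F = \sum_1^3 f_j(x)\, \partial/\partial x_j$. Consider the 1-form $\varphi = \sum_1^3 f_j(x)\, dx_j$. Show that $d\varphi$ and $\operatorname{curl} F$ are related in the following way:

$\operatorname{curl} F = \sum_1^3 g_j(x)\, \frac{\partial}{\partial x_j},$

$d\varphi = g_1(x)\, dx_2 \wedge dx_3 + g_2(x)\, dx_3 \wedge dx_1 + g_3(x)\, dx_1 \wedge dx_2.$

6. If $F$ and $\varphi$ are related as in Exercise 5, show that $\operatorname{curl} F$ is uniquely specified by the relation
$d\varphi \wedge \alpha = \langle \operatorname{curl} F, \alpha \rangle\, \omega$
for all 1-forms $\alpha$ on $U \subset \mathbb{R}^3$, where $\omega = dx_1 \wedge dx_2 \wedge dx_3$ is the volume form.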

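7. Suppose $f_0, f_1 : X \to Y$ are smoothly homotopic maps, via $\Phi : X \times \mathbb{R} \to Y$, $\Phi(x, j) = f_j(x)$. Let $\alpha \in \Lambda^k Y$ be closed. Apply (13.60) to $\tilde\alpha = \Phi^*\alpha \in \Lambda^k(X \times \mathbb{R})$, with $F_t(x, s) = (x, s + t)$, to obtain $\tilde\beta \in \Lambda^{k-1}(X \times \mathbb{R})$ such that $F_1^*\tilde\alpha - \tilde\alpha = d\tilde\beta$, and from there produce $\beta \in \Lambda^{k-1}(X)$ such that $f_1^*\alpha - f_0^*\alpha = d\beta$. (Hint: Use $\beta = \iota^*\tilde\beta$, where $\iota(x) = (x, 0)$.)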


For the next set of exercises, let $\Omega$ be a planar domain, $X = f(x, y)\, \partial/\partial x + g(x, y)\, \partial/\partial y$ a nonvanishing vector field on $\Omega$. Consider the 1-form $\alpha = g(x, y)\, dx - f(x, y)\, dy$.

8. Let $\gamma : I \to \Omega$ be a smooth curve, $I = (a, b)$. Show that the image $C = \gamma(I)$ is the image of an integral curve of $X$ if and only if $\gamma^*\alpha = 0$. Consequently, with slight abuse of notation, one describes the integral curves by $g\, dx - f\, dy = 0$. If $\alpha$ is exact (i.e., $\alpha = du$), conclude that the level curves of $u$ are the integral curves of $X$.

9. A function $\varphi$ is called an integrating factor if $\tilde\alpha = \varphi\alpha$ is exact (i.e., if $d(\varphi\alpha) = 0$, provided $\Omega$ is simply connected). Show that an integrating factor always exists, at least locally. Show that $\varphi = e^v$ is an integrating factor if and only if $Xv = -\operatorname{div} X$. Reconsider Exercise 7 in §7. Find an integrating factor for $\alpha = (x^2 + y^2 - 1)\, dx - 2xy\, dy$.

10. Let $Y$ be a vector field that you know how to linearize (i.e., conjugate to $\partial/\partial x$) and suppose $\mathcal{L}_Y \alpha = 0$. Show how to construct an integrating factor for $\alpha$. Treat the more general case $\mathcal{L}_Y \alpha = c\alpha$ for some constant $c$. Compare the discussion in §8 of the situation where $[X, Y] = cX$.


14. The symplectic form and canonical transformations
Recall from §10 that a Hamiltonian vector field on a region $\Omega \subset \mathbb{R}^{2n}$, with coordinates $\zeta = (x, \xi)$, is a vector field of the form

(14.1) $H_f = \sum_{j=1}^n \Bigl[ \frac{\partial f}{\partial \xi_j} \frac{\partial}{\partial x_j} - \frac{\partial f}{\partial x_j} \frac{\partial}{\partial \xi_j} \Bigr].$

We want to gain an understanding of Hamiltonian vector fields, free from coordinates. In particular, we ask the following question. Let $F : \mathcal{O} \to \Omega$ be a diffeomorphism, and let $H_f$ be a Hamiltonian vector field on $\Omega$. Under what condition on $F$ is $F_\# H_f$ a Hamiltonian vector field on $\mathcal{O}$?
A central object in this study is the symplectic form, a 2-form on $\mathbb{R}^{2n}$ defined by

(14.2) $\sigma = \sum_{j=1}^n d\xi_j \wedge dx_j.$


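Note that if

$U = \sum_j \Bigl[ u^j(\zeta) \frac{\partial}{\partial x_j} + a^j(\zeta) \frac{\partial}{\partial \xi_j} \Bigr], \quad V = \sum_j \Bigl[ v^j(\zeta) \frac{\partial}{\partial x_j} + b^j(\zeta) \frac{\partial}{\partial \xi_j} \Bigr],$

then

(14.3) $\sigma(U, V) = \sum_{j=1}^n \bigl[ -u^j(\zeta) b^j(\zeta) + a^j(\zeta) v^j(\zeta) \bigr].$

In particular, $\sigma$ satisfies the following nondegeneracy condition: If $U$ has the property that, for some $(x_0, \xi_0) \in \mathbb{R}^{2n}$, $\sigma(U, V) = 0$ at $(x_0, \xi_0)$ for all vector fields $V$, then $U$ must vanish at $(x_0, \xi_0)$. The relation between the symplectic form and Hamiltonian vector fields is as follows: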


Proposition 14.1. The vector field $H_f$ is uniquely determined by the identity

(14.4) $\sigma \rfloor H_f = -df.$

Proof. The content of the identity is

(14.5) $\sigma(H_f, V) = -Vf,$


for any smooth vector field $V$. If $V$ has the form used in (14.3), then that identity gives

$\sigma(H_f, V) = -\sum_{j=1}^n \Bigl[ \frac{\partial f}{\partial x_j} v^j + \frac{\partial f}{\partial \xi_j} b^j \Bigr],$

which coincides with the right side of (14.5). In view of the nondegeneracy of $\sigma$, the proposition is proved. Note the special case

(14.6) $\sigma(H_f, H_g) = \{f, g\}.$
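The identity (14.6) can be checked numerically for $n = 1$. This is a minimal sketch under stated assumptions: the Poisson bracket is taken as $\{f, g\} = H_f g$, derivatives are approximated by central differences, and the helper names and the sample functions $f$, $g$ are hypothetical choices made for illustration.

```python
import math

def partials(f, x, xi, h=1e-6):
    """Central-difference approximations (df/dx, df/dxi)."""
    fx = (f(x + h, xi) - f(x - h, xi)) / (2 * h)
    fxi = (f(x, xi + h) - f(x, xi - h)) / (2 * h)
    return fx, fxi

def hamiltonian_field(f, x, xi):
    """Components (d/dx part, d/dxi part) of H_f as in (14.1), with n = 1."""
    fx, fxi = partials(f, x, xi)
    return fxi, -fx

def sigma(U, V):
    """sigma(U, V) as in (14.3), with U = (u, a) and V = (v, b)."""
    (u, a), (v, b) = U, V
    return -u * b + a * v

def poisson_bracket(f, g, x, xi):
    """{f, g} = H_f g (bracket convention assumed here)."""
    gx, gxi = partials(g, x, xi)
    hx, hxi = hamiltonian_field(f, x, xi)
    return hx * gx + hxi * gxi

f = lambda x, xi: 0.5 * xi ** 2 + math.cos(x)
g = lambda x, xi: x * xi + math.sin(xi)
pt = (0.7, -0.4)
lhs = sigma(hamiltonian_field(f, *pt), hamiltonian_field(g, *pt))
rhs = poisson_bracket(f, g, *pt)
```

With the opposite sign convention for the bracket, the two sides would differ by a sign, so the check also pins down which convention is in force.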
The following is an immediate corollary. 


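Proposition 14.2. If $\mathcal{O}, \Omega$ are open in $\mathbb{R}^{2n}$, and $F : \mathcal{O} \to \Omega$ is a diffeomorphism preserving $\sigma$, that is, satisfying

(14.7) $F^*\sigma = \sigma,$

then for any $f \in C^\infty(\Omega)$, $F_\# H_f$ is Hamiltonian on $\mathcal{O}$ and

(14.8) $F_\# H_f = H_{F^*f},$

where $F^*f(y) = f(F(y))$.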


A diffeomorphism satisfying (14.7) is called a canonical transformation, or a symplectic transformation. Let us now look at the condition on a vector field $X$ on $\Omega$ that the flow $F_X^t$ generated by $X$ preserve $\sigma$ for each $t$. There is a simple general condition in terms of the Lie derivative for a given form to be preserved.

Lemma 14.3. Let $\alpha \in \Lambda^k(\Omega)$. Then $(F_X^t)^*\alpha = \alpha$ for all $t$ if and only if

$\mathcal{L}_X \alpha = 0.$


Proof. This is an immediate consequence of (13.57). 


Recall the formula (13.49):

(14.9) $\mathcal{L}_X \alpha = d(\alpha \rfloor X) + (d\alpha) \rfloor X.$

We apply it in the case where $\alpha = \sigma$ is the symplectic form. Clearly, (14.2) implies

(14.10) $d\sigma = 0,$

so

(14.11) $\mathcal{L}_X \sigma = d(\sigma \rfloor X).$


Consequently, $F_X^t$ preserves the symplectic form $\sigma$ if and only if $d(\sigma \rfloor X) = 0$ on $\Omega$. In view of Poincaré's lemma, at least locally, one has a smooth function $f(x, \xi)$ such that

(14.12) $\sigma \rfloor X = df,$

provided $d(\sigma \rfloor X) = 0$. Any two $f$'s satisfying (14.12) must differ by a constant, and it follows that such $f$ exists globally provided $\Omega$ is simply connected. In view of Proposition 14.1, (14.12) is equivalent to the identity

(14.13) $X = -H_f.$

In particular, we have established the following result.
In particular, we have established the following result. 


Proposition 14.4. The flow generated by a Hamiltonian vector field $H_f$ preserves the symplectic form $\sigma$.
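For $n = 1$ the symplectic form is $d\xi \wedge dx$, and a map $F$ of the plane satisfies $F^*\sigma = \det(DF)\,\sigma$, so preservation of $\sigma$ amounts to $\det(DF) = 1$. The sketch below tests this for the time-one flow of $H_f$ with the pendulum Hamiltonian $f = \xi^2/2 - \cos x$. The Hamiltonian, the RK4 integrator, and the finite-difference Jacobian are illustrative assumptions, and the result is only approximate.

```python
import math

def flow(x, xi, t=1.0, steps=1000):
    """RK4 approximation to the time-t flow of H_f for f = xi^2/2 - cos(x)."""
    def rhs(x, xi):
        return xi, -math.sin(x)   # (df/dxi, -df/dx), as in (14.1)
    dt = t / steps
    for _ in range(steps):
        k1 = rhs(x, xi)
        k2 = rhs(x + 0.5 * dt * k1[0], xi + 0.5 * dt * k1[1])
        k3 = rhs(x + 0.5 * dt * k2[0], xi + 0.5 * dt * k2[1])
        k4 = rhs(x + dt * k3[0], xi + dt * k3[1])
        x += dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        xi += dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
    return x, xi

def jacobian_det(x, xi, h=1e-5):
    """Determinant of the flow map's Jacobian, by central differences."""
    xp, xm = flow(x + h, xi), flow(x - h, xi)
    yp, ym = flow(x, xi + h), flow(x, xi - h)
    d11 = (xp[0] - xm[0]) / (2 * h)
    d12 = (yp[0] - ym[0]) / (2 * h)
    d21 = (xp[1] - xm[1]) / (2 * h)
    d22 = (yp[1] - ym[1]) / (2 * h)
    return d11 * d22 - d12 * d21

det = jacobian_det(1.0, 0.5)
```

The computed determinant is 1 to within discretization error, which in this dimension is the same statement as area preservation.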


It follows a fortiori that the flow $F^t$ generated by a Hamiltonian vector field $H_f$ leaves invariant the $2n$-form

$v = \sigma \wedge \cdots \wedge \sigma$ ($n$ factors),

which provides a volume form on $\Omega$. That this volume form is preserved is known as a theorem of Liouville. This result has the following refinement. Let $S$ be a level surface of the function $f$; suppose $f$ is nondegenerate on $S$. Then we can define a $(2n-1)$-form $w$ on $S$ (giving rise to a volume element on $S$) which is also invariant under the flow $F^t$, as follows. Let $X$ be any vector field on $\Omega$ such that $Xf = 1$ on $S$, and define

(14.14) $w = j^*(v \rfloor X),$

where $j : S \hookrightarrow \Omega$ is the natural inclusion. We claim this is well defined.


Lemma 14.5. The form (14.14) is independent of the choice of $X$, as long as $Xf = 1$ on $S$.

Proof. The difference of two such forms is $j^*(v \rfloor Y_1)$, where $Y_1 f = 0$ on $S$, that is, $Y_1$ is tangent to $S$. Now this form, acting on vectors $Y_2, \dots, Y_{2n}$, all tangent to $S$, is merely $(j^*v)(Y_1, \dots, Y_{2n})$; but obviously $j^*v = 0$ since $\dim S < 2n$.


We can now establish the invariance of the form $w$ on $S$.

Proposition 14.6. The form (14.14) is invariant under the flow $F^t$ on $S$.

Proof. Since $v$ is invariant under $F^t$, we have

$F^{t*} w = j^*(F^{t*}v \rfloor F^t_\# X) = j^*(v \rfloor F^t_\# X) = w + j^*\bigl(v \rfloor (F^t_\# X - X)\bigr).$

Since $F^{t*} f = f$, we see that $(F^t_\# X) f = 1 = Xf$, so the last term vanishes, by Lemma 14.5, and the proof is complete.


Let $\mathcal{O} \subset \mathbb{R}^n$ be open; we claim that the symplectic form $\sigma$ is well defined on $T^*\mathcal{O} = \mathcal{O} \times \mathbb{R}^n$, in the following sense. Suppose $g : \mathcal{O} \to \Omega$ is a diffeomorphism (i.e., a coordinate change). The map this induces from $T^*\mathcal{O}$ to $T^*\Omega$ is

(14.15) $G(x, \xi) = \bigl(g(x), ((Dg)^t)^{-1}(x)\xi\bigr) = (y, \eta).$

Our invariance result is

(14.16) $G^*\sigma = \sigma.$


In fact, a stronger result is true. We can write

(14.17) $\sigma = d\kappa, \quad \kappa = \sum_j \xi_j\, dx_j,$

where the 1-form $\kappa$ is called the contact form. We claim that

(14.18) $G^*\kappa = \kappa,$

which implies (14.16), since $G^*d\kappa = dG^*\kappa$. To see (14.18), note that

$dy_j = \sum_k \frac{\partial g_j}{\partial x_k}\, dx_k, \quad \eta_j = \sum_\ell H_{j\ell}\, \xi_\ell,$

where $(H_{j\ell})$ is the matrix of $((Dg)^t)^{-1}$, that is, the inverse matrix of $(\partial g_\ell/\partial x_j)$. Hence

(14.19) $\sum_j \eta_j\, dy_j = \sum_{j,k,\ell} \frac{\partial g_j}{\partial x_k} H_{j\ell}\, \xi_\ell\, dx_k = \sum_{k,\ell} \delta_{k\ell}\, \xi_\ell\, dx_k = \sum_k \xi_k\, dx_k,$

which establishes (14.18).

As a particular case, a vector field $Y$ on $\mathcal{O}$, generating a flow $F_Y^t$ on $\mathcal{O}$, induces a flow $G_Y^t$ on $T^*\mathcal{O}$. Not only does this flow preserve the symplectic form; in fact, $G_Y^t$ is generated by the Hamiltonian vector field $H_\Phi$, where

(14.20) $\Phi(x, \xi) = \langle Y(x), \xi \rangle = \sum_j \xi_j v^j(x)$

if $Y = \sum_j v^j(x)\, \partial/\partial x_j$.

The symplectic form given by (14.2) can be regarded as a special case of a general symplectic form, which is a closed, nondegenerate 2-form on a domain (or manifold) $\Omega$. Often such a form $\omega$ arises naturally, in a form not a priori looking like (14.2). It is a theorem of Darboux that locally one can pick coordinates in such a fashion that $\omega$ does take the standard form (14.2). We present a short proof, due to J. Moser, of that theorem.

To start, pick $p \in \Omega$, and consider $B = \omega(p)$, a nondegenerate, antisymmetric, bilinear form on the vector space $V = T_p\Omega$. It is a simple exercise in linear algebra that if one has such a form, then $\dim V$ must be even, say $2n$, and $V$ has a basis $\{e_j, f_j : 1 \le j \le n\}$ such that

(14.21) $B(e_j, e_\ell) = B(f_j, f_\ell) = 0, \quad B(e_j, f_\ell) = \delta_{j\ell},$

for $1 \le j, \ell \le n$. Using such a basis to impose linear coordinates $(x, \xi)$ on a neighborhood of $p$, taken to the origin, we have $\omega = \omega_0 = \sum_j d\xi_j \wedge dx_j$ at $p$. Thus Darboux' theorem follows from:
Proposition 14.7. If $\omega$ and $\omega_0$ are closed, nondegenerate 2-forms on $\Omega$, and $\omega = \omega_0$ at $p \in \Omega$, then there is a diffeomorphism $G_1$ defined on a neighborhood of $p$, such that

(14.22) $G_1(p) = p$ and $G_1^*\omega = \omega_0.$

Proof. For $t \in [0, 1]$, let

(14.23) $\omega_t = (1 - t)\omega_0 + t\omega = \omega_0 + t\alpha, \quad \alpha = \omega - \omega_0.$

Thus $\alpha = 0$ at $p$, and $\alpha$ is a closed 2-form. We can therefore write

(14.24) $\alpha = d\beta$

on a neighborhood of $p$, and if $\beta$ is given by the formula (13.61) in the proof of the Poincaré lemma, we have $\beta = 0$ at $p$. Since for each $t$, $\omega_t = \omega$ at $p$, we see that each $\omega_t$ is nondegenerate on some common neighborhood of $p$, for $t \in [0, 1]$.

Our strategy will be to produce a smooth family of local diffeomorphisms $G_t$, $0 \le t \le 1$, such that $G_t(p) = p$, $G_0 = \mathrm{id}$, and such that $G_t^*\omega_t$ is independent of $t$, hence $G_t^*\omega_t = \omega_0$; in particular, $G_1^*\omega = \omega_0$. $G_t$ will be specified by a time-varying family of vector fields, via the ODE

(14.25) $\frac{d}{dt} G_t(x) = X_t(G_t(x)), \quad G_0(x) = x.$


We will have $G_t(p) = p$ provided $X_t(p) = 0$. To arrange for $G_t^*\omega_t$ to be independent of $t$, note that, by the product rule,

(14.26) $\frac{d}{dt} G_t^*\omega_t = G_t^* \mathcal{L}_{X_t}\omega_t + G_t^* \frac{d\omega_t}{dt}.$


By (14.23), $d\omega_t/dt = \alpha = d\beta$, and by Proposition 13.1,

(14.27) $\mathcal{L}_{X_t}\omega_t = d(\omega_t \rfloor X_t)$

since $\omega_t$ is closed. Thus we can write (14.26) as

(14.28) $\frac{d}{dt} G_t^*\omega_t = G_t^*\, d(\omega_t \rfloor X_t + \beta).$

This vanishes provided $X_t$ is defined to satisfy

(14.29) $\omega_t \rfloor X_t = -\beta.$

Since $\omega_t$ is nondegenerate near $p$, this does indeed uniquely specify a vector field $X_t$ near $p$, for each $t \in [0, 1]$, which vanishes at $p$, since $\beta = 0$ at $p$. The proof of Darboux' theorem is complete.


Exercises 


1. Do the linear algebra exercise stated before Proposition 14.7, as a preparation for the 
proof of Darboux’ theorem. 
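2. On $\mathbb{R}^2$, identify $(x, \xi)$ with $(x, y)$, so the symplectic form is $\sigma = dy \wedge dx$. Show that

$X = f \frac{\partial}{\partial x} + g \frac{\partial}{\partial y}$ and $\alpha = g\, dx - f\, dy$

are related by

$\alpha = \sigma \rfloor X.$

Reconsider Exercises 8-10 of §13 in light of this.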

3. Show that the volume form $w$ on the level surface $S$ of $f$, given by (14.14), can be characterized as follows. Let $S_h$ be the level set $\{f(x, \xi) = c + h\}$, $S = S_0$. Given any vector field $X$ transversal to $S$, any open set $\mathcal{O} \subset S$ with smooth boundary, let $\mathcal{O}_h$ be the thin set sandwiched between $S$ and $S_h$, lying on orbits of $X$ through $\mathcal{O}$. Then, with $v = \sigma \wedge \cdots \wedge \sigma$ the volume form on $\Omega$,

$\int_{\mathcal{O}} w = \lim_{h \to 0} \frac{1}{h} \int_{\mathcal{O}_h} v.$

4. A manifold $M \subset \mathbb{R}^{2n}$ is said to be coisotropic if, for each $p \in M$, the tangent space $T_pM$ contains its symplectic annihilator

$T_p^\sigma = \{w \in \mathbb{R}^{2n} : \sigma(v, w) = 0 \text{ for all } v \in T_pM\}.$

It is said to be Lagrangian if $T_pM = T_p^\sigma$ for all $p \in M$. If $M$ is coisotropic, show that it is naturally foliated by manifolds $\{N_q\}$ such that, for $p \in N_q$, $T_pN_q = T_p^\sigma$. (Hint: Apply Frobenius's theorem.)


15. First-order, scalar, nonlinear PDE 
This section is devoted to a study of PDE of the form 
(15.1) $F(x, u, \nabla u) = 0,$

for a real-valued $u \in C^\infty(\Omega)$, $\dim \Omega = n$, given $F(x, u, \xi)$ smooth on $\Omega \times \mathbb{R} \times \mathbb{R}^n$, or some subdomain thereof. We study local solutions of (15.1) satisfying

(15.2) $u|_S = v,$

where $S$ is a smooth hypersurface of $\Omega$, $v \in C^\infty(S)$. The study being local, we suppose $S$ is given by $x_n = 0$. Pick a point $x_0 \in S \subset \mathbb{R}^n$, and set $\zeta_0 = (\partial v/\partial x_1, \dots, \partial v/\partial x_{n-1})$ at $x_0$. Assume

(15.3) $F(x_0, v(x_0), (\zeta_0, \tau_0)) = 0, \quad \frac{\partial F}{\partial \xi_n} \neq 0$ at this point.

We call this the noncharacteristic hypothesis on $S$. We look for a solution to (15.1) near $x_0$.

In the paragraph above, $\nabla u$ denotes the $n$-tuple $(\partial u/\partial x_1, \dots, \partial u/\partial x_n)$. In view of the material in §§13 and 14, one should be used to the idea that the 1-form $du = \sum_j (\partial u/\partial x_j)\, dx_j$ has an invariant meaning. As we will see later, a Riemannian metric on $\Omega$ then associates to $du$ a vector field, denoted $\operatorname{grad} u$.

Thus, we will rephrase (15.1) as

(15.4) $F(x, u, du) = 0.$


We think of $F$ as being defined on $T^*\Omega \times \mathbb{R}$, or some open subset of this space. The first case we will treat is the case


(15.5) F(x, du) = 0. 


This sort of equation is known as an eikonal equation. From the treatment of 
(15.5), we will be able to deduce a treatment of the general case (15.4), using a 
device known as Jacobi’s trick. 

The equation (15.5) is intimately connected with the theory of Hamiltonian systems. We will use this theory to construct a surface $\Lambda$ in $\mathbb{R}^{2n}$, of dimension $n$, the graph of a function $\xi = \Xi(x)$, which ought to be the graph of $du$ for some smooth $u$. Thus our first goal is to produce a geometrical description of when

(15.6) $\Lambda$ = graph of $\xi = \Xi(x)$

is the graph of $du$ for some smooth $u$.


Proposition 15.1. The surface (15.6) is locally the graph of $du$ for some smooth $u$ if and only if

(15.7) $\frac{\partial \Xi_j}{\partial x_k} = \frac{\partial \Xi_k}{\partial x_j}, \quad \forall\, j, k.$

Proof. This follows from the Poincaré lemma, since (15.7) is the same as the condition that $\sum_j \Xi_j(x)\, dx_j$ be closed.


The next step is to produce the following geometrical restatement. 
Proposition 15.2. The surface $\Lambda$ of (15.6) is the graph of $du$ (locally) if and only if $\sigma(X, Y) = 0$ for all vectors $X, Y$ tangent to $\Lambda$, where $\sigma$ is the symplectic form.

If $\Lambda$ satisfies this condition, and $\dim \Lambda = n$, we say $\Lambda$ is a Lagrangian surface.


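Proof. We may as well check $\sigma(X_j, X_k)$ for some specific set $X_1, \dots, X_n$ of linearly independent vector fields, tangent to $\Lambda$. Thus, take

(15.8) $X_j = \frac{\partial}{\partial x_j} + \sum_k \frac{\partial \Xi_k}{\partial x_j} \frac{\partial}{\partial \xi_k}.$

In view of the formula (14.3), we have

(15.9) $\sigma(X_j, X_k) = \frac{\partial \Xi_k}{\partial x_j} - \frac{\partial \Xi_j}{\partial x_k},$

so the result follows from Proposition 15.1.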


To continue our pursuit of the solution to (15.5), we next specify a surface $\Sigma$, of dimension $n - 1$, lying over $S = \{x_n = 0\}$, namely, with $\partial_j v = \partial v/\partial x_j$,

(15.10) $\Sigma = \{(x, \xi) : x_n = 0,\ \xi_j = \partial_j v \text{ for } 1 \le j \le n - 1,\ F(x, \xi) = 0\}.$

The noncharacteristic hypothesis implies, by the implicit function theorem, that (with $x' = (x_1, \dots, x_{n-1})$), the equation

$F(x', 0; \partial_1 v, \dots, \partial_{n-1} v, \tau) = 0$

implicitly defines $\tau = \tau(x')$, so (15.10) defines a smooth surface of dimension $n - 1$ through the point $(x_0, (\zeta_0, \tau_0))$.

We now define $\Lambda$ to be the union of the integral curves of the Hamiltonian vector field $H_F$ through $\Sigma$. Note that the noncharacteristic hypothesis implies that $H_F$ has a nonvanishing $\partial/\partial x_n$ component over $S$, so $\Lambda$ is a surface of dimension $n$, and is the graph of a function $\xi = \Xi(x)$, at least for $x$ close to $x_0$ (Fig. 15.1). Since $F$ is constant on integral curves of $H_F$, it follows that $F = 0$ on $\Lambda$.


Theorem 15.3. The surface $\Lambda$ constructed above is locally the graph of $du$, for a solution $u$ to

(15.11) $F(x, du) = 0, \quad u|_S = v.$


Proof. We will show that $\Lambda$ is Lagrangian. So let $X, Y$ be vector fields tangent to $\Lambda$ at $(x, \xi)$ in $\Lambda \subset \mathbb{R}^{2n}$. We need to examine $\sigma(X, Y)$. First suppose $x \in S$ (i.e., $(x, \xi) \in \Sigma$). Then we may decompose $X$ and $Y$ into $X = X_1 + X_2$, $Y = Y_1 + Y_2$, with $X_1, Y_1$ tangent to $\Sigma$ and $X_2, Y_2$ multiples of $H_F$ at $(x, \xi)$. It suffices to show that $\sigma(X_1, Y_1) = 0$ and $\sigma(X_1, Y_2) = 0$. Since $\Sigma$, regarded simply as projecting over $\{x_n = 0\}$, is the graph of a gradient, Proposition 15.2 implies $\sigma(X_1, Y_1) = 0$. On the other hand, $\sigma(X_1, Y_2)$ is a multiple of $\sigma(X_1, H_F) = \langle X_1, dF \rangle = X_1 F$. Since $X_1$ is tangent to $\Sigma$ and $F = 0$ on $\Sigma$, $X_1 F = 0$.

Thus we know that $\sigma(X, Y) = 0$ if $X$ and $Y$ are tangent to $\Lambda$ at a point in $\Sigma$. Suppose now that $X$ and $Y$ are tangent to $\Lambda$ at a point $F^t(x, \xi)$, where $(x, \xi) \in \Sigma$ and $F^t$ is the flow generated by $H_F$. We have

$\sigma(X, Y) = (F^{t*}\sigma)(F^t_\# X, F^t_\# Y).$

Now $F^t_\# X$ and $F^t_\# Y$ are tangent to $\Lambda$ at $(x, \xi) \in \Sigma$. We use the important fact that the flow generated by $H_F$ leaves the symplectic form invariant to conclude that

$\sigma(X, Y) = \sigma(F^t_\# X, F^t_\# Y) = 0.$

This shows that $\Lambda$ is Lagrangian.

Thus $\Lambda$ is the graph of $du$ for some smooth $u$, uniquely determined up to an additive constant. Pick $x_0 \in S$ and set $u(x_0) = v(x_0)$. We see that, on $S$, $\partial u/\partial x_j = \partial v/\partial x_j$ for $1 \le j \le n - 1$, so this forces $u|_S = v$. We have seen that $F = 0$ on $\Lambda$, so we have solved (15.11).


FIGURE 15.1 Lagrangian Surface (labels: the point $(x_0, (\zeta_0, \tau_0))$, the surface $\Lambda$, an integral curve of $H_F$, and $x$-space)

An important example of an eikonal equation is

(15.12) $|d\varphi|^2 = 1$

on a Riemannian manifold, with metric tensor $g_{jk}$. In local coordinates, (15.12) is


(15.13) $\sum_{j,k} g^{jk}(x) \frac{\partial \varphi}{\partial x_j} \frac{\partial \varphi}{\partial x_k} = 1,$

where, as before, $(g^{jk})$ is the matrix inverse to $(g_{jk})$. We want to give a geometrical description of solutions to this equation. Let $\varphi$ be specified on a hypersurface $S \subset M$; $\varphi|_S = v$. Assume that $|dv| < 1$ on $S$. Then there are two possible sections of $T^*M$ over $S$, giving the graphs of $d\varphi$ over $S$. Pick one of them; call it $\Sigma$. As we have seen, the graph of $d\varphi$ is the flow-out $\Lambda$ of $\Sigma$, via the flow generated by $H_f$, with $f(x, \xi) = (1/2)|\xi|^2 = (1/2) \sum_{j,k} g^{jk}(x)\xi_j\xi_k$, that is, via the “geodesic flow” on $T^*M$. The projections onto $M$ of the integral curves of $H_f$ in $T^*M$ are geodesics on $M$. The geometrical description of $\varphi$ arises from the following result.


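Proposition 15.4. The level surfaces of $\varphi$ are orthogonal to the geodesics that are the projections on $M$ of the integral curves of $H_f$ through $\Sigma$.

Proof. If we consider a point $x \in M$ over which $\Lambda$ is the graph of $d\varphi$, we have $(x, \xi) \in \Lambda$, $\xi = d\varphi(x)$. The assertion of the proposition is that the metric tensor, inducing an isomorphism $T_x^*M \approx T_xM$, identifies $\xi$ with $\gamma'(t)$, where $\gamma'(t)$, the tangent vector to such a geodesic, is the projection onto $T_xM$ of $H_f$ at $(x, \xi)$. Since

(15.14) $H_f = \sum_j \Bigl[ \frac{\partial f}{\partial \xi_j} \frac{\partial}{\partial x_j} - \frac{\partial f}{\partial x_j} \frac{\partial}{\partial \xi_j} \Bigr],$

this projection is equal to

(15.15) $\sum_j \frac{\partial f}{\partial \xi_j} \frac{\partial}{\partial x_j} = \sum_{j,k} g^{jk}(x)\, \xi_k\, \frac{\partial}{\partial x_j},$

which is in fact the image of $\xi \in T_x^*M$ under the natural metric isomorphism $T_x^*M \approx T_xM$. This proves the proposition.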


We can restate it this way. The metric isomorphism $T^*M \approx TM$ produces from the 1-form $d\varphi$, the gradient vector field $\operatorname{grad} \varphi$. In local coordinates, with $d\varphi = \sum_j (\partial\varphi/\partial x_j)\, dx_j$, we have

(15.16) $\operatorname{grad} \varphi = \sum_{j,k} g^{jk}(x) \frac{\partial \varphi}{\partial x_j} \frac{\partial}{\partial x_k}.$

Thus, the content of the last proposition is the following:

Corollary 15.5. If $\gamma(t)$ is the geodesic of unit speed that is the projection on $M$ of an integral curve of $H_f$ through $\Sigma$, then

(15.17) $\operatorname{grad} \varphi(x) = \gamma'(t)$, at $x = \gamma(t)$.


Suppose, for example, that for an initial condition on $\varphi$ we take $\varphi = c$ (constant) on the surface $S$. Then, near $S$, the other level sets of $\varphi$ are described as follows. For $p \in S$, let $\gamma_p(t)$ be the unit-speed geodesic through $p$, so $\gamma_p(0) = p$, orthogonal to $S$, going in one of two possible directions, corresponding to a choice of one of two possible $\Sigma$s, as mentioned above. Then

(15.18) $\varphi(x) = c + t$, at $x = \gamma_p(t)$.

This gives a very geometrical picture of solutions to (15.12).

On flat Euclidean space, where geodesics are just straight lines, these formulas become quite explicit. Suppose, for example, that we want to solve $|d\varphi|^2 = 1$ on $\mathbb{R}^n$ (i.e., $\sum_j (\partial\varphi/\partial x_j)^2 = 1$), and we prescribe

(15.19) $\varphi = 0$ on a surface $S$ defined by $\psi(x) = 0$,

where $\psi(x)$ is given. Then it is clear that, for $|t|$ not too large, $\varphi$ is defined by

(15.20) $\varphi\bigl(x + t|\nabla\psi(x)|^{-1}\nabla\psi(x)\bigr) = t$, for $x \in S$.

For small $a$, $I = (-a, a)$, the map

(15.21) $\Psi : S \times I \to \mathbb{R}^n$

given by

(15.22) $\Psi(x, t) = x + t|\nabla\psi(x)|^{-1}\nabla\psi(x)$

is a diffeomorphism, but simple examples show that this can break down for large $|t|$.
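One such simple example can be sketched in the plane. Take $\psi(x, y) = x^2 + y^2 - 1$, so that $S$ is the unit circle and the solution of (15.19)–(15.20) is the signed distance $\varphi = \sqrt{x^2 + y^2} - 1$. The choice of $\psi$ and the helper names below are illustrative assumptions, not taken from the text. The map (15.22) then moves each point of $S$ along its normal line, and at $t = -1$ all these lines meet at the origin, so $\Psi$ fails to be injective there.

```python
import math

def grad_psi(x, y):
    """Gradient of psi(x, y) = x^2 + y^2 - 1, whose zero set S is the unit circle."""
    return 2.0 * x, 2.0 * y

def Psi(p, t):
    """The map (15.22): move signed distance t from p in S along the unit normal."""
    gx, gy = grad_psi(*p)
    norm = math.hypot(gx, gy)
    return p[0] + t * gx / norm, p[1] + t * gy / norm

def phi(x, y):
    """Closed-form solution for this particular S: signed distance to the circle."""
    return math.hypot(x, y) - 1.0

p = (math.cos(0.3), math.sin(0.3))
q = (math.cos(2.0), math.sin(2.0))
check = phi(*Psi(p, 0.25))                        # recovers t = 0.25, as in (15.20)
focal_p, focal_q = Psi(p, -1.0), Psi(q, -1.0)     # distinct points of S, same image
```

Correspondingly, $\varphi$ is not differentiable at the origin, which is where the smooth solution breaks down in this example.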

Having solved the special sort of first-order PDE known as the eikonal equation, we now tackle the general case (15.1)–(15.2), subject to the condition (15.3). We use a method, called Jacobi's trick, of defining $u$ implicitly by

(15.23) $V(x, u(x)) = 0$

and producing a PDE for $V$ of the eikonal type. Indeed (15.23) gives, with $V = V(x, z)$,

(15.24) $\nabla_x V + V_z \nabla u = 0$, or $\nabla u = -V_z^{-1} \nabla_x V,$

so set

(15.25) $g(x, z, \xi, \zeta) = F(x, z, -\zeta^{-1}\xi).$

Our equation for $V$ is hence $F(x, z, -V_z^{-1}\nabla_x V) = 0$, or

(15.26) $g(x, z, \nabla_{x,z} V) = 0.$

This is of eikonal type. Our initial condition is

(15.27) $V = z - v$ on $x_n = 0$.

This gives $V_z \neq 0$ locally, so by the implicit function theorem, (15.23) defines a function $u(x)$, which solves the system (15.1)–(15.2).


Exercises 


1. Let $X$ be a vector field on a region $\Omega$, generating a flow $F^t$, which we will assume is defined everywhere. Consider the linear PDE

(15.28) $\frac{\partial u}{\partial t} = Xu, \quad u(0, x) = f(x).$

Show that a solution is given by

$u(t, x) = f(F^t x).$

Show that the equation

(15.29) $\frac{\partial u}{\partial t} = Xu + g(t, x), \quad u(0, x) = f(x)$

is solved by

$u(t, x) = f(F^t x) + \int_0^t g(s, F^{t-s} x)\, ds,$

and that

(15.30) $\frac{\partial u}{\partial t} = Xu + a(t, x) u, \quad u(0, x) = f(x)$

is solved by

$u(t, x) = e^{\int_0^t a(s, F^{t-s} x)\, ds} f(F^t x).$

(Hint: The solution to (15.28) is constant on integral curves of $\partial/\partial t - X$ in $\mathbb{R} \times \Omega$. Apply Duhamel's principle to (15.29). Then find $A(t, x)$ such that (15.30) is equivalent to

$\Bigl( \frac{\partial}{\partial t} - X \Bigr)(e^{A} u) = 0.$)


2. A PDE of the form

$\frac{\partial u}{\partial t} + \sum_j a_j(x, u) \frac{\partial u}{\partial x_j} = 0,$

for a real-valued $u = u(t, x)$, is a special case of a quasilinear equation. Show that if we set $u(0, x) = v(x) \in C^\infty(\mathbb{R}^n)$, then there is a unique smooth solution in a neighborhood of $\{0\} \times \mathbb{R}^n$ in $\mathbb{R}^{n+1}$, and $u(t, x)$ has the following property. For each $x_0 \in \mathbb{R}^n$, consider the vector field

$V_{x_0} = \frac{\partial}{\partial t} + \sum_j a_j(x, v(x_0)) \frac{\partial}{\partial x_j}.$

Then $u(t, x)$ is equal to $v(x_0)$ on the integral curve of $V_{x_0}$ through $(0, x_0)$. Considering the example

$u_t + u u_x = 0, \quad u(0, x) = e^{-x^2},$

show that this smooth solution can cease to exist globally, due to two such lines crossing.
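3. Work out explicitly the solution to

$\Bigl( \frac{\partial \varphi}{\partial x} \Bigr)^2 + \Bigl( \frac{\partial \varphi}{\partial y} \Bigr)^2 = 1,$

satisfying $\varphi(x, y) = 0$ on the parabola $y = x^2$, and $\partial\varphi/\partial y > 0$ there, using (15.19) and (15.20). Write a computer program to graph the level curves of $\varphi$. How does the solution break down?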

4. The group of dilations of $T^*M$, defined (in local coordinates) by $D(r)(x, \xi) = (x, r\xi)$, is generated by a vector field $\vartheta$ on $T^*M$, which we call the natural radial vector field. Show that $\vartheta$ is uniquely specified by the identity

$\sigma(\vartheta, X) = \langle X, \kappa \rangle,$

when $X$ is a vector field on $T^*M$, and $\kappa = \sum_j \xi_j\, dx_j$ is the contact form (14.17).

5. Suppose $\Lambda$ is a submanifold of $T^*M$ of dimension $n = \dim M$, with $\iota : \Lambda \hookrightarrow T^*M$. Show that $\Lambda$ is Lagrangian if and only if $\iota^*\kappa$ is a closed 1-form on $\Lambda$ (hence locally exact). If $\Lambda$ is Lagrangian, relate $\iota^*\kappa = df$ on $\Lambda$ to $du$, in the context of Proposition 15.1.

6. Suppose $\Lambda$ is a Lagrangian submanifold of $T^*M$, transverse to $\vartheta$. Define a subbundle $V$ of $T\Lambda$ by

$V_{(x,\xi)} = (\vartheta)^\sigma \cap T_{(x,\xi)}\Lambda,$

where $(\vartheta)^\sigma$ is the set of vectors $v \in T_{(x,\xi)}T^*M$ such that $\sigma(\vartheta, v) = 0$. Show that $V$ is an integrable subbundle of $T\Lambda$, that is, that Frobenius's theorem applies to $V$, giving a foliation of $\Lambda$. If $\Lambda$ is the graph of $du$, $u \in C^\infty(M)$, show that the inverse image, under $\pi : \Lambda \to M$, of the level sets of $u$ gives the leaves of this foliation of $\Lambda$.


16. Completely integrable hamiltonian systems 


Here we will examine the consequences of having $n$ “conservation laws” for a Hamiltonian system with $n$ degrees of freedom. More precisely, suppose $\mathcal{O}$ is a region in $\mathbb{R}^{2n}$, with coordinates $(x, \xi)$ and symplectic form $\sigma = \sum_{j=1}^n d\xi_j \wedge dx_j$, or more generally $\mathcal{O}$ could be a symplectic manifold of dimension $2n$. Suppose we have $n$ functions $u_1, \dots, u_n$, in involution, that is,

(16.1) $\{u_j, u_k\} = 0, \quad 1 \le j, k \le n.$


The function $u_1 = F$ could be the energy function whose Hamiltonian vector field we want to analyze, and $u_2, \dots, u_n$ auxiliary functions, constructed to reflect conservation laws. We give some examples shortly. In case one has $n$ such functions, with linearly independent gradients, one is said to have a completely integrable system.
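To make the involution condition (16.1) concrete, the sketch below checks it numerically for a planar central-force problem ($n = 2$), with $u_1$ the energy and $u_2$ the angular momentum. The specific potential $-1/|x|$, the test point, the helper names, and the bracket convention $\{f, g\} = \sum_j (\partial f/\partial\xi_j\, \partial g/\partial x_j - \partial f/\partial x_j\, \partial g/\partial\xi_j)$ are assumptions made here for illustration.

```python
import math

def gradient(f, z, h=1e-5):
    """Central-difference gradient of f at z = (x1, x2, xi1, xi2)."""
    grad = []
    for i in range(len(z)):
        zp, zm = list(z), list(z)
        zp[i] += h
        zm[i] -= h
        grad.append((f(zp) - f(zm)) / (2 * h))
    return grad

def poisson_bracket(f, g, z):
    """{f, g} = sum_j (df/dxi_j dg/dx_j - df/dx_j dg/dxi_j), convention assumed."""
    df, dg = gradient(f, z), gradient(g, z)
    n = len(z) // 2
    return sum(df[n + j] * dg[j] - df[j] * dg[n + j] for j in range(n))

def energy(z):
    x1, x2, xi1, xi2 = z
    return 0.5 * (xi1 ** 2 + xi2 ** 2) - 1.0 / math.hypot(x1, x2)

def angular_momentum(z):
    x1, x2, xi1, xi2 = z
    return x1 * xi2 - x2 * xi1

z0 = [0.8, -0.3, 0.2, 0.9]
bracket = poisson_bracket(energy, angular_momentum, z0)                   # expected near 0
convention = poisson_bracket(lambda z: z[2], lambda z: z[0], z0)          # {xi_1, x_1}
```

The second bracket, $\{\xi_1, x_1\}$, equals 1 under the convention above and serves only to confirm the sign choice.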

Our goal here will be to show that in such a case the flows generated by the $H_{u_j}$ can be constructed by quadrature. We define the last concept as follows. Given a collection of functions $\{u_j\}$, a map is said to be constructed by quadrature if it is produced by a composition of the following operations:


(a) Elementary algebraic manipulation 
(b) Differentiation 

(c) Integration 

(d) Constructing inverses of maps 


To begin the study of a completely integrable system, given (16.1), consider,
for a given $p \in \mathbb{R}^n$, the level set

(16.2)    $M_p = \{(x, \xi) \in \mathcal{O} : u_j(x, \xi) = p_j\}.$

Assuming the $u_j$ have linearly independent gradients, each nonempty $M_p$ is a
manifold of dimension $n$. Note that each vector field $H_{u_j}$ is tangent to $M_p$, by
(16.1), and therefore $\{H_{u_j} : 1 \le j \le n\}$ spans the tangent space to $M_p$ at each
point. Since $\sigma(H_{u_j}, H_{u_k}) = \{u_j, u_k\}$, we conclude from (16.1) that

(16.3)    each $M_p$ is Lagrangian.

If we make the “generic” hypothesis

(16.4)    $\pi : M_p \to \mathbb{R}^n$ is a local diffeomorphism,

where $\pi(x, \xi) = x$, then $M_p$ is the graph of a closed 1-form $\Xi_p$ (depending
smoothly on $p$); note that $\Xi_p(x)$ is constructed by inverting a map, one of the
operations involved in construction by quadrature. Furthermore, $\Xi_p$ being closed,
we can construct a smooth function $\varphi(x, p)$ such that

(16.5)    $M_p$ is the graph of $x \mapsto d_x\varphi(x, p)$.

The function $\varphi(x, p)$ is constructed from $\Xi_p$ by an integration, another ingredient
in construction by quadrature. Note that a statement equivalent to (16.5) is that
$\varphi$ simultaneously satisfies the eikonal equations

(16.6)    $u_j(x, d_x\varphi(x, p)) = p_j, \quad 1 \le j \le n.$

Consider now the following maps:

(16.7)    $F_1 : (x, p) \mapsto (d_p\varphi(x, p), p), \qquad F_2 : (x, p) \mapsto (x, d_x\varphi(x, p)), \qquad C : F_1(x, p) \mapsto F_2(x, p).$

Since $F_2(x, p) = (x, \Xi_p(x))$, it is clear that $F_2$ is a local diffeomorphism under
our hypotheses. This implies that the matrix

(16.8)    $\Bigl( \frac{\partial^2 \varphi}{\partial p_j\, \partial x_k} \Bigr)$

is invertible, which hence implies that $F_1$ is a local diffeomorphism (by the inverse
function theorem). Hence $C$ is locally defined, as a diffeomorphism:

(16.9)    $C(d_p\varphi(x, p), p) = (x, d_x\varphi(x, p)).$


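Write $C(q, p) = (x, \xi)$. Note that

(16.10)   $F_2^* \sum_j d\xi_j \wedge dx_j = \sum_{j,k} \frac{\partial^2 \varphi}{\partial x_j\, \partial p_k}\, dp_k \wedge dx_j = F_1^* \sum_j dp_j \wedge dq_j,$

so

(16.11)   $C^* \Bigl( \sum_j d\xi_j \wedge dx_j \Bigr) = \sum_j dp_j \wedge dq_j,$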


that is, $C$ preserves the symplectic form. One says $C$ is a canonical transformation
with generating function $\varphi(x, p)$. Now conjugation by $C$ takes the Hamiltonian
vector fields $H_{u_j}$ on $(x, \xi)$-space to the Hamiltonian vector fields $H_{\tilde{u}_j}$ on
$(q, p)$-space, with

    $\tilde{u}_j(q, p) = u_j \circ C(q, p) = p_j,$

in view of (16.6). Thus

(16.12)   $H_{\tilde{u}_j} = \frac{\partial}{\partial q_j},$

so $C$ conjugates the flows generated by $H_{u_j}$ to simple straight-line flows. This
provides the construction of the $H_{u_j}$-flows by quadrature.

Note that if $\mathcal{O}$ has dimension 2, one needs only one function $u_1$. Thus the
construction above generalizes the treatment of Hamiltonian systems on $\mathbb{R}^2$ given
in §10. In fact, the approach given above, specialized to $n = 1$, is closer to the
analysis in §10 than it might at first appear. Using notation as in §10, let $u_1 = f$,
$p_1 = E$, so

    $M_p = \{(x, \xi) : f(x, \xi) = E\}$

is the graph of $\xi = \psi(x, E) = d_x\varphi(x, E)$, with

    $\varphi(x, E) = \int \psi(x, E)\, dx.$

Note that $f(x, \psi(x, E)) = E \Rightarrow f_\xi \psi_E = 1$, so

(16.13)   $d_E\varphi(x, E) = \int f_\xi(x, \psi(x, E))^{-1}\, dx,$

and $C$ maps $\bigl( \int f_\xi^{-1}\, dx, E \bigr)$ to $(x, \psi(x, E))$. To say $C$ conjugates $H_f$ to
$H_E = \partial/\partial q$ (in $(q, E)$ coordinates) is to say that under the time-$t$ Hamiltonian
flow, $\int f_\xi^{-1}\, dx$ is augmented by $t$; but this is precisely the content of (10.16), namely,

(16.14)   $\int f_\xi(x, \psi(x, E))^{-1}\, dx = t + C(E).$


We also note that, for the purpose of linearizing $H_{u_1}$, it suffices to have $\varphi(x, p)$,
satisfying only the eikonal equation

(16.15)   $u_1(x, d_x\varphi(x, p)) = p_1,$

such that the matrix (16.8) is invertible. The existence of $u_2, \dots, u_n$, which
together with $u_1$ are in involution, provides a way to construct $\varphi(x, p)$, but any
other successful attack on (16.15) is just as satisfactory. Integrating $H_{u_1}$ by
perceiving solutions to (16.15) is the essence of the Hamilton-Jacobi method.
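
As a minimal worked illustration of the construction (not taken from the text), consider $n = 1$ and the free-particle energy $f(x, \xi) = \xi^2/2$ on the region $\xi > 0$. The choice of example and of branch are assumptions made here for concreteness; the steps follow (16.5), (16.8), (16.9), and (16.13).

```latex
% Illustrative example: n = 1, f(x, \xi) = \xi^2 / 2, branch \xi > 0, E > 0.
\begin{align*}
  f(x, \psi(x, E)) = E \;&\Longrightarrow\; \psi(x, E) = \sqrt{2E}, \\
  \varphi(x, E) = \int \psi(x, E)\, dx &= x \sqrt{2E}, \\
  q = d_E \varphi(x, E) &= \frac{x}{\sqrt{2E}}, \qquad
  \frac{\partial^2 \varphi}{\partial E\, \partial x} = \frac{1}{\sqrt{2E}} \neq 0, \\
  C(q, E) &= \bigl( q \sqrt{2E},\ \sqrt{2E} \bigr).
\end{align*}
% The flow of H_f is x(t) = x_0 + t \sqrt{2E}, \xi(t) = \sqrt{2E}, hence
% q(t) = x(t) / \sqrt{2E} = q_0 + t, a straight-line flow in (q, E) coordinates.
```

Here $q = \int f_\xi^{-1}\, dx = x/\sqrt{2E}$ agrees with (16.13), and the time-$t$ flow augments $q$ by $t$, as (16.14) predicts.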

We now look at some examples of completely integrable Hamiltonian systems.
First we consider geodesic flow on a two-dimensional surface of revolution
$M^2 \subset \mathbb{R}^3$. Note that $T^*M^2$ is four-dimensional, so we want $u_1$ and $u_2$, in
involution. The function $u_1$ is, of course, the energy function
$u_1 = (1/2) \sum g^{jk}(x)\, \xi_j \xi_k$; as we have seen, $H_{u_1}$ generates the geodesic flow.
Our function $u_2$ will arise from the group of rotations $R_\theta$ of $M^2$ about its axis of
symmetry, $\theta \in \mathbb{R}/2\pi\mathbb{Z}$. This produces a group $\mathcal{R}_\theta$ of canonical
transformations of $T^*M^2$, generated by a Hamiltonian vector field $X = H_{u_2}$, with
$u_2(x, \xi) = \langle \partial/\partial\theta, \xi \rangle$. Since $R_\theta$ is a group of isometries of $M^2$,
$\mathcal{R}_\theta$ preserves $u_1$ (i.e., $X u_1 = 0$), or equivalently, $\{u_2, u_1\} = 0$. We have our
pair of functions in involution. Thus geodesics on such a surface of revolution can be
constructed by quadrature.

Another important class of completely integrable Hamiltonian systems is provided
by motion in a central force field in the plane $\mathbb{R}^2$. In other words, let $x(t)$, a
path in $\mathbb{R}^2$, satisfy

(16.16)   $\ddot{x} = -\nabla V(x), \quad V(x) = v(|x|).$

The Hamiltonian system is

(16.17)   $\dot{x} = \nabla_\xi F, \quad \dot{\xi} = -\nabla_x F,$

with

(16.18)   $F(x, \xi) = \frac{1}{2}|\xi|^2 + v(|x|).$


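We take $u_1 = F$ and look for $u_2$, in involution. Again $u_2$ arises from a group of
rotations, this time rotations of $\mathbb{R}^2$ about the origin. The method we have given by
which a vector field on $\Omega$ produces a Hamiltonian vector field on $T^*\Omega$ yields the
formula

(16.19)   $u_2(x, \xi) = \Bigl\langle \frac{\partial}{\partial\theta}, \xi \Bigr\rangle = \Bigl\langle x_1 \frac{\partial}{\partial x_2} - x_2 \frac{\partial}{\partial x_1}, \xi \Bigr\rangle = x_1 \xi_2 - x_2 \xi_1.$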


This is the “angular momentum.” The symmetry of $V(x)$ implies that the group
of rotations on $T^*\mathbb{R}^2$ generated by $H_{u_2}$ preserves $F = u_1$, that is,

(16.20)   $\{u_1, u_2\} = 0,$

a fact that is also easily verified from (16.18) and (16.19) by a computation.
This expresses the well-known law of conservation of angular momentum. It also
establishes the complete integrability of the general central force problem on $\mathbb{R}^2$.
We remark that, for the general central force problem in $\mathbb{R}^n$, conservation of
angular momentum forces any path to lie in a plane, so there is no loss of generality in
studying planar motion.
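
The computation behind (16.20) can be spot-checked numerically. The sketch below is illustrative only: it assumes the Kepler potential $v(r) = -K/r$ with $K = 1$, the sign convention $\{f, g\} = \sum_j (\partial_{\xi_j} f\, \partial_{x_j} g - \partial_{x_j} f\, \partial_{\xi_j} g)$ (the sign does not matter for a vanishing bracket), and central finite differences; the helper names are hypothetical.

```python
import math

def grad(f, z, h=1e-5):
    """Central-difference gradient of f at the point z (a list of floats)."""
    g = []
    for i in range(len(z)):
        zp, zm = list(z), list(z)
        zp[i] += h
        zm[i] -= h
        g.append((f(zp) - f(zm)) / (2 * h))
    return g

def poisson_bracket(f, g, z):
    """{f, g} at z = (x_1..x_n, xi_1..xi_n), with the assumed convention
    {f, g} = sum_j (df/dxi_j * dg/dx_j - df/dx_j * dg/dxi_j)."""
    n = len(z) // 2
    df, dg = grad(f, z), grad(g, z)
    return sum(df[n + j] * dg[j] - df[j] * dg[n + j] for j in range(n))

K = 1.0  # illustrative strength of the potential v(r) = -K / r

def energy(z):
    """F of (16.18) with v(r) = -K / r."""
    x1, x2, xi1, xi2 = z
    return 0.5 * (xi1 ** 2 + xi2 ** 2) - K / math.hypot(x1, x2)

def angular_momentum(z):
    """u_2 of (16.19)."""
    x1, x2, xi1, xi2 = z
    return x1 * xi2 - x2 * xi1

def xi1_only(z):
    """A function that is NOT conserved, for contrast."""
    return z[2]

z0 = [1.0, 0.5, -0.3, 0.8]  # arbitrary test point away from x = 0
print(poisson_bracket(energy, angular_momentum, z0))  # close to zero
print(poisson_bracket(energy, xi1_only, z0))          # clearly nonzero
```

The first bracket vanishes up to finite-difference error, while the bracket of the energy with a single momentum component does not, since the potential is not translation invariant.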

The case

(16.21)   $V(x) = -\frac{K}{|x|} \quad (K > 0)$

of the central force problem is called the Kepler problem. It gives Newton’s
description of a planet traveling about a massive star, or of two celestial bodies
revolving about their center of mass. We will give a direct study of central force
problems, with particular attention to the Kepler problem, in the next section.

These examples of completely integrable systems have been based on only the
simplest of symmetry considerations. We will give a treatment below of the free
motion of a rigid body in $\mathbb{R}^n$, for which symmetry plays a more subtle role. For
many other examples of completely integrable systems, see [Wh].

We have dealt here only with the local behavior of completely integrable systems.
There is also an interesting “global” theory, which among other things
studies the distinction between the regular behavior of completely integrable
systems on the one hand and varieties of “chaotic behavior” exhibited by (globally)
nonintegrable systems on the other. The reader can find out more about
this important topic (begun in [Poi]) in [Mos], [TS], [Wig], and references given
therein.


Exercises 


1. Let $u_1(x, \xi) = (1/2)|\xi|^2 - |x|^{-1}$ be the energy function for the Kepler problem (with
$K = 1$), and let $u_2(x, \xi)$ be given by (16.19). Set

    $v_j(x, \xi) = x_j |x|^{-1} - x_j |\xi|^2 + (x \cdot \xi)\, \xi_j, \quad j = 1, 2.$


$(v_1, v_2)$ is called the Lenz vector. Show that the following Poisson bracket relations
hold:

    $\{u_1, v_j\} = 0, \quad j = 1, 2,$


{u2,v;} = +u;, 


    $\{v_1, v_2\} = 2 u_1 u_2.$

Also show that

    $v_1^2 + v_2^2 - 2 u_1 u_2^2 = 1.$


2. Deduce that the Kepler problem is integrable in several different ways. Can you relate 
this to the fact that all bounded orbits are periodic? 


In Exercises 3-5, suppose a given $M_p$, as in (16.2), is compact, and $du_j$, $1 \le j \le n$,
are linearly independent at each point of $M_p$.

3. Show that there is an $\mathbb{R}^n$-action on $M_p$, defined by
$\Phi(t)(\zeta) = F_1^{t_1} \cdots F_n^{t_n} \zeta$, for $t = (t_1, \dots, t_n)$, $\zeta \in M_p$, where $F_j^{t}$ is
the flow generated by $H_{u_j}$. Show that $\Phi(t + s)\zeta = \Phi(t)\Phi(s)\zeta$.

4. Show that $\mathbb{R}^n$ acts transitively on $M_p$, that is, given $\zeta \in M_p$,
$\mathcal{O}(\zeta) = \{\Phi(t)\zeta : t \in \mathbb{R}^n\}$ is all of $M_p$. (Hint: Use the linear independence
to show $\mathcal{O}(\zeta)$ is open. Then, if $\zeta_1$ is on the boundary of $\mathcal{O}(\zeta)$ in $M_p$, show that
$\mathcal{O}(\zeta_1) \cap \mathcal{O}(\zeta) \ne \emptyset$.)

5. Fix $\zeta_0 \in M_p$ and let $\Gamma = \{t \in \mathbb{R}^n : \Phi(t)\zeta_0 = \zeta_0\}$. Show that $M_p$ is
diffeomorphic to $\mathbb{R}^n/\Gamma$ and that this is a torus.

6. If $u_1 = F$ can be extended to a completely integrable system in two different ways, with
the setting of Exercises 3-5 applicable in each case, then phase space may be foliated
by tori in two different ways. Hence intersections of various tori will be invariant under
$H_F$. How does this relate to Exercise 2?


17. Examples of integrable systems; central force problems 


In the last section it was noted that central force problems give rise to a class of
completely integrable Hamiltonian systems with two degrees of freedom. Here
we will look at this again, from a more elementary point of view. We look at a
class of Hamiltonians on a region in $\mathbb{R}^4$, of the form

(17.1)    $F(y, \eta) = F(y_1, \eta_1, \eta_2),$

that is, with no $y_2$-dependence. Thus Hamilton’s equations take the form

(17.2)    $\dot{y}_j = \frac{\partial F}{\partial \eta_j}, \quad \dot{\eta}_1 = -\frac{\partial F}{\partial y_1}, \quad \dot{\eta}_2 = 0.$

In particular, $\eta_2$ is constant on any orbit, say

(17.3)    $\eta_2 = L.$


This, in addition to $F$, provides the second conservation law implying integrability;
note that $\{F, \eta_2\} = 0$. If $F(y_1, \eta_1, L) = E$ on an integral curve, we write this
relation as

(17.4)    $\eta_1 = \psi(y_1, L, E).$

We can now pursue an analysis that is a variant of that described by (10.14)-(10.20).
The first equation in (17.2) becomes

(17.5)    $\dot{y}_1 = F_{\eta_1}(y_1, \psi(y_1, L, E), L),$

with solution given implicitly by

(17.6)    $\int F_{\eta_1}(y_1, \psi(y_1, L, E), L)^{-1}\, dy_1 = t + C.$

Once one has $y_1(t)$, then one has

(17.7)    $\eta_1(t) = \psi(y_1(t), L, E),$

and then the remaining equation in (17.2) becomes

(17.8)    $\dot{y}_2 = F_{\eta_2}(y_1(t), \eta_1(t), L),$

which is solved by an integration.
We apply this method to the central force problem, with

(17.9)    $F(x, \xi) = \frac{1}{2}|\xi|^2 + v(|x|), \quad x \in \mathbb{R}^2.$

Use of polar coordinates is clearly suggested, so we set

(17.10)   $y_1 = r, \quad y_2 = \theta; \quad x_1 = r\cos\theta, \quad x_2 = r\sin\theta.$

In these coordinates, the Euclidean metric $dx_1^2 + dx_2^2$ becomes $dr^2 + r^2\, d\theta^2$, so,
as in (12.31), the function $F$ becomes

(17.11)   $F(y, \eta) = \frac{1}{2}\bigl(\eta_1^2 + y_1^{-2}\eta_2^2\bigr) + v(y_1).$

We see that the first pair of ODEs in (17.2) takes the form

(17.12)   $\dot{r} = \eta_1, \quad \dot{\theta} = L r^{-2},$


where $L$ is the constant value of $\eta_2$ along an integral curve, as in (17.3). The last
equation, rewritten as

(17.13)   $r^2 \dot{\theta} = L,$

expresses conservation of angular momentum. The remaining ODE in (17.2)
becomes

(17.14)   $\dot{\eta}_1 = L^2 r^{-3} - v'(r).$

Note that differentiating the first equation of (17.12) and using (17.14) gives

(17.15)   $\ddot{r} = L^2 r^{-3} - v'(r),$

an equation that can be integrated by the methods described in (10.12)-(10.20).
We will not solve (17.15) by this means here, though (17.15) will be used below,
to produce (17.23). For now, we instead use (17.4)-(17.6). In the present case,
(17.4) takes the form

(17.16)   $\eta_1 = \pm\bigl[2E - 2v(r) - L^2 r^{-2}\bigr]^{1/2},$

and since $F_{\eta_1} = \eta_1$, (17.6) takes the form

(17.17)   $\pm \int r\, \bigl[2E r^2 - 2 r^2 v(r) - L^2\bigr]^{-1/2}\, dr = t + C.$

In the case of the Kepler problem (16.21), where $v(r) = -K/r$, the resulting
integral

(17.18)   $\pm \int r\, \bigl(2E r^2 + 2K r - L^2\bigr)^{-1/2}\, dr = t + C$

can be evaluated using techniques of first-year calculus, by completing the square
in $2E r^2 + 2K r - L^2$. Once $r = r(t)$ is given, equation (17.13) provides an integral
formula for $\theta = \theta(t)$.
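
Instead of evaluating the integral in closed form, one can integrate the system (17.12), (17.14) numerically and watch the conservation law $F = E$. The following is a rough sketch under stated assumptions: the Kepler potential with $K = 1$, a classical fourth-order Runge-Kutta step, and illustrative initial data; none of these choices come from the text.

```python
K = 1.0  # illustrative Kepler constant in v(r) = -K / r

def rhs(state, L):
    """Right sides of (17.12) and (17.14) for v(r) = -K / r.
    state = (r, theta, eta1); L is the conserved value of eta2."""
    r, theta, eta1 = state
    return (eta1, L / r ** 2, L ** 2 / r ** 3 - K / r ** 2)

def rk4_step(state, L, dt):
    """One classical Runge-Kutta step of size dt."""
    def shifted(s, k, c):
        return tuple(si + c * ki for si, ki in zip(s, k))
    k1 = rhs(state, L)
    k2 = rhs(shifted(state, k1, dt / 2), L)
    k3 = rhs(shifted(state, k2, dt / 2), L)
    k4 = rhs(shifted(state, k3, dt), L)
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def energy(state, L):
    """F of (17.11) with eta2 = L and v(r) = -K / r."""
    r, _, eta1 = state
    return 0.5 * (eta1 ** 2 + L ** 2 / r ** 2) - K / r

def integrate(state, L, dt, steps):
    for _ in range(steps):
        state = rk4_step(state, L, dt)
    return state

L = 0.9                  # angular momentum, an arbitrary choice
start = (1.0, 0.0, 0.0)  # r = 1, theta = 0, eta1 = 0: a bound orbit
end = integrate(start, L, 1e-3, 5000)
print(energy(start, L), energy(end, L))  # the two values should agree closely
```

Conservation of angular momentum is built in here, because $\eta_2 = L$ enters only as a parameter; the energy comparison is the nontrivial check.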
One of the most remarkable aspects of the analysis of the Kepler problem is
the demonstration that orbits all lie on some conic section, given in polar
coordinates by

(17.19)   $r\,[1 + e\cos(\theta - \theta_0)] = ed,$

where $e$ is the “eccentricity.” We now describe the famous, elementary but
ingenious trick used to demonstrate this. The method involves producing a differential
equation for $r$ in terms of $\theta$, from (17.13) and (17.15). More precisely, we produce
a differential equation for $u$, defined by

(17.20)   $u = r^{-1}.$

By the chain rule,

(17.21)   $\frac{dr}{dt} = -r^2 \frac{du}{dt} = -r^2 \frac{du}{d\theta} \frac{d\theta}{dt} = -L \frac{du}{d\theta},$

in light of (17.13). Differentiating this with respect to $t$ gives

(17.22)   $\frac{d^2 r}{dt^2} = -L \frac{d}{dt} \frac{du}{d\theta} = -L \frac{d^2 u}{d\theta^2} \frac{d\theta}{dt} = -L^2 u^2 \frac{d^2 u}{d\theta^2},$

again using (17.13). Comparing this with (17.15), we get
$-L^2 u^2 (d^2u/d\theta^2) = L^2 u^3 - v'(1/u)$ or, equivalently,

(17.23)   $\frac{d^2 u}{d\theta^2} + u = (Lu)^{-2}\, v'\Bigl(\frac{1}{u}\Bigr).$

In the case of the Kepler problem, $v(r) = -K/r$, the right side becomes the
constant $K/L^2$, so in this case (17.23) becomes the linear equation

(17.24)   $\frac{d^2 u}{d\theta^2} + u = \frac{K}{L^2},$

with general solution

(17.25)   $u(\theta) = A\cos(\theta - \theta_0) + \frac{K}{L^2},$

which is equivalent to the formula (17.19) for a conic section.

For more general central force problems, equation (17.23) is typically not linear, but
it is of the form treatable by the method of (10.12)-(10.20).
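
For the Kepler case, the passage from (17.24) to (17.25) and (17.19) can be checked numerically. The sketch below is an illustration, not the text's method: it integrates (17.24) in $\theta$ with a Runge-Kutta scheme, fits $A$ and $\theta_0$ to the initial data, and reads off the eccentricity from $e = A L^2 / K$ (which follows by comparing (17.25) with (17.19)). The function names and sample values are assumptions.

```python
import math

def solve_u(K, L, u0, du0, theta_end, steps):
    """Integrate u'' + u = K / L**2, equation (17.24), from theta = 0
    to theta_end with classical RK4; returns u(theta_end)."""
    h = theta_end / steps
    c = K / L ** 2
    u, w = u0, du0  # w stands for du/dtheta

    def f(u, w):
        return w, c - u

    for _ in range(steps):
        k1 = f(u, w)
        k2 = f(u + h / 2 * k1[0], w + h / 2 * k1[1])
        k3 = f(u + h / 2 * k2[0], w + h / 2 * k2[1])
        k4 = f(u + h * k3[0], w + h * k3[1])
        u += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        w += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
    return u

def conic_parameters(K, L, u0, du0):
    """Amplitude A, phase theta_0 of (17.25) and eccentricity e of (17.19),
    fitted to u(0) = u0, u'(0) = du0."""
    c = K / L ** 2
    A = math.hypot(u0 - c, du0)
    theta0 = math.atan2(du0, u0 - c)
    return A, theta0, A / c

def closed_form(K, L, u0, du0, theta):
    """Evaluate (17.25) with the fitted constants."""
    A, theta0, _ = conic_parameters(K, L, u0, du0)
    return A * math.cos(theta - theta0) + K / L ** 2

K, L = 1.0, 0.9
u0, du0 = 1.0, 0.0  # r = 1 at theta = 0, with dr/dtheta = 0
theta = 2.0
print(solve_u(K, L, u0, du0, theta, 2000), closed_form(K, L, u0, du0, theta))
print(conic_parameters(K, L, u0, du0)[2])  # eccentricity; below 1 means an ellipse
```

With these sample values the eccentricity is $1 - L^2/K = 0.19$, so the orbit is an ellipse.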
Exercises


1. Solve explicitly $w''(t) = -w(t)$, for $w$ taking values in $\mathbb{R}^2 = \mathbb{C}$. Show that
$|w(t)|^2 + |w'(t)|^2 = 2E$ is constant for each orbit.

2. For $w(t)$ taking values in $\mathbb{C}$, define a new curve by

    $Z(\tau) = w(t)^2, \quad \frac{d\tau}{dt} = |w(t)|^2.$

Show that if $w''(t) = -w(t)$, then

    $Z''(\tau) = -4E\, \frac{Z(\tau)}{|Z(\tau)|^3},$

that is, $Z(\tau)$ solves the Kepler problem.

3. Analyze the flow of $H_F$, for $F$ of the form (17.1), in a manner more directly parallel
to the approach in §16, in a spirit similar to (16.13) and (16.14). Note that, with $u_1 = F$,
$u_2 = \eta_2$, $p_1 = E$, $p_2 = L$, the canonical transformation $C$ of (16.9) is defined by

    $C\Bigl( \int F_{\eta_1}^{-1}\, dy_1,\ y_2 - \int F_{\eta_1}^{-1} F_{\eta_2}\, dy_1,\ E,\ L \Bigr) = \bigl( y,\ \psi(y_1, L, E),\ L \bigr),$

where the first integrand is $F_{\eta_1}(y_1, \psi(y_1, L, E), L)^{-1}$ and so on.

4. Analyze equation (17.23) for $u(\theta)$ in the following cases.

(a) $v(r) = -K/r^2$
(b) $v(r) = Kr^2$
(c) $v(r) = -K/r + \varepsilon r^2$

Show that, in case (c), $u(\theta)$ is typically not periodic in $\theta$.

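5. Consider motion on a surface of revolution, under a force arising from a rotationally
invariant potential. Show that you can choose coordinates $(r, \theta)$ so that the metric tensor
is $ds^2 = dr^2 + \beta(r)^{-1}\, d\theta^2$, and then you get a Hamiltonian system of the form (17.2)
with

    $F(y_1, \eta_1, \eta_2) = \frac{1}{2}\eta_1^2 + \frac{1}{2}\beta(y_1)\, \eta_2^2 + v(y_1),$

where $y_1 = r$, $y_2 = \theta$. Show that, parallel to (17.16) and (17.17), you get

    $\dot{r} = \pm\bigl[ 2E - 2v(r) - L^2 \beta(r) \bigr]^{1/2}.$

Show that $u = 1/r$ satisfies

    $\frac{du}{d\theta} = \mp \frac{u^2}{L\, \beta(1/u)} \Bigl[ 2E - 2v\Bigl(\frac{1}{u}\Bigr) - L^2 \beta\Bigl(\frac{1}{u}\Bigr) \Bigr]^{1/2}.$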


18. Rigid body motion in R” 


Suppose there is a rigid body in $\mathbb{R}^n$, with a mass distribution at $t = 0$ given by
a function $\rho(x)$, which we will assume is piecewise continuous and has compact
support. We also assume $\rho \ge 0$ and it is not identically zero. Suppose the body
moves, subject to no external forces, only the constraint of being rigid. We produce
equations of motion for this body, following Euler. The method described
here can be regarded as a precursor to the method used in Chapter 17 to produce
the Euler equations of ideal fluid flow.

According to the Euler-Lagrange approach to mechanics introduced in §12,
we seek a critical path of the integrated kinetic energy, subject to the constraint of
rigidity. If $\xi(t, x)$ is the position at time $t$ of the point on the body whose position
at time 0 is $x$, then we can write the Lagrangian as

(18.1)    $I(\xi) = \frac{1}{2} \int_{t_0}^{t_1} \int_{\mathbb{R}^n} \rho(x)\, |\dot{\xi}(t, x)|^2\, dx\, dt.$
a 


Using center of mass coordinates, we will assume the center of mass of the body
is at the origin, and its total linear momentum is zero, so

(18.2)    $\xi(t, x) = W(t)x, \quad W(t) \in SO(n),$

where $SO(n)$ is the group of rotations of $\mathbb{R}^n$. Thus, describing the motion of the
body becomes the problem of specifying the curve $W(t)$ in $SO(n)$. We can write
(18.1) as

(18.3)    $I(\xi) = J(W) = \frac{1}{2} \int_{t_0}^{t_1} \int_{\mathbb{R}^n} \rho(x)\, |W'(t)x|^2\, dx\, dt.$
0 Re 


We look for a critical point, where we vary the family of paths W : [to, ti] > 
SO(n), keeping the endpoints fixed. 

We want to reduce the formula (18.3) for J(W) to a single integral, over t. To 
do this, we bring in the following. 


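Lemma 18.1. If $A, B \in M(n, \mathbb{R})$, then

(18.4)    $\int_{\mathbb{R}^n} \rho(x)\, \langle Ax, Bx \rangle\, dx = \mathrm{Tr}(A I_\rho B^t),$

where $I_\rho \in M(n, \mathbb{R})$ is defined by

(18.5)    $I_\rho = \int_{\mathbb{R}^n} \rho(x)\, x x^t\, dx.$

Proof. It suffices to note that

(18.6)    $\langle Ax, Bx \rangle = \mathrm{Tr}(A x x^t B^t),$

as a consequence of the identity $\langle x, y \rangle = \mathrm{Tr}\, x y^t$, for $x, y \in \mathbb{R}^n$, regarded as
column vectors.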
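
The identity (18.4) is easy to test on a toy model. The sketch below replaces the density $\rho(x)\,dx$ by a finite set of point masses, which is an assumption made only so that the integral becomes a sum; the matrices and masses are arbitrary sample data, and the helper names are hypothetical.

```python
def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(col) for col in zip(*A)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def inertia(points):
    """Discrete analogue of (18.5): sum of m * x x^t over point masses (m, x)."""
    n = len(points[0][1])
    I = [[0.0] * n for _ in range(n)]
    for m, x in points:
        for i in range(n):
            for j in range(n):
                I[i][j] += m * x[i] * x[j]
    return I

# Arbitrary sample data: two 3 x 3 matrices and three point masses.
A = [[1.0, 2.0, 0.0], [0.0, -1.0, 3.0], [2.0, 0.5, 1.0]]
B = [[0.0, 1.0, -1.0], [4.0, 0.0, 2.0], [1.0, 1.0, 1.0]]
points = [(2.0, [1.0, 0.0, -1.0]), (0.5, [0.3, 2.0, 1.0]), (1.5, [-1.0, 1.0, 0.0])]

# Left side of (18.4): sum of m * <Ax, Bx>; right side: Tr(A I B^t).
lhs = sum(m * dot(matvec(A, x), matvec(B, x)) for m, x in points)
rhs = trace(matmul(matmul(A, inertia(points)), transpose(B)))
print(lhs, rhs)  # the two numbers agree up to rounding
```

The matrix returned by `inertia` is symmetric by construction, matching the symmetry of $x x^t$ in (18.5).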
Note that $I_\rho$ is a symmetric, positive-definite $n \times n$ matrix. Now, using (18.4),
we can write the Lagrangian (18.3) as

(18.7)    $J(W) = \frac{1}{2} \int_{t_0}^{t_1} \mathrm{Tr}\bigl(W'(t) I_\rho W'(t)^t\bigr)\, dt = \frac{1}{2} \int_{t_0}^{t_1} Q_\rho(W'(t), W'(t))\, dt,$

where $Q_\rho$ is the inner product on $M(n, \mathbb{R})$ defined by

(18.8)    $Q_\rho(A, B) = \mathrm{Tr}(A I_\rho B^t).$

Note that this inner product is invariant under left multiplication by elements of
$SO(n)$, i.e.,

(18.9)    $W \in SO(n) \Longrightarrow Q_\rho(WA, WB) = \mathrm{Tr}(W A I_\rho B^t W^t) = Q_\rho(A, B).$

On the other hand, for $W \in SO(n)$,

(18.10)   $Q_\rho(AW, BW) = \mathrm{Tr}(A W I_\rho W^{-1} B^t),$

which is equal to $Q_\rho(A, B)$ for all $A, B \in M(n, \mathbb{R})$ if and only if $W I_\rho = I_\rho W$.
In turn, this holds for all $W \in SO(n)$ if and only if $I_\rho$ is a scalar multiple of the
identity matrix $I$.

The problem of finding a critical path $W : I \to SO(n)$ is a special case of that
of finding a geodesic on a surface $S \subset \mathbb{R}^k$ treated in §11. Parallel to (11.49), the
condition for a path $W$ to be critical is

(18.11)   $W''(t) \perp T_{W(t)} SO(n), \quad \forall\, t \in I,$

orthogonality being with respect to the inner product $Q_\rho$, i.e.,

(18.12)   $A \in T_{W(t)} SO(n) \Longrightarrow Q_\rho(W''(t), A) = 0.$

Given $V \in SO(n)$, we can define the vector space $T_V SO(n)$ as the space of all
matrices $W'(0)$, for smooth curves $W : (-\varepsilon, \varepsilon) \to SO(n)$ satisfying $W(0) = V$.
For example,

(18.13)   $T_I SO(n) = \mathrm{Skew}(n) = \{X \in M(n, \mathbb{R}) : X^t = -X\},$

and, for $V \in SO(n)$,

(18.14)   $T_V SO(n) = \{VX : X \in \mathrm{Skew}(n)\} = \{YV : Y \in \mathrm{Skew}(n)\}.$

Comparison with the derivation of geodesic equations in §11 shows that these
critical paths are geodesics on $SO(n)$, where the length of a curve $W : [t_0, t_1] \to SO(n)$
is given by

(18.15)   $\ell(W) = \int_{t_0}^{t_1} Q_\rho(W'(t), W'(t))^{1/2}\, dt.$


To proceed, we see from (18.12)-(18.14) that the condition for $W : I \to SO(n)$
to be a critical path for (18.7) is

(18.16)   $\mathrm{Tr}\bigl(W(t)^{-1} W''(t) I_\rho X\bigr) = 0, \quad \forall\, X \in \mathrm{Skew}(n),$

upon setting $A = W(t)X$ in (18.12). It is convenient to bring in

(18.17)   $Z(t) = W(t)^{-1} W'(t),$

and derive an equation for $Z(t)$ from (18.16). For this task, the following
observation is useful.


Proposition 18.2. If $W : I \to SO(n)$ is a smooth curve, then

(18.18)   $Z(t) \in \mathrm{Skew}(n), \quad \forall\, t \in I.$

Proof. Differentiating $W(t)^t W(t) = I$ gives

    $W'(t)^t W(t) = -W(t)^t W'(t),$

hence

    $Z(t)^t = W'(t)^t W(t) = -Z(t).$


To recast (18.16) in terms of $Z(t)$, note that (18.17) yields

(18.19)   $Z'(t) = W(t)^{-1} W''(t) - W(t)^{-1} W'(t) W(t)^{-1} W'(t) = W(t)^{-1} W''(t) - Z(t)^2.$

Now, given $B \in M(n, \mathbb{R})$,

(18.20)   $\mathrm{Tr}(BX) = 0 \ \ \forall\, X \in \mathrm{Skew}(n) \iff B = B^t.$

Hence the condition (18.16) is equivalent to the statement that

(18.21)   $[Z'(t) + Z(t)^2]\, I_\rho$ is symmetric.

If we denote the matrix in (18.21) by $B$ and compute $B - B^t$, we arrive at the
following result.


Proposition 18.3. If we define $Z(t)$ by (18.17), the condition that $W : I \to SO(n)$
be a critical path for (18.7) is equivalent to

(18.22)   $Z'(t) I_\rho + I_\rho Z'(t) + Z(t)^2 I_\rho - I_\rho Z(t)^2 = 0.$

To work on (18.22), let us define

(18.23)   $\mathcal{L}_\rho : \mathrm{Skew}(n) \to \mathrm{Skew}(n), \quad \mathcal{L}_\rho X = \frac{1}{2}(X I_\rho + I_\rho X).$

Then (18.22) can be written

(18.24)   $2 \mathcal{L}_\rho Z'(t) - [I_\rho, Z(t)^2] = 0,$

where, generally, $[A, B] = AB - BA$. In turn, if we set

(18.25)   $M(t) = \mathcal{L}_\rho Z(t) = \frac{1}{2}\bigl(Z(t) I_\rho + I_\rho Z(t)\bigr)$

and note that

(18.26)   $[I_\rho, Z^2] = 2[M, Z],$

we can recast (18.24) as

(18.27)   $M'(t) = [M(t), Z(t)],$

or equivalently

(18.28)   $M'(t) = [M(t), \mathcal{L}_\rho^{-1} M(t)],$

a system of ODE with a quadratic nonlinearity. The following result leads to valu- 
able information about M(t). 


Proposition 18.4. Suppose $M, Z : I \to \mathrm{Skew}(n)$ satisfy (18.27), for $t \in I$, that
$t_0 \in I$, and $M_0 = M(t_0)$. Then there exists $U : I \to SO(n)$ such that

(18.29)   $M(t) = U(t) M_0 U(t)^{-1}, \quad t \in I.$

Proof. We produce a linear ODE for $U(t)$. Denoting the right side of (18.29) by
$\widetilde{M}(t)$ and differentiating gives

(18.30)   $\widetilde{M}'(t) = U'(t) M_0 U(t)^{-1} - U(t) M_0 U(t)^{-1} U'(t) U(t)^{-1}$
          $\phantom{\widetilde{M}'(t)} = U'(t) U(t)^{-1} \widetilde{M}(t) - \widetilde{M}(t) U'(t) U(t)^{-1}$
          $\phantom{\widetilde{M}'(t)} = [\widetilde{M}(t), Z(t)],$

provided $Z = -U' U^{-1}$, i.e.,

(18.31)   $U'(t) = -Z(t) U(t).$

To obtain (18.29), take $U$ to solve (18.31), with $U(t_0) = I$, verify that $Z(t) \in
\mathrm{Skew}(n) \Rightarrow U(t) \in SO(n)$, and note that $M(t)$ and $\widetilde{M}(t)$ satisfy the same ODE,
with the same initial data.


Note that (18.29) implies

(18.32)   $\|M(t)\| = \|M_0\|, \quad \forall\, t \in I.$

Hence, as observed below (2.13), we have global solvability:


Proposition 18.5. Given $t_0 \in \mathbb{R}$ and initial data $M(t_0) = M_0 \in \mathrm{Skew}(n)$, the
system (18.28) has a unique solution for all $t \in \mathbb{R}$, $M : \mathbb{R} \to \mathrm{Skew}(n)$.


Having a solution to (18.28), we can retrace our steps, obtaining $Z(t) =
\mathcal{L}_\rho^{-1} M(t)$, satisfying (18.22), and then solve the linear system

(18.33)   $W'(t) = W(t) Z(t), \quad W(t_0) = W_0 \in SO(n),$

to obtain a critical path for (18.7).

The identity (18.32) says the norm $\|M(t)\|$ is a conserved quantity for solutions
to (18.28). We record some other conserved quantities.


Proposition 18.6. For each solution $M : \mathbb{R} \to \mathrm{Skew}(n)$ to (18.28) and each
$k \in \mathbb{N}$, the quantities

(18.34)   $\mathrm{Tr}\, M(t)^{2k}$

are independent of $t$. So is

(18.35)   $Q_\rho(Z(t), Z(t)),$

with $Z(t) = \mathcal{L}_\rho^{-1} M(t)$.

Proof. From (18.29) we have

(18.36)   $M(t)^{2k} = U(t) M_0^{2k} U(t)^{-1},$

and taking traces yields (18.34). To get (18.35), note that, when $W : \mathbb{R} \to SO(n)$
is a critical path for (18.7), then

(18.37)   $\frac{d}{dt} Q_\rho(W'(t), W'(t)) = 2 Q_\rho(W''(t), W'(t)) = 0,$

the last identity by (18.11)-(18.12). Since $Z(t) = W(t)^{-1} W'(t)$, (18.9) gives

(18.38)   $Q_\rho(Z(t), Z(t)) = Q_\rho(W'(t), W'(t)),$

and we have (18.35).
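
The conservation laws above can be observed in a small numerical experiment. The following sketch makes several assumptions that are not in the text: $n = 3$, coordinates chosen so that the symmetric matrix of (18.5) is diagonal with entries `a` (so the map of (18.23) acts entrywise as multiplication by $(a_j + a_k)/2$), and a classical Runge-Kutta integrator for (18.28). It then monitors $\mathrm{Tr}\, M^2$ and the quadratic form (18.35).

```python
def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    n = len(A)
    return [[AB[i][j] - BA[i][j] for j in range(n)] for i in range(n)]

def axpy(A, B, c):
    """Return A + c * B entrywise."""
    n = len(A)
    return [[A[i][j] + c * B[i][j] for j in range(n)] for i in range(n)]

def L_inverse(M, a):
    """Invert X -> (X I + I X) / 2 when I = diag(a): divide entry (j, k)
    by (a[j] + a[k]) / 2."""
    n = len(M)
    return [[2.0 * M[j][k] / (a[j] + a[k]) for k in range(n)] for j in range(n)]

def rhs(M, a):
    """Right side of (18.28): [M, L^{-1} M]."""
    return commutator(M, L_inverse(M, a))

def rk4_step(M, a, dt):
    k1 = rhs(M, a)
    k2 = rhs(axpy(M, k1, dt / 2), a)
    k3 = rhs(axpy(M, k2, dt / 2), a)
    k4 = rhs(axpy(M, k3, dt), a)
    out = M
    for k, w in ((k1, 1), (k2, 2), (k3, 2), (k4, 1)):
        out = axpy(out, k, w * dt / 6)
    return out

def trace_of_square(M):
    n = len(M)
    return sum(M[i][k] * M[k][i] for i in range(n) for k in range(n))

def kinetic_form(M, a):
    """Q(Z, Z) = Tr(Z I Z^t) with Z = L^{-1} M and I = diag(a)."""
    Z = L_inverse(M, a)
    n = len(Z)
    return sum(a[k] * Z[j][k] ** 2 for j in range(n) for k in range(n))

a = (1.0, 2.0, 3.0)  # illustrative diagonal entries, all positive
M0 = [[0.0, 1.0, -0.5], [-1.0, 0.0, 0.3], [0.5, -0.3, 0.0]]  # skew-symmetric
M = M0
for _ in range(2000):
    M = rk4_step(M, a, 1e-3)
print(trace_of_square(M0), trace_of_square(M))
print(kinetic_form(M0, a), kinetic_form(M, a))
```

Both printed pairs agree to many digits, and the computed $M(t)$ remains skew-symmetric, in line with Propositions 18.4 and 18.6.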


NOTE. The conserved quantities listed in Proposition 18.6 include two quadratic
forms in $Z$, namely

(18.39)   $Q_\rho(Z, Z) = -\mathrm{Tr}\, Z I_\rho Z,$
          $\mathrm{Tr}\, M^2 = \frac{1}{4}\, \mathrm{Tr}\,(Z I_\rho + I_\rho Z)^2.$

For more on the case n = 3, including formulas for the solutions in terms of 
elliptic integrals, see Chapter 4 of [T]. The monograph [T3] ties in the treatment 
of rigid body motion with a general class of equations (both ODEs and PDEs) that 
can be interpreted as equations for geodesics on Lie groups (infinite dimensional 
in the cases that yield PDEs). Equations treated there include the Euler equations 
for ideal fluids (treated here in Chapter 17), and various others, such as the KdV 
equations. 


19. Relativistic motion 


Mechanical systems considered in previous sections were formulated in the 
Newtonian framework. The description of a particle moving subject to a force was 
given in terms of a curve in space (with a positive-definite metric), parameterized 
by time. In the relativistic set-up, one has not space and time as separate entities, 
but rather spacetime, provided with a metric of Lorentz signature. In particular, 
Minkowski spacetime is $\mathbb{R}^4$ with inner product

(19.1)    $\langle x, y \rangle = -x_0 y_0 + \sum_{j=1}^{3} x_j y_j,$

given $x = (x_0, \dots, x_3)$, $y = (y_0, \dots, y_3)$. The behavior of a particle moving in
a force field is described by a curve in spacetime, which is timelike, that is, its
tangent vector $T$ satisfies $\langle T, T \rangle < 0$. We parameterize the curve not by time, but
by arc length, so we consider a curve $x(\tau)$ satisfying

(19.2)    $\langle u(\tau), u(\tau) \rangle = -1, \quad u(\tau) = x'(\tau).$


The parameter $\tau$ is often called “proper time,” and $u(\tau)$ the “4-velocity.” Such a
curve $x(\tau)$ is sometimes called a “world line.”

Relativistic laws of physics are to be formulated in a manner depending only
on the Lorentz metric (19.1), but contact is made with the Newtonian picture by
using the product decomposition $\mathbb{R}^4 = \mathbb{R} \times \mathbb{R}^3$, writing $x = (t, x_s)$, $t = x_0$, and
$x_s = (x_1, x_2, x_3)$. The “3-velocity” is $v = dx_s/dt$. Then

(19.3)    $u = \gamma\,(1, v),$

where, by (19.2),

(19.4)    $\gamma = \frac{dt}{d\tau} = (1 - |v|^2)^{-1/2},$

with $|v|^2 = v_1^2 + v_2^2 + v_3^2$. In the limit of small velocities, $\gamma$ is close to 1.
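
A quick numerical check of (19.2)-(19.4), offered as a sketch: it uses units in which the speed of light is 1 (implicit in (19.4)), and the sample 3-velocity and function names are arbitrary choices made here.

```python
import math

def lorentz(x, y):
    """Lorentz inner product (19.1) on R^4: -x0*y0 + x1*y1 + x2*y2 + x3*y3."""
    return -x[0] * y[0] + sum(a * b for a, b in zip(x[1:], y[1:]))

def gamma(v):
    """The factor (19.4) for a 3-velocity v with |v| < 1."""
    return 1.0 / math.sqrt(1.0 - sum(c * c for c in v))

def four_velocity(v):
    """The 4-velocity (19.3): gamma * (1, v)."""
    g = gamma(v)
    return [g] + [g * c for c in v]

v = [0.3, -0.4, 0.5]  # sample 3-velocity with |v|^2 = 0.5
u = four_velocity(v)
print(gamma(v))       # sqrt(2) for this sample
print(lorentz(u, u))  # -1 up to rounding, as required by (19.2)

slow = [1e-4, 0.0, 0.0]
print(gamma(slow))    # very close to 1 in the small-velocity limit
```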


The particle whose motion is to be described is assumed to have a constant
“rest mass” $m_0$, and then the “4-momentum” is defined to be

(19.5)    $p = m_0 u.$

In terms of the decomposition (19.3),

(19.6)    $p = (m_0 \gamma, m_0 \gamma v),$

where $m_0 v$ is the momentum in Newtonian theory. The replacement for Newton’s
equation $m_0\, dv/dt = f$ is

(19.7)    $\frac{dp}{d\tau} = F,$

the right side being the “Minkowski 4-force.”
Newtonian theory and Einstein’s relativity are related as follows. Define $m$ by
$m = m_0 \gamma$ and, using (19.6) and (19.7), write

(19.8)    $F = \Bigl( \frac{dm}{d\tau}, \frac{d(mv)}{d\tau} \Bigr) = \Bigl( \frac{dm}{d\tau}, \gamma \frac{d(mv)}{dt} \Bigr).$

Then we identify $f_C = d(mv)/dt$ as the “classical force” and write the last
expression as $(f^0, \gamma f_C)$. If (19.2) is to hold, we require $f^0 = \gamma f_C \cdot v$ (the dot
product in Euclidean $\mathbb{R}^3$), so

(19.9)    $F = \gamma\,(f_C \cdot v, f_C).$

With this correspondence, equation (19.7) yields Newton’s equation in the small
velocity limit.

Since the 4-velocity has constant length, by (19.2), the Minkowski 4-force $F$
must satisfy

(19.10)   $\langle F, u \rangle = 0.$

It follows that in relativity one cannot have velocity-independent forces. The
simplest situation compatible with (19.10) is for $F$ to be linear in $u$, say

(19.11)   $F(x, u) = \widetilde{\mathcal{F}}(x)\, u,$

where for each $x \in \mathbb{R}^4$, $\widetilde{\mathcal{F}}(x)$ is a linear transformation on $\mathbb{R}^4$; in other words,
$\widetilde{\mathcal{F}}$ is a tensor field of type (1,1). The condition (19.10) holds provided $\widetilde{\mathcal{F}}$ is
skew-adjoint with respect to the Lorentz inner product:

(19.12)   $\langle \widetilde{\mathcal{F}} u, w \rangle = -\langle u, \widetilde{\mathcal{F}} w \rangle.$

Equivalently, if we consider the related tensor $\mathcal{F}$ of type (0, 2),

(19.13)   $\mathcal{F}(u, w) = \langle u, \widetilde{\mathcal{F}} w \rangle,$

then $\mathcal{F}$ is antisymmetric, that is, $\mathcal{F}$ is a 2-form. In index notation,
$\mathcal{F}_{jk} = h_{j\ell}\, \widetilde{\mathcal{F}}^{\ell}{}_{k}$, where $h_{jk}$ defines the Lorentz metric.

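The electromagnetic field is of this sort. The classical force exerted by an electric
field $E$ and a magnetic field $B$ on a particle with charge $e$ is the Lorentz force

(19.14)   $f_L = e\,(E + v \times B),$

as in (12.40). Using this in (19.9) gives, for $u = (u^0, v)$,

(19.15)   $\widetilde{\mathcal{F}} u = e\,(E \cdot v,\ E u^0 + v \times B).$

Consequently the 2-form $\mathcal{F}$ is $\mathcal{F}(u, w) = e \sum \mathcal{F}_{\mu\nu} u^\mu w^\nu$ with

(19.16)   $(\mathcal{F}_{\mu\nu}) = \begin{pmatrix} 0 & -E_1 & -E_2 & -E_3 \\ E_1 & 0 & B_3 & -B_2 \\ E_2 & -B_3 & 0 & B_1 \\ E_3 & B_2 & -B_1 & 0 \end{pmatrix}.$

In relativity it is this 2-form which is called the electromagnetic field.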
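
The relation between the 2-form and the linear map of (19.11)-(19.13) can be verified on sample data. In the sketch below, the antisymmetric matrix is the one obtained from (19.15) by lowering an index with $h = \mathrm{diag}(-1, 1, 1, 1)$; the charge is set to $e = 1$, and the field values, test vector, and function names are arbitrary assumptions made for illustration.

```python
def field_two_form(E, B):
    """Antisymmetric matrix (F_mu_nu), indices ordered (0, 1, 2, 3)."""
    E1, E2, E3 = E
    B1, B2, B3 = B
    return [[0.0, -E1, -E2, -E3],
            [E1, 0.0, B3, -B2],
            [E2, -B3, 0.0, B1],
            [E3, B2, -B1, 0.0]]

def raise_first_index(F):
    """Matrix of the (1,1)-tensor: apply h^{-1} = diag(-1, 1, 1, 1) on the
    left, which flips the sign of the time row."""
    return [[-c for c in F[0]]] + [list(row) for row in F[1:]]

def apply(M, u):
    return [sum(m * c for m, c in zip(row, u)) for row in M]

def lorentz(x, y):
    return -x[0] * y[0] + sum(a * b for a, b in zip(x[1:], y[1:]))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

E = [1.0, 2.0, -0.5]       # sample electric field
B = [0.3, -1.0, 0.7]       # sample magnetic field
u = [1.5, 0.2, -0.4, 0.9]  # arbitrary 4-vector (u^0, v)

F = field_two_form(E, B)
Fu = apply(raise_first_index(F), u)

# Right side of (19.15) with e = 1: (E . v, E u^0 + v x B).
v = u[1:]
vxB = cross(v, B)
expected = [sum(a * b for a, b in zip(E, v))] + [E[i] * u[0] + vxB[i] for i in range(3)]

print(Fu)
print(expected)
print(lorentz(Fu, u))  # zero up to rounding, which is condition (19.10)
```

The vanishing of the last quantity for every $u$ is exactly the antisymmetry of the matrix returned by `field_two_form`.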
To change notation slightly, let us denote by $\mathcal{F}$ the 2-form described by (19.16),
namely, with $t = x_0$,

(19.17)   $\mathcal{F} = \sum_{j=1}^{3} E_j\, dx_j \wedge dt + B_1\, dx_2 \wedge dx_3 + B_2\, dx_3 \wedge dx_1 + B_3\, dx_1 \wedge dx_2.$

Thus the force in (19.11) is now denoted by $e \widetilde{\mathcal{F}} u$.

We can construct a Lagrangian giving the equation of motion (19.7), (19.11),
in a fashion similar to (12.44). The part of Maxwell’s equations for the
electromagnetic field recorded as (12.41) is equivalent to the statement that

(19.18)   $d\mathcal{F} = 0.$

Thus we can find a 1-form $A$ on Minkowski spacetime such that

(19.19)   $\mathcal{F} = dA.$

Then we can set

(19.20)   $L(x, u) = \frac{1}{2} m_0 \langle u, u \rangle + e \langle A, u \rangle,$

and the force law $dp/d\tau = e \widetilde{\mathcal{F}}(x) u$ is seen to be equivalent to

(19.21)   $\frac{d}{d\tau} L_u - L_x = 0.$

See Exercise 3 below. In this case, the Legendre transform (12.13) becomes, with
$u^\flat = (-u^0, u^1, u^2, u^3)$,

(19.22)   $(x, \xi) = (x, m_0 u^\flat + eA),$

and we get the Hamiltonian system

(19.23)   $\frac{dx}{d\tau} = E_\xi, \quad \frac{d\xi}{d\tau} = -E_x,$

with

(19.24)   $E(x, \xi) = \frac{1}{2 m_0} \langle \xi - eA, \xi - eA \rangle.$
Exercises


1. Consider a constant electromagnetic field of the form

    $E = (1, 0, 0), \quad B = 0.$

Work out the solution to Newton’s equation

    $m \frac{d^2 x}{dt^2} = e\,(E + v \times B), \quad v = \frac{dx}{dt},$

for the path $x = x(t)$ in $\mathbb{R}^3$ of a particle of charge $e$, mass $m$, moved by the Lorentz
force arising from this field. Then work out the solution to the relativistic equation

    $m_0 \frac{du}{d\tau} = e\,(E \cdot v,\ E u^0 + v \times B),$

with $u = (u^0, v)$ (having square norm $-1$), $u = dx/d\tau$, for the path in $\mathbb{R}^4$ of a particle
of charge $e$, rest mass $m_0$, moved by such an electromagnetic field. Compare the results.
Do the same for

    $E = 0, \quad B = (1, 0, 0).$

2. Take another look at Exercise 3 in §12.

3. Show that taking (19.20) for the Lagrangian implies that Lagrange’s equation (19.21)
is equivalent to the force law $dp/d\tau = e \widetilde{\mathcal{F}} u$, on Minkowski spacetime.

(Hint: To compute $L_x$, use

    $d\langle A, u \rangle = -(dA) \rfloor u + \mathcal{L}_u A,$

regard $u$ as independent of $x$, and note that $dA/d\tau = \nabla_u A = \mathcal{L}_u A$, in that case.)
Compare Exercise 4 in §12.

4. Verify formula (19.16) for $\mathcal{F}_{\mu\nu}$. Show that the matrix for $\widetilde{\mathcal{F}}$ has the same form,
except all $E_j$ carry plus signs.

5. An alternative sign convention for the Lorentz metric on Minkowski spacetime is to
replace (19.1) by $\langle x, y \rangle = x_0 y_0 - \sum_{j=1}^{3} x_j y_j$. Show that this leads to a sign change
in (19.16). What other sign changes arise?

6. Suppose a 1-form $A$ is given, satisfying (19.19), on a general four-dimensional Lorentz
manifold $M$. Let $L : TM \to \mathbb{R}$ be given by (19.20). Use the set-up described in
(12.51)-(12.65) to derive equations of motion, extending the Lorentz force law from
Minkowski spacetime to any Lorentz 4-manifold.

(Hint: In analogy with (12.64), show that $L_v$ is given by

    $L_v = m_0 u + e A^\#,$

where $A^\#$ is the vector field corresponding to $A$ via the metric (by raising indices).
Taking a cue from Exercise 3, show that $L_x$ satisfies

    $L_x = e \widetilde{\mathcal{F}} u + e \nabla_u A^\#.$

Deduce that the equation

    $m_0 \nabla_u u = e \widetilde{\mathcal{F}} u$

is the stationary condition for this Lagrangian.)
20. Topological applications of differential forms


Differential forms are a fundamental tool in calculus. In addition, they have impor- 
tant applications to topology. We give a few here, starting with simple proofs of 
some important topological results of Brouwer. 


Proposition 20.1. There is no continuous retraction $\varphi : B \to S^{n-1}$ of the closed
unit ball $B$ in $\mathbb{R}^n$ onto its boundary $S^{n-1}$.


In fact, it is just as easy to prove the following more general result. The approach 
we use is adapted from [Kan]. 


Proposition 20.2. If $M$ is a compact, oriented manifold with nonempty boundary
$\partial M$, there is no continuous retraction $\varphi : M \to \partial M$.


Proof. A retraction $\varphi$ satisfies $\varphi \circ j(x) = x$, where $j : \partial M \hookrightarrow M$ is the natural
inclusion. By a simple approximation, if there were a continuous retraction there
would be a smooth one, so we can suppose $\varphi$ is smooth.

Pick $\omega \in \Lambda^{n-1}(\partial M)$ to be the volume form on $\partial M$, endowed with some
Riemannian metric ($n = \dim M$), so $\int_{\partial M} \omega > 0$. Now apply Stokes’ theorem to
$\alpha = \varphi^*\omega$. If $\varphi$ is a retraction, $j^*\varphi^*\omega = \omega$, so we have

(20.1)    $\int_{\partial M} \omega = \int_M d\varphi^*\omega.$

But $d\varphi^*\omega = \varphi^* d\omega = 0$, so the integral (20.1) is zero. This is a contradiction, so
there can be no retraction.


A simple consequence of this is the famous Brouwer fixed-point theorem. 


Theorem 20.3. If $F : B \to B$ is a continuous map on the closed unit ball in $\mathbb{R}^n$,
then $F$ has a fixed point.


Proof. We are claiming that $F(x) = x$ for some $x \in B$. If not, define $\varphi(x)$ to be
the endpoint of the ray from $F(x)$ to $x$, continued until it hits $\partial B = S^{n-1}$. It is
clear that $\varphi$ would be a retraction, contradicting Proposition 20.1.


We next show that an even-dimensional sphere cannot have a smooth nonvan- 
ishing vector field. 


Proposition 20.4. There is no smooth nonvanishing vector field on $S^n$ if $n = 2k$
is even.


Proof. If $X$ were such a vector field, we could arrange it to have unit length, so
we would have $X : S^n \to S^n$, with $X(v) \perp v$ for $v \in S^n \subset \mathbb{R}^{n+1}$. Thus there is
a unique unit-speed geodesic $\gamma_v$ from $v$ to $X(v)$, of length $\pi/2$. Define a smooth
family of maps $F_t : S^n \to S^n$ by $F_t(v) = \gamma_v(t)$. Thus $F_0(v) = v$, $F_{\pi/2}(v) = X(v)$,
and $F_\pi = A$ would be the antipodal map, $A(v) = -v$. By (13.63), we
deduce that $A^*\omega - \omega = d\beta$ is exact, where $\omega$ is the volume form on $S^n$. Hence,
by Stokes’ theorem,

(20.2)    $\int_{S^n} A^*\omega = \int_{S^n} \omega.$

On the other hand, it is straightforward that $A^*\omega = (-1)^{n+1}\omega$, so (20.2) is
possible only when $n$ is odd.


Note that an important ingredient in the proof of both Proposition 20.2
and Proposition 20.4 is the existence of $n$-forms on a compact, oriented,
$n$-dimensional manifold $M$ which are not exact (though of course they are closed).
We next establish the following important counterpoint to the Poincaré lemma.


Proposition 20.5. If $M$ is a compact, connected, oriented manifold of dimension
$n$ and $\alpha \in \Lambda^n M$, then $\alpha = d\beta$ for some $\beta \in \Lambda^{n-1}(M)$ if and only if

(20.3)    $\int_M \alpha = 0.$


We have already discussed the necessity of (20.3). To prove the sufficiency, we 
first look at the case M = S”. 

In that case, any $n$-form $\alpha$ is of the form $a(x)\omega$, $a \in C^\infty(S^n)$, $\omega$ the volume form on $S^n$, with its standard metric. The group $G = SO(n+1)$ of rotations of $\mathbb{R}^{n+1}$ acts as a transitive group of isometries on $S^n$. In Appendix B, Manifolds, Vector Bundles, and Lie Groups, we construct the integral of functions over $SO(n+1)$, with respect to Haar measure.

As noted in Appendix B, we have the map $\mathrm{Exp} : \mathrm{Skew}(n+1) \to SO(n+1)$, giving a diffeomorphism from a ball $\mathcal{O}$ about 0 in $\mathrm{Skew}(n+1)$ onto an open set $U \subset SO(n+1) = G$, a neighborhood of the identity. Since $G$ is compact, we can pick a finite number of elements $\xi_j \in G$ such that the open sets $U_j = \{\xi_j g : g \in U\}$ cover $G$. Pick $\eta_j \in \mathrm{Skew}(n+1)$ such that $\mathrm{Exp}\, \eta_j = \xi_j$. Define $\Phi_{jt} : U_j \to G$ for $0 \le t \le 1$ by

(20.4)    $\Phi_{jt}(\xi_j\, \mathrm{Exp}(A)) = (\mathrm{Exp}\, t\eta_j)(\mathrm{Exp}\, tA), \quad A \in \mathcal{O}.$

Now partition $G$ into subsets $\Omega_j$, each of whose boundaries has content zero, such that $\Omega_j \subset U_j$. If $g \in \Omega_j$, set $g(t) = \Phi_{jt}(g)$. This family of elements of $SO(n+1)$ defines a family of maps $F_{gt} : S^n \to S^n$. Now, as in (13.60), we have

(20.5)    $\alpha = g^*\alpha - d\kappa_g(\alpha), \quad \kappa_g(\alpha) = \int_0^1 F_{gt}^*(\alpha \rfloor X_{gt})\, dt,$

for each $g \in SO(n+1)$, where $X_{gt}$ is the family of vector fields on $S^n$ generated by $F_{gt}$, as in (13.58). Therefore,

(20.6)    $\alpha = \int_G g^*\alpha\, dg - d\int_G \kappa_g(\alpha)\, dg.$

Now the first term on the right is equal to $\bar{\alpha}\omega$, where $\bar{\alpha} = \int a(g \cdot x)\, dg$ is a constant; in fact, the constant is

(20.7)    $\bar{\alpha} = \frac{1}{\mathrm{Vol}\, S^n} \int_{S^n} \alpha.$

Thus, in this case, (20.3) is precisely what serves to make (20.6) a representation of $\alpha$ as an exact form. This finishes the case $M = S^n$.

For a general compact, oriented, connected $M$, proceed as follows. Cover $M$ with open sets $\mathcal{O}_1, \dots, \mathcal{O}_K$ such that each $\overline{\mathcal{O}}_j$ is diffeomorphic to the closed unit ball in $\mathbb{R}^n$. Set $U_1 = \mathcal{O}_1$, and inductively enlarge each $\mathcal{O}_j$ to $U_j$, so that $\overline{U}_j$ is also diffeomorphic to the closed ball, and such that $U_{j+1} \cap U_j \neq \emptyset$, $1 \le j < K$. You can do this by drawing a simple curve from $\mathcal{O}_{j+1}$ to a point in $U_j$ and thickening it. Pick a smooth partition of unity $\varphi_j$, subordinate to this cover.

Given $\alpha \in \Lambda^n M$, satisfying (20.3), take $\tilde{\alpha}_j = \varphi_j \alpha$. Most likely $\int \tilde{\alpha}_1 = c_1 \neq 0$, so take $\sigma_1 \in \Lambda^n M$, with compact support in $U_1 \cap U_2$, such that $\int \sigma_1 = c_1$. Set $\alpha_1 = \tilde{\alpha}_1 - \sigma_1$, and redefine $\tilde{\alpha}_2$ to be the old $\tilde{\alpha}_2$ plus $\sigma_1$. Make a similar construction using $\int \tilde{\alpha}_2 = c_2$, and continue. When you are done, you have

(20.8)    $\alpha = \alpha_1 + \cdots + \alpha_K,$

with $\alpha_j$ compactly supported in $U_j$. By construction,

(20.9)    $\int \alpha_j = 0,$

for $1 \le j < K$. But then (20.3) implies $\int \alpha_K = 0$ too.
Now pick $p \in S^n$ and define smooth maps

(20.10)    $\psi_j : M \to S^n,$

which map $U_j$ diffeomorphically onto $S^n \setminus p$ and map $M \setminus U_j$ to $p$. There is a unique $v_j \in \Lambda^n S^n$, with compact support in $S^n \setminus p$, such that $\psi_j^* v_j = \alpha_j$. Clearly

$\int_{S^n} v_j = 0,$

so by the case $M = S^n$ of Proposition 20.5 already established, we know that $v_j = dw_j$ for some $w_j \in \Lambda^{n-1} S^n$, and then

(20.11)    $\alpha_j = d\beta_j, \quad \beta_j = \psi_j^* w_j.$

This concludes the proof.

We can sharpen and extend some of the topological results given above, using the notion of the degree of a map between compact, oriented surfaces. Let $X$ and $Y$ be compact, oriented, $n$-dimensional surfaces. We want to define the degree of a smooth map $F : X \to Y$. To do this, assume $Y$ is connected. We pick $\omega \in \Lambda^n Y$ such that

(20.12)    $\int_Y \omega = 1.$

We want to define

(20.13)    $\mathrm{Deg}(F) = \int_X F^*\omega.$

The following result shows that $\mathrm{Deg}(F)$ is indeed well defined by this formula. The key argument is an application of Proposition 20.5.


Lemma 20.6. The quantity (20.13) is independent of the choice of $\omega$, as long as (20.12) holds.

Proof. Pick $\omega_1 \in \Lambda^n Y$ satisfying $\int_Y \omega_1 = 1$, so $\int_Y (\omega - \omega_1) = 0$. By Proposition 20.5, this implies

(20.14)    $\omega - \omega_1 = d\alpha$, for some $\alpha \in \Lambda^{n-1} Y$.

Thus

(20.15)    $\int_X F^*\omega - \int_X F^*\omega_1 = \int_X dF^*\alpha = 0,$

and the lemma is proved.

The following is a most basic property.

Proposition 20.7. If $F_0$ and $F_1$ are homotopic, then $\mathrm{Deg}(F_0) = \mathrm{Deg}(F_1)$.

Proof. As noted in Exercise 7 of §13, if $F_0$ and $F_1$ are homotopic, then $F_0^*\omega - F_1^*\omega$ is exact, say $d\beta$, and of course $\int_X d\beta = 0$.

We next give an alternative formula for the degree of a map, which is very useful in many applications. A point $y_0 \in Y$ is called a regular value of $F$ provided that, for each $x \in X$ satisfying $F(x) = y_0$, $DF(x) : T_x X \to T_{y_0} Y$ is an isomorphism. The easy case of Sard's theorem, discussed in Appendix B, implies that most points in $Y$ are regular. Endow $X$ with a volume element $\omega_X$, and similarly endow $Y$ with $\omega_Y$. If $DF(x)$ is invertible, define $JF(x) \in \mathbb{R} \setminus 0$ by $F^*(\omega_Y) = JF(x)\,\omega_X$. Clearly the sign of $JF(x)$ (i.e., $\mathrm{sgn}\, JF(x) = \pm 1$) is independent of the choices of $\omega_X$ and $\omega_Y$, as long as they determine the given orientations of $X$ and $Y$.


Proposition 20.8. If $y_0$ is a regular value of $F$, then

(20.16)    $\mathrm{Deg}(F) = \sum \{\mathrm{sgn}\, JF(x_j) : F(x_j) = y_0\}.$

Proof. Pick $\omega \in \Lambda^n Y$, satisfying (20.12), with support in a small neighborhood of $y_0$. Then $F^*\omega$ will be a sum $\sum \omega_j$, with $\omega_j$ supported in a small neighborhood of $x_j$, and $\int \omega_j = \pm 1$ as $\mathrm{sgn}\, JF(x_j) = \pm 1$.
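As a numerical illustration of (20.13) and (20.16), the following sketch compares the two formulas for a map of the circle $S^1$ to itself. The specific map `example_map`, the grid size, and the helper names are assumptions made for this example only; the form $\omega$ is taken to be $d\theta/2\pi$.

```python
import numpy as np

def lifted_angle(f, num=200001):
    """Sample theta in [0, 2*pi]; return a continuous lift of arg f(e^{i*theta})."""
    theta = np.linspace(0.0, 2.0 * np.pi, num)
    phi = np.unwrap(np.angle(f(np.exp(1j * theta))))
    return theta, phi

def degree_by_integral(f):
    """Deg(F) as the integral of F^* omega with omega = d(angle)/(2*pi), cf. (20.13)."""
    _, phi = lifted_angle(f)
    return (phi[-1] - phi[0]) / (2.0 * np.pi)

def degree_by_regular_value(f, alpha):
    """Signed count of preimages of e^{i*alpha}, cf. (20.16).

    Returns (signed sum, number of preimages detected on the grid).
    """
    _, phi = lifted_angle(f)
    g = (phi - alpha) / (2.0 * np.pi)
    levels = np.floor(g)
    # A preimage lies between two samples whenever floor(g) changes.
    crossings = np.nonzero(levels[1:] != levels[:-1])[0]
    # The sign of JF at the preimage is the sign of the local angle increment.
    signs = np.sign(g[crossings + 1] - g[crossings])
    return int(signs.sum()), len(crossings)

def example_map(z):
    """Hypothetical test map S^1 -> S^1; it is not monotone in the angle."""
    theta = np.angle(z)
    return np.exp(1j * (2.0 * theta + 1.5 * np.sin(3.0 * theta)))

deg_integral = degree_by_integral(example_map)
deg_count, num_preimages = degree_by_regular_value(example_map, alpha=2.0)
print(deg_integral, deg_count, num_preimages)
```

Because the test map is not monotone in the angle, the chosen regular value has preimages counted with sign $-1$ as well as $+1$; the signed count should nevertheless agree with the integral.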


The following result is a powerful tool in degree theory. 


Proposition 20.9. Let $M$ be a compact, oriented manifold with boundary. Assume that $\dim M = n + 1$. Given a smooth map $F : M \to Y$, let $f = F|_{\partial M} : \partial M \to Y$. Then

$\mathrm{Deg}(f) = 0.$

Proof. Applying Stokes' theorem to $\alpha = F^*\omega$, we have

$\int_{\partial M} f^*\omega = \int_M dF^*\omega.$

But $dF^*\omega = F^* d\omega$, and $d\omega = 0$ if $\dim Y = n$, so we are done.


An easy corollary of this is another proof of Brouwer’s no-retraction theorem. 
Compare the proof of Proposition 20.2. 


Corollary 20.10. If $M$ is a compact, oriented manifold with nonempty boundary $\partial M$, then there is no smooth retraction $\varphi : M \to \partial M$.

Proof. Without loss of generality, we can assume that $M$ is connected. If there were a retraction, then $\partial M = \varphi(M)$ must also be connected, so Proposition 20.9 applies. But then we would have, for the map $\mathrm{id} = \varphi|_{\partial M}$, the contradiction that its degree is both 0 and 1.


For another application of degree theory, let $X$ be a compact, smooth, oriented hypersurface in $\mathbb{R}^{n+1}$, and set $\Omega = \mathbb{R}^{n+1} \setminus X$. (Assume $n \ge 1$.) Given $p \in \Omega$, define

(20.17)    $F_p : X \to S^n, \quad F_p(x) = \frac{x - p}{|x - p|}.$

It is clear that $\mathrm{Deg}(F_p)$ is constant on each connected component of $\Omega$. It is also easy to see that, when $p$ crosses $X$, $\mathrm{Deg}(F_p)$ jumps by $\pm 1$. Thus $\Omega$ has at least two connected components. This is most of the smooth case of the Jordan–Brouwer separation theorem:


Theorem 20.11. If $X$ is a smooth, compact, oriented hypersurface of $\mathbb{R}^{n+1}$, which is connected, then $\Omega = \mathbb{R}^{n+1} \setminus X$ has exactly two connected components.

Proof. Since $X$ is oriented, it has a smooth, global, normal vector field. Use this to separate a small collar neighborhood $\mathcal{C}$ of $X$ into two pieces; $\mathcal{C} \setminus X = \mathcal{C}_0 \cup \mathcal{C}_1$. The collar $\mathcal{C}$ is diffeomorphic to $[-1, 1] \times X$, and each $\mathcal{C}_j$ is clearly connected. It suffices to show that any connected component $\mathcal{O}$ of $\Omega$ intersects either $\mathcal{C}_0$ or $\mathcal{C}_1$. Take $p \in \partial\mathcal{O}$. If $p \notin X$, then $p \in \Omega$, which is open, so $p$ cannot be a boundary point of any component of $\Omega$. Thus $\partial\mathcal{O} \subset X$, so $\mathcal{O}$ must intersect a $\mathcal{C}_j$. This completes the proof.


Let us note that, of the two components of $\Omega$, exactly one is unbounded, say $\Omega_0$, and the other is bounded; call it $\Omega_1$. Then we claim that if $X$ is given the orientation it gets as $\partial\Omega_1$,

(20.18)    $p \in \Omega_j \Longrightarrow \mathrm{Deg}(F_p) = j.$

Indeed, for $p$ very far from $X$, $F_p : X \to S^n$ is not onto, so its degree is 0. And when $p$ crosses $X$, from $\Omega_0$ to $\Omega_1$, the degree jumps by $+1$.
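For $n = 1$ the degree in (20.18) is the winding number of a closed planar curve about $p$, which is easy to check numerically. The sketch below is only an illustration under stated assumptions: the curve is an ellipse traversed counterclockwise (the orientation it gets as the boundary of the bounded region), it is sampled on a fine grid, and the helper name `degree_F_p` is hypothetical.

```python
import numpy as np

def degree_F_p(curve, p):
    """Degree of F_p(x) = (x - p)/|x - p| on a closed planar curve, cf. (20.17).

    `curve` is an (N, 2) array of samples with curve[0] equal to curve[-1]
    (up to rounding), listed in the direction of the chosen orientation.
    """
    d = curve - np.asarray(p, dtype=float)
    angle = np.unwrap(np.arctan2(d[:, 1], d[:, 0]))
    return int(round((angle[-1] - angle[0]) / (2.0 * np.pi)))

t = np.linspace(0.0, 2.0 * np.pi, 4001)
ellipse = np.column_stack([3.0 * np.cos(t), np.sin(t)])

deg_inside = degree_F_p(ellipse, (0.5, 0.2))   # p in the bounded component
deg_outside = degree_F_p(ellipse, (5.0, 0.0))  # p in the unbounded component
print(deg_inside, deg_outside)
```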

For a simple closed curve in $\mathbb{R}^2$, this result is the smooth case of the Jordan curve theorem. That special case of the argument given above can be found in [Sto].

We remark that, with a bit more work, one can show that any compact, smooth hypersurface in $\mathbb{R}^{n+1}$ is orientable. For one proof, see Appendix B to Chap. 5.

The next application of degree theory is useful in the study of closed orbits of planar vector fields. Let $C$ be a simple, smooth, closed curve in $\mathbb{R}^2$, parameterized by arc length, of total length $L$. Say $C$ is given by $x = \gamma(t)$, $\gamma(t + L) = \gamma(t)$. Then we have a unit tangent field to $C$, $T(\gamma(t)) = \gamma'(t)$, defining

(20.19)    $T : C \to S^1.$

Proposition 20.12. For $T$ given by (20.19), we have

(20.20)    $\mathrm{Deg}(T) = 1.$

Proof. Pick a tangent line $\ell$ to $C$ such that $C$ lies on one side of $\ell$, as in Fig. 20.1. Without changing $\mathrm{Deg}(T)$, you can flatten out $C$ a little, so it intersects $\ell$ along a line segment, from $\gamma(L_0)$ to $\gamma(L) = \gamma(0)$, where we take $L_0 = L - 2\varepsilon$, $L_1 = L - \varepsilon$.

FIGURE 20.1 Deformation of T


Now $T$ is close to the map $T_s : C \to S^1$, given by

(20.21)    $T_s(\gamma(t)) = \frac{\gamma(t + s) - \gamma(t)}{|\gamma(t + s) - \gamma(t)|},$

for any $s > 0$ small enough; hence $T$ and $T_s$ are homotopic, for small positive $s$. It follows that $T$ and $T_s$ are homotopic for all $s \in (0, L)$. Furthermore, we can even let $s = s(t)$ be any continuous function $s : [0, L] \to (0, L)$ such that $s(0) = s(L)$. In particular, $T$ is homotopic to the map $V : C \to S^1$, obtained from (20.21) by taking

$s(t) = L_1 - t$, for $t \in [0, L_0]$,

and $s(t)$ going monotonically from $L_1 - L_0$ to $L_1$, for $t \in [L_0, L]$. Note that
27S Ty. 


The parts of $V$ over the ranges $0 \le t \le L_0$ and $L_0 \le t \le L$, respectively, are illustrated in Figs. 20.1 and 20.2. We see that $V$ maps the segment of $C$ from $\gamma(0)$ to $\gamma(L_0)$ into the lower half of the circle $S^1$, and it maps the segment of $C$ from $\gamma(L_0)$ to $\gamma(L)$ into the upper half of the circle $S^1$. Therefore, $V$ (hence $T$) is homotopic to a one-to-one map of $C$ onto $S^1$, preserving orientation, and (20.20) is proved.
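The identity (20.20) can be checked numerically on a concrete curve. The sketch below assumes a particular nonconvex simple closed curve, given in polar form by $r = 2 + \cos 3t$ and traversed counterclockwise; it is not parameterized by arc length, which does not affect the degree of the normalized tangent. The function names are hypothetical.

```python
import numpy as np

def tangent_degree(dgamma, num=20001):
    """Degree of T = gamma'/|gamma'| for a closed curve with parameter in [0, 2*pi]."""
    t = np.linspace(0.0, 2.0 * np.pi, num)
    dx, dy = dgamma(t)
    angle = np.unwrap(np.arctan2(dy, dx))
    return int(round((angle[-1] - angle[0]) / (2.0 * np.pi)))

def d_curve(t):
    """Derivative of gamma(t) = r(t) (cos t, sin t), with r(t) = 2 + cos(3t) > 0."""
    r = 2.0 + np.cos(3.0 * t)
    dr = -3.0 * np.sin(3.0 * t)
    return dr * np.cos(t) - r * np.sin(t), dr * np.sin(t) + r * np.cos(t)

deg_T = tangent_degree(d_curve)
print(deg_T)
```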


FIGURE 20.2 Further Deformation

The material of this section can be cast in the language of deRham cohomology, which we now define. Let $M$ be a smooth manifold. A smooth $k$-form $u$ is said to be exact if $u = dv$ for some smooth $(k - 1)$-form $v$, and closed if $du = 0$. Since $d^2 = 0$, every exact form is closed:

(20.22)    $\mathcal{E}^k(M) \subset \mathcal{C}^k(M),$

where $\mathcal{E}^k(M)$ and $\mathcal{C}^k(M)$ denote respectively the spaces of exact and closed $k$-forms. The deRham cohomology groups are defined as quotient spaces:

(20.23)    $\mathcal{H}^k(M) = \mathcal{C}^k(M)/\mathcal{E}^k(M).$

There are no nonzero $(-1)$-forms, so $\mathcal{E}^0(M) = 0$. A 0-form is a real-valued function, and it is closed if and only if it is constant on each connected component of $M$, so

(20.24)    $\mathcal{H}^0(M) \approx \mathbb{R}^\nu, \quad \nu = \#$ connected components of $M$.

An immediate consequence of Proposition 20.5 is the following:
An immediate consequence of Proposition 20.5 is the following: 


Proposition 20.13. If $M$ is a compact, connected, oriented manifold of dimension $n$, then

(20.25)    $\mathcal{H}^n(M) \approx \mathbb{R}.$


Via the pull-back of forms, a smooth map $F : X \to Y$ between two manifolds induces maps on cohomology:

(20.26)    $F^* : \mathcal{H}^j(Y) \to \mathcal{H}^j(X).$

If $X$ and $Y$ are both compact, connected, oriented, and of dimension $n$, then we have $F^* : \mathcal{H}^n(Y) \to \mathcal{H}^n(X)$, and, via the isomorphism $\mathcal{H}^n(X) \approx \mathbb{R} \approx \mathcal{H}^n(Y)$ arising from integration of $n$-forms, this map is simply multiplication by $\mathrm{Deg}\, F$.

The subject of deRham cohomology plays an important role in material we develop later, such as Hodge theory, in Chap. 5, and index theory, in Chap. 10.


Exercises 

1. Show that the identity map $I : X \to X$ has degree 1.

2. Show that if $F : X \to Y$ is not onto, then $\mathrm{Deg}(F) = 0$.

3. If $A : S^n \to S^n$ is the antipodal map, show that $\mathrm{Deg}(A) = (-1)^{n-1}$.

4. Show that the homotopy invariance property given in Proposition 20.7 can be deduced as a corollary of Proposition 20.9. (Hint: Take $M = X \times [0, 1]$.)

5. Let $p(z) = z^n + a_{n-1}z^{n-1} + \cdots + a_1 z + a_0$ be a polynomial of degree $n \ge 1$. Show that if we identify $S^2 \approx \mathbb{C} \cup \{\infty\}$, then $p : \mathbb{C} \to \mathbb{C}$ has a unique continuous extension $\tilde{p} : S^2 \to S^2$, with $\tilde{p}(\infty) = \infty$. Show that

$\mathrm{Deg}\, \tilde{p} = n.$

Deduce that $\tilde{p} : S^2 \to S^2$ is onto, and hence that $p : \mathbb{C} \to \mathbb{C}$ is onto. In particular, each nonconstant polynomial in $z$ has a complex root. This result is the fundamental theorem of algebra.


21. Critical points and index of a vector field 


A critical point of a vector field $V$ is a point at which $V$ vanishes. Let $V$ be a vector field defined on a neighborhood $\mathcal{O}$ of $p \in \mathbb{R}^n$, with a single critical point, at $p$. Then, for any small ball $B_r$ about $p$, $B_r \subset \mathcal{O}$, we have a map

(21.1)    $V_r : \partial B_r \to S^{n-1}, \quad V_r(x) = \frac{V(x)}{|V(x)|}.$

The degree of this map is called the index of $V$ at $p$, denoted $\mathrm{ind}_p(V)$; it is clearly independent of $r$. If $V$ has a finite number of critical points, then the index of $V$ is defined to be

(21.2)    $\mathrm{Index}(V) = \sum_j \mathrm{ind}_{p_j}(V).$

If $\psi : \mathcal{O} \to \mathcal{O}'$ is an orientation-preserving diffeomorphism, taking $p$ to $p$ and $V$ to $W$, then we claim that

(21.3)    $\mathrm{ind}_p(V) = \mathrm{ind}_p(W).$

In fact, $D\psi(p)$ is an element of $GL(n, \mathbb{R})$ with positive determinant, so it is homotopic to the identity, and from this it readily follows that $V_r$ and $W_r$ are homotopic maps of $\partial B_r \to S^{n-1}$. Thus one has a well-defined notion of the index of a vector field with a finite number of critical points on any oriented manifold $M$.

A vector field $V$ on $\mathcal{O} \subset \mathbb{R}^n$ is said to have a nondegenerate critical point at $p$ provided $DV(p)$ is a nonsingular $n \times n$ matrix. The following formula is convenient.

Proposition 21.1. If $V$ has a nondegenerate critical point at $p$, then

(21.4)    $\mathrm{ind}_p(V) = \mathrm{sgn} \det DV(p).$

Proof. If $p$ is a nondegenerate critical point, and we set $\psi(x) = DV(p)x$, $\psi_r(x) = \psi(x)/|\psi(x)|$, for $x \in \partial B_r$, it is readily verified that $\psi_r$ and $V_r$ are homotopic, for $r$ small. The fact that $\mathrm{Deg}(\psi_r)$ is given by the right side of (21.4) is an easy consequence of Proposition 20.8.
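For planar vector fields, formula (21.4) can be tested directly: the index is the winding number of $V$ around a small circle, and $DV(p)$ can be approximated by central differences. The following is a minimal sketch; the four sample fields, the radius `r`, and the step `h` are illustrative assumptions, not taken from the text.

```python
import numpy as np

def index_at(V, p, r=1e-3, num=4001):
    """Degree of V/|V| on the circle of radius r about p, cf. (21.1)."""
    t = np.linspace(0.0, 2.0 * np.pi, num)
    vx, vy = V(p[0] + r * np.cos(t), p[1] + r * np.sin(t))
    angle = np.unwrap(np.arctan2(vy, vx))
    return int(round((angle[-1] - angle[0]) / (2.0 * np.pi)))

def sgn_det_DV(V, p, h=1e-6):
    """Sign of det DV(p), with DV(p) approximated by central differences."""
    px, py = p
    a1, a2 = V(px + h, py)
    b1, b2 = V(px - h, py)
    c1, c2 = V(px, py + h)
    d1, d2 = V(px, py - h)
    J = np.array([[a1 - b1, c1 - d1], [a2 - b2, c2 - d2]]) / (2.0 * h)
    return int(np.sign(np.linalg.det(J)))

# Hypothetical fields, each with a nondegenerate critical point at the origin.
fields = {
    "saddle": lambda x, y: (x + y**2, -y),
    "source": lambda x, y: (x, y + x**2),
    "sink": lambda x, y: (-x, -y + x * y),
    "center": lambda x, y: (-y, x),
}

results = {name: (index_at(V, (0.0, 0.0)), sgn_det_DV(V, (0.0, 0.0)))
           for name, V in fields.items()}
print(results)
```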


The following is an important global relation between index and degree. 


Proposition 21.2. Let $\Omega$ be a smooth bounded region in $\mathbb{R}^{n+1}$. Let $V$ be a vector field on $\overline{\Omega}$, with a finite number of critical points $p_j$, all in the interior $\Omega$. Define $F : \partial\Omega \to S^n$ by $F(x) = V(x)/|V(x)|$. Then

(21.5)    $\mathrm{Index}(V) = \mathrm{Deg}(F).$

Proof. If we apply Proposition 20.9 to $M = \overline{\Omega} \setminus \bigcup_j B_\varepsilon(p_j)$, we see that $\mathrm{Deg}(F)$ is equal to the sum of degrees of the maps of $\partial B_\varepsilon(p_j)$ to $S^n$, which gives (21.5).


Next we look at a process of producing vector fields in higher-dimensional 
spaces from vector fields in lower-dimensional spaces. 


Proposition 21.3. Let $W$ be a vector field on $\mathbb{R}^n$, vanishing only at 0. Define a vector field $V$ on $\mathbb{R}^{n+k}$ by $V(x, y) = (W(x), y)$. Then $V$ vanishes only at $(0, 0)$. Then we have

(21.6)    $\mathrm{ind}_0 W = \mathrm{ind}_{(0,0)} V.$

Proof. If we use Proposition 20.8 to compute degrees of maps, and choose $y_0 \in S^{n-1} \subset S^{n+k-1}$, a regular value of $W_r$, and hence also for $V_r$, this identity follows.

We turn to a more sophisticated variation. Let $X$ be a compact, oriented, $n$-dimensional submanifold of $\mathbb{R}^{n+k}$, $W$ a (tangent) vector field on $X$ with a finite number of critical points $p_j$. Let $\Omega$ be a small tubular neighborhood of $X$, $\pi : \Omega \to X$ mapping $z \in \Omega$ to the nearest point in $X$. Let $\varphi(z) = \mathrm{dist}(z, X)^2$. Now define a vector field $V$ on $\Omega$ by

(21.7)    $V(z) = W(\pi(z)) + \nabla\varphi(z).$

Proposition 21.4. If $F : \partial\Omega \to S^{n+k-1}$ is given by $F(z) = V(z)/|V(z)|$, then

(21.8)    $\mathrm{Deg}(F) = \mathrm{Index}(W).$

Proof. We see that all the critical points of $V$ are points in $X$ that are critical for $W$, and, as in Proposition 21.3, $\mathrm{Index}(W) = \mathrm{Index}(V)$. But Proposition 21.2 implies that $\mathrm{Index}(V) = \mathrm{Deg}(F)$.


Since $\varphi(z)$ is increasing as one moves away from $X$, it is clear that, for $z \in \partial\Omega$, $V(z)$ points out of $\Omega$, provided it is a sufficiently small tubular neighborhood of $X$. Thus $F : \partial\Omega \to S^{n+k-1}$ is homotopic to the Gauss map

(21.9)    $N : \partial\Omega \to S^{n+k-1},$

given by the outward-pointing normal. This immediately gives the next result.

Corollary 21.5. Let $X$ be a compact oriented manifold in $\mathbb{R}^{n+k}$, $\Omega$ a small tubular neighborhood of $X$, and $N : \partial\Omega \to S^{n+k-1}$ the Gauss map. If $W$ is a vector field on $X$ with a finite number of critical points, then

(21.10)    $\mathrm{Index}(W) = \mathrm{Deg}(N).$

Clearly, the right side of (21.10) is independent of the choice of $W$. Thus any two vector fields on $X$ with a finite number of critical points have the same index, that is, $\mathrm{Index}(W)$ is an invariant of $X$. This invariant is denoted by

(21.11)    $\mathrm{Index}(W) = \chi(X),$

and is called the Euler characteristic of $X$. See the exercises for more results on $\chi(X)$. A different definition of $\chi(X)$ is given in Chap. 5. These two definitions are related in §8 of Appendix C, Connections and Curvature.


Exercises 


In Exercises 1–3, $V$ is a vector field on a region $\Omega \subset \mathbb{R}^2$. A nondegenerate critical point $p$ of a vector field $V$ is said to be a source if the real parts of the eigenvalues of $DV(p)$ are all positive, a sink if they are all negative, and a saddle if they are all either positive or negative, and there exist some of each sign. Such a critical point is called a center if all orbits of $V$ close to $p$ are closed orbits, which stay near $p$; this requires all the eigenvalues of $DV(p)$ to be purely imaginary.

1. Let $V$ have a nondegenerate critical point at $p$. Show that

$p$ saddle $\Longrightarrow \mathrm{ind}_p(V) = -1,$
$p$ source $\Longrightarrow \mathrm{ind}_p(V) = 1,$
$p$ sink $\Longrightarrow \mathrm{ind}_p(V) = 1,$
$p$ center $\Longrightarrow \mathrm{ind}_p(V) = 1.$

2. If $V$ has a closed orbit $\gamma$, show that the map $T : \gamma \to S^1$, $T(x) = V(x)/|V(x)|$, has degree $+1$. (Hint: Use Proposition 20.8.)

3. If $V$ has a closed orbit $\gamma$ whose inside $\mathcal{O}$ is contained in $\Omega$, show that $V$ must have at least one critical point in $\mathcal{O}$, and that the sum of the indices of such critical points must be $+1$. (Hint: Use Proposition 21.2.)
If $V$ has exactly one critical point in $\mathcal{O}$, show that it cannot be a saddle.

4. Let $M$ be a compact, oriented surface. Given a triangulation of $M$, within each triangle construct a vector field, vanishing at seven points as illustrated in Fig. 21.1, with the vertices as attractors, the center as a repeller, and the midpoints of each side as saddle points. Fit these together to produce a smooth vector field $X$ on $M$. Show directly that

$\mathrm{Index}(X) = V - E + F,$

where

$V = \#$ vertices, $E = \#$ edges, $F = \#$ faces,

in the triangulation.

5. More generally, construct a vector field on an $n$-simplex so that when a compact, oriented, $n$-dimensional manifold $M$ is triangulated into simplices, one produces a vector field $X$ on $M$ such that

(21.12)    $\mathrm{Index}(X) = \sum_{j=0}^{n} (-1)^j \nu_j,$

where $\nu_j$ is the number of $j$-simplices in the triangulation, namely, $\nu_0 = \#$ vertices, $\nu_1 = \#$ edges, ..., $\nu_n = \#$ of $n$-simplices. (See Fig. 21.2 for a picture of a 3-simplex, with its faces (i.e., 2-simplices), edges, and vertices labeled.)

The right side of (21.12) is one definition of $\chi(M)$. As we have seen, the left side of (21.12) is independent of the choice of $X$, so it follows that the right side is independent of the choice of triangulation.

6. Let $M$ be the sphere $S^n$, which is homeomorphic to the boundary of an $(n+1)$-simplex. Computing the right side of (21.12), show that

(21.13)    $\chi(S^n) = 2$ if $n$ even, $0$ if $n$ odd.

Conclude that if $n$ is even, there is no smooth nowhere-vanishing vector field on $S^n$, thus obtaining another proof of Proposition 20.4.

FIGURE 21.1 Vector Field on a Triangulation

FIGURE 21.2 A 3-Simplex

7. With $X = S^n \subset \mathbb{R}^{n+1}$, note that the manifold $\partial\Omega$ in (21.9) consists of two copies of $S^n$, with opposite orientations. Compute the degree of the map $N$ in (21.9) and (21.10), and use this to give another derivation of (21.13), granted (21.11).

8. Consider the vector field $R$ on $S^2$ generating rotation about an axis. Show that $R$ has two critical points, at the “poles.” Classify the critical points, compute $\mathrm{Index}(R)$, and compare the $n = 2$ case of (21.13).

9. Show that the computation of the index of a vector field $X$ on a manifold $M$ is independent of orientation and that $\mathrm{Index}(X)$ can be defined when $M$ is not orientable.


A. Nonsmooth vector fields 
Here we establish properties of solutions to the ODE

(A.1)    $\frac{dy}{dt} = F(t, y), \quad y(t_0) = x_0$

of a sort done in §§2–6, under weaker hypotheses than those used there; in particular, we do not require $F$ to be Lipschitz in $y$. For existence, we can assume considerably less:

Proposition A.1. Let $x_0 \in \mathcal{O}$, an open subset of $\mathbb{R}^n$, $I \subset \mathbb{R}$ an interval containing $t_0$. Assume $F$ is continuous on $I \times \mathcal{O}$. Then (A.1) has a solution on some $t$-interval containing $t_0$.


Proof. Without loss of generality, we can assume $F$ is bounded and continuous on $\mathbb{R} \times \mathbb{R}^n$. Take $F_j \in C^\infty(\mathbb{R} \times \mathbb{R}^n)$ such that $|F_j| \le K$ and $F_j \to F$ locally uniformly, and let $y_j \in C^\infty(\mathbb{R})$ be the unique solution to

(A.2)    $\frac{dy_j}{dt} = F_j(t, y_j), \quad y_j(t_0) = x_0,$

whose existence is guaranteed by the material of §2. Thus

(A.3)    $y_j(t) = x_0 + \int_{t_0}^{t} F_j(s, y_j(s))\, ds.$

Now

(A.4)    $|F_j| \le K \Longrightarrow |y_j(t') - y_j(t)| \le K|t' - t|.$

Hence, by Ascoli's theorem (see Proposition 6.2 in Appendix A, Functional Analysis) the sequence $(y_j)$ has a subsequence $(y_{j_\nu})$ which converges locally uniformly: $y_{j_\nu} \to y$. It follows immediately that

(A.5)    $y(t) = x_0 + \int_{t_0}^{t} F(s, y(s))\, ds,$

so $y$ solves (A.1).


Under the hypotheses of Proposition A.1, a solution to (A.1) may not be unique. The following family of examples illustrates the phenomenon. Take $a \in (0, 1)$ and consider

(A.6)    $\frac{dy}{dt} = |y|^a, \quad y(0) = 0.$

Then one solution on $[0, \infty)$ is given by

(A.7)    $y_0(t) = (1 - a)^{1/(1-a)}\, t^{1/(1-a)},$

and another is given by

$y_*(t) = 0.$

Note that, for any $\varepsilon > 0$, the problem $dy/dt = |y|^a$, $y(0) = \varepsilon$ has a unique solution $y_\varepsilon$ on $t \in [0, \infty)$, and $\lim_{\varepsilon \to 0} y_\varepsilon(t) = y_0(t)$. Understanding this provides the key to the following uniqueness result, due to W. Osgood.
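The nonuniqueness in (A.6) and the limit $y_\varepsilon \to y_0$ can be checked numerically. The sketch below is an illustration under stated assumptions: it fixes $a = 1/2$, uses the closed form $y_\varepsilon(t) = ((1-a)t + \varepsilon^{1-a})^{1/(1-a)}$ (obtained by separating variables, and not stated in the text), and measures the ODE residual with a central difference.

```python
import numpy as np

a = 0.5  # illustrative exponent in (0, 1)

def y0(t):
    """The nonzero solution (A.7) of dy/dt = |y|^a, y(0) = 0."""
    return ((1.0 - a) * t) ** (1.0 / (1.0 - a))

def y_eps(t, eps):
    """Solution with y(0) = eps > 0, from separation of variables."""
    return ((1.0 - a) * t + eps ** (1.0 - a)) ** (1.0 / (1.0 - a))

def residual(y, t, h=1e-6):
    """Central-difference approximation of y'(t) - |y(t)|^a."""
    return (y(t + h) - y(t - h)) / (2.0 * h) - np.abs(y(t)) ** a

t = np.linspace(0.1, 2.0, 50)
res_y0 = np.max(np.abs(residual(y0, t)))                  # y0 solves the ODE
res_zero = np.max(np.abs(residual(lambda s: 0.0 * s, t)))  # so does y = 0
gaps = [np.max(np.abs(y_eps(t, eps) - y0(t))) for eps in (1e-2, 1e-4, 1e-6)]
print(res_y0, res_zero, gaps)
```

Both residuals should be negligible, while the gaps shrink as $\varepsilon$ decreases: the perturbed solutions select $y_0$, not the zero solution.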

Let $\omega : \mathbb{R}^+ \to \mathbb{R}^+$ be a modulus of continuity, i.e., $\omega(0) = 0$, $\omega$ is continuous, and increasing. We may as well assume $\omega$ is bounded and $C^\infty$ on $(0, \infty)$.


Proposition A.2. In the setting of Proposition A.1, assume $F$ is continuous on $I \times \mathcal{O}$ and that

(A.8)    $|F(t, y_1) - F(t, y_2)| \le \omega(|y_1 - y_2|),$

for all $t \in I$, $y_j \in \mathcal{O}$. Then solutions to (A.1) (with range in $\mathcal{O}$) are unique, provided

(A.9)    $\int_0^1 \frac{ds}{\omega(s)} = \infty.$


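Proof. If $y_1(t)$ and $y_2(t)$ are two solutions to (A.1), then

(A.10)    $y_1(t) - y_2(t) = \int_{t_0}^{t} \{F(s, y_1(s)) - F(s, y_2(s))\}\, ds.$

Let us set $\theta(t) = |y_1(t) - y_2(t)|$. Hence, by (A.8), for $t \ge t_0$,

(A.11)    $\theta(t) \le \int_{t_0}^{t} \omega(\theta(s))\, ds.$

In particular, for each $\varepsilon > 0$, $\theta(t) \le \int_{t_0}^{t} \omega(\theta(s) + \varepsilon)\, ds$. Since we are assuming $\omega$ is smooth on $(0, \infty)$, we can apply the Gronwall inequality, derived in (5.24)–(5.26), to deduce that

(A.12)    $\theta(t) \le \varphi_\varepsilon(t), \quad \forall\, t \ge t_0,\ \varepsilon > 0,$

where $\varphi_\varepsilon$ is uniquely defined on $[t_0, \infty)$ by

(A.13)    $\varphi_\varepsilon'(t) = \omega(\varphi_\varepsilon(t) + \varepsilon), \quad \varphi_\varepsilon(t_0) = 0.$

Thus

(A.14)    $\int_0^{\varphi_\varepsilon(t)} \frac{d\zeta}{\omega(\zeta + \varepsilon)} = t - t_0.$

Now the hypothesis (A.9) implies

(A.15)    $\lim_{\varepsilon \searrow 0} \varphi_\varepsilon(t) = 0, \quad \forall\, t \ge t_0,$

so we have $\theta(t) = 0$, for all $t \ge t_0$. Similarly, one shows $\theta(t) = 0$, for $t \le t_0$, and uniqueness is proved.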


An important example to which Proposition A.2 applies is

(A.16)    $\omega(s) = s \log\frac{1}{s}, \quad s \le \frac{1}{2}.$

This arises in the study of ideal fluid flow, as will be seen in Chap. 17.
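To see why (A.16) satisfies the Osgood condition (A.9) while the modulus $\omega(s) = s^a$ behind the nonuniqueness example (A.6) does not, one can compute the two integrals near $s = 0$ directly. The following worked computation is an added illustration; it uses the substitution $u = \log(1/s)$ and considers only $s \le 1/2$, since divergence at $s = 0$ is what matters.

```latex
\int_\delta^{1/2} \frac{ds}{s \log(1/s)}
  = \int_{\log 2}^{\log(1/\delta)} \frac{du}{u}
  = \log\log\frac{1}{\delta} - \log\log 2
  \longrightarrow \infty \quad \text{as } \delta \searrow 0,
\qquad \text{whereas} \qquad
\int_0^1 \frac{ds}{s^a} = \frac{1}{1-a} < \infty \quad (0 < a < 1).
```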
A similar argument establishes continuous dependence on initial data. If

(A.17)    $\frac{dy_j}{dt} = F(t, y_j), \quad y_j(t_0) = x_j,$

then

(A.18)    $y_1(t) - y_2(t) = x_1 - x_2 + \int_{t_0}^{t} \{F(s, y_1(s)) - F(s, y_2(s))\}\, ds,$

so $\theta_{12}(t) = |y_1(t) - y_2(t)|$ satisfies

(A.19)    $\theta_{12}(t) \le |x_1 - x_2| + \int_{t_0}^{t} \omega(\theta_{12}(s))\, ds.$

An argument similar to that used above gives (for $t \ge t_0$)

(A.20)    $\theta_{12}(t) \le \vartheta(|x_1 - x_2|, t),$

where, for $a > 0$, $t \ge t_0$, $\vartheta(a, t)$ is the unique solution to

(A.21)    $\partial_t \vartheta = \omega(\vartheta), \quad \vartheta(a, t_0) = a,$

that is,

(A.22)    $\int_a^{\vartheta(a,t)} \frac{d\zeta}{\omega(\zeta)} = t - t_0.$

Again, the hypothesis (A.9) implies

(A.23)    $\lim_{a \searrow 0} \vartheta(a, t) = 0, \quad \forall\, t \ge t_0.$

By (A.20), we have

(A.24)    $|y_1(t) - y_2(t)| \le \vartheta(|x_1 - x_2|, t),$

for all $t \ge t_0$, and a similar argument works for $t \le t_0$.


The Laplace Equation and Wave Equation


Introduction 


In this chapter we introduce the central linear partial differential equations of the second order, the Laplace equation

(0.1)    $\Delta u = f$

and the wave equation

(0.2)    $\left(\frac{\partial^2}{\partial t^2} - \Delta\right) u = f.$

For flat Euclidean space $\mathbb{R}^n$, the Laplace operator is defined by

(0.3)    $\Delta u = \frac{\partial^2 u}{\partial x_1^2} + \cdots + \frac{\partial^2 u}{\partial x_n^2}.$

The wave equation arose early in the history of continuum mechanics, in a 
mathematical description of the motion of vibrating strings and membranes. We 
discuss this in §1. The analysis, based on an appropriate version of Hamilton’s 
stationary action principle, generally produces nonlinear partial differential equa- 
tions, of a sort that will be studied more in Chaps. 14-16. The wave equation 
described by (0.2), which is linear, arises as a “linearized” PDE, describing such 
vibratory motion, as will be seen in §1. 

In this chapter we consider the Laplace operator on a general Riemannian man- 
ifold and emphasize concepts defined in a coordinate-independent fashion. Also, 
more generally than the wave equation (0.2) on the Cartesian product of a spa- 
tial region with the time axis, we consider natural generalizations defined on a 
manifold endowed with a Lorentz metric. 

Before defining the Laplace operator on Riemannian manifolds, we devote two sections to some first-order operators. In §2 we discuss the divergence operator applied to vector fields, and in §3 we generalize the operations of covariant derivative and divergence from vector fields to tensor fields. These concepts play important roles in the study of the Laplace and wave equations.

In §4 we define the Laplace operator acting on real- (or complex-) valued functions on a Riemannian manifold $M$, and in §5 we write down the wave equation
for functions on R x M and discuss energy conservation. In §6 we extend energy 
identities in a way that leads to proofs of results on finite propagation speed for 
solutions to such a wave equation. 

In §7 we extend the notion of the wave equation from R x M to a general 
Lorentz manifold. We extend the notion of energy conservation. To a solution 
of the wave equation is associated a second-order tensor field, the “stress-energy tensor,” and the law of conservation of energy can be expressed as the vanishing of the divergence of this field, as is shown in §7. One can pass from such a “local” conservation law to an integral conservation law via the divergence theorem, for a certain class of Lorentz manifolds, namely those with a timelike Killing field. We derive the phenomenon of “finite propagation speed” for solutions to the wave equation as a consequence of such a conservation law.

In §8 we consider a more general class of hyperbolic equations. To solutions 
we can still associate a tensor with some of the properties of a stress-energy tensor, 
but the energy conservation law may not hold, and instead we look for “energy 
estimates.” 

The Stokes formula used in §2 to derive the divergence theorem is a special 
case of a more general Stokes-type formula, which we discuss in §9. This more 
general formula is used in §10 to produce a variant of Green’s formula for the 
Laplace operator acting on differential forms. In these sections we also make use 
of the notion of the “principal symbol” of a differential operator, as an invariantly 
defined function on the cotangent bundle. 

In §11 we look at Maxwell’s equations for the electromagnetic field. We show
how they can be manipulated to yield the wave equation. This mathematical fact 
will be further exploited in Chap. 6. We deal with Maxwell’s equations in the 
framework of relativity and work with the electromagnetic field on a general 
Lorentz 4-manifold. 

Though we discuss some qualitative properties of solutions to the Laplace 
equation and the wave equation, such as Green’s identities and finite propagation 
speed (in the case of the wave equation), we do not tackle the question of exis- 
tence of solutions in this chapter, except for the very simplest case, namely the 
n=1 case of (0.2), treated in §1. In the case of such equations on flat Euclidean 
space, Fourier analysis provides an adequate tool to construct and analyze solu- 
tions, and this will be developed in the next chapter. Then functional analytical 
methods, centered on the theory of Sobolev spaces, will be developed in Chap. 4 
and applied in subsequent chapters. As we will see in Chap. 6, energy estimates, such as those derived in §8 of this chapter, in concert with Sobolev space theory, form the principal tools for existence theorems for linear hyperbolic equations. Existence of solutions to nonlinear hyperbolic equations, which requires somewhat more subtle analysis, will be studied in Chap. 16.

The problem of describing the motion of a vibrating string was one of the earliest problems of continuum mechanics, producing a partial differential equation. Such a PDE can be derived by a procedure similar to that described in §12 of Chap. 1, using a stationary action principle. To carry this out, we need formulas for the kinetic energy and the potential energy of a vibrating string.

Suppose our string is vibrating in $\mathbb{R}^k$; say its ends are tied down at two points, the origin 0 and a vector $Le_1 \in \mathbb{R}^k$, of length $L$. We suppose the string is uniform, of mass density $m$ (i.e., total mass $mL$). The motion of the string is described by a function $u = u(t, x)$, $t \in \mathbb{R}$, $x \in [0, L]$, taking values in $\mathbb{R}^k$ and satisfying $u(t, 0) = 0$, $u(t, L) = Le_1$, for all $t$. Then the kinetic energy at time $t$ is given by

(1.1)    $T(t) = \frac{m}{2} \int_0^L |u_t(t, x)|^2\, dx,$

and the integral $\int_{t_0}^{t_1} T(t)\, dt$ is given by

(1.2)    $J_0(u) = \frac{m}{2} \iint_{I \times \Omega} |u_t(t, x)|^2\, dx\, dt,$

where $I = (t_0, t_1)$, $\Omega = (0, L)$.
As for the potential energy at a given time $t$, we will use the law that the potential energy in a small piece of string is a function of the degree that the string has been stretched, namely,

(1.3)    $V(t) = \int_0^L f(u_x(t, x))\, dx$

for a function

(1.4)    $f : \mathbb{R}^k \to \mathbb{R}.$

This is known as Hooke’s law. The case of an “ideal” string (where the force exerted by a small piece of string is proportional to the amount by which it has been stretched) is

(1.5)    $f(y) = \sigma(|y| - a)^2,$

where the unstretched string has length $aL < L$ and $\sigma > 0$ is a given constant.
The term accompanying (1.2) in the expression for the action is

(1.6)    $J_1(u) = \iint_{I \times \Omega} f(u_x(t, x))\, dx\, dt.$

The stationary condition according to Hamilton’s principle is

(1.7)    $\frac{d}{ds}(J_0 - J_1)(u + sv)\Big|_{s=0} = 0$

for all $v \in C_0^\infty(I \times \Omega, \mathbb{R}^k)$. A simple computation gives

(1.8)    $\frac{d}{ds} J_0(u + sv)\Big|_{s=0} = \iint_{I \times \Omega} m\, u_t \cdot v_t\, dx\, dt = -\iint_{I \times \Omega} m\, u_{tt} \cdot v\, dx\, dt,$

where the last identity is obtained by integration by parts. Furthermore, also integrating by parts, we have

(1.9)    $\frac{d}{ds} J_1(u + sv)\Big|_{s=0} = \iint_{I \times \Omega} f'(u_x(t, x)) \cdot v_x(t, x)\, dx\, dt = -\iint_{I \times \Omega} \Big\{\frac{\partial}{\partial x} f'(u_x(t, x))\Big\} \cdot v(t, x)\, dx\, dt.$

Note that

(1.10)    $\frac{\partial}{\partial x} f'(u_x(t, x)) = f''(u_x)\, u_{xx},$

where $f''(y)$ is the $k \times k$ matrix valued function of second-order partial derivatives of $f : \mathbb{R}^k \to \mathbb{R}$, and $u_{xx}$ takes values in $\mathbb{R}^k$. In other words,

(1.11)    $\frac{d}{ds} J_1(u + sv)\Big|_{s=0} = -\iint_{I \times \Omega} f''(u_x)\, u_{xx} \cdot v\, dx\, dt.$

Combining (1.8) and (1.11), we see that the stationary condition (1.7) is equivalent to the partial differential equation

(1.12)    $m u_{tt} - f''(u_x)\, u_{xx} = 0.$
If f(y) is a second-order polynomial in y, that is, of the form 
(1.13) fy) =a+b-yt+ Ay-y, 


where a € R,b € R*, and A is areal, symmetric, k x k matrix, then f(y) = 2A, 
and the PDE (1.12) becomes 


(1.14) mur — 2AUre = 0. 
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The example (1.5) does not satisfy this condition, and the resulting PDE is not 
linear. Let us rewrite this PDE, setting 


(1.15) u(t, xv) = ve, + w(t, 2), 


so that w(t, 0) = 0 and w(t, L) = 0 in R*. Then 


(1.16) J(u) = Ky(w) = ff olwe) dr a 

where y : R* — R is given by 

(1.17) oly) = Fler +y), 

and the corresponding PDE for w is 

(1.18) muy — 9" (We )Wee = 0. 

The linearization of this equation is, by definition, obtained by replacing $\varphi(y)$
by its quadratic part, that is, by the terms of order $\le 2$ in its power series about
$y = 0$:

(1.19) $\varphi_0(y) = a_0 + b_0 \cdot y + \frac{1}{2} A_0 y \cdot y,$

where $a_0 = \varphi(0) = f(e_1)$, $b_0 = \varphi'(0) = f'(e_1)$, and $A_0 = \varphi''(0) = f''(e_1)$. For
one reason why the term “linearization” is appropriate, see Exercise 4 at the end
of this section. If $\varphi$ is replaced by $\varphi_0$ in (1.16), the stationary condition yields the
linear PDE

(1.20) $m w_{tt} - A_0 w_{xx} = 0 \quad (A_0 = \varphi''(0)).$
In the case of an ideal string (1.5), this linearized PDE is readily computed to be

(1.21) $m w_{tt} - 2\sigma(I - aP)\,w_{xx} = 0,$

where $P$ is the orthogonal projection of $\mathbb{R}^k$ onto the orthogonal complement of
$e_1$. (Compare the calculations (1.43)-(1.47) and (1.51)-(1.55) below.) Recall that
we are assuming $0 < a < 1$.
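
The computation behind (1.21) can be checked numerically. The following sketch assumes that the ideal-string energy density (1.5), which is not reproduced in this passage, has the form $f(y) = \sigma(|y| - a)^2$; this choice is an assumption, made because its Hessian at $e_1$ reproduces the coefficient $2\sigma(I - aP)$. The values of `sigma`, `a`, and the helper names are illustrative only.

```python
import math

def f(y, sigma, a):
    """Assumed ideal-string energy density f(y) = sigma * (|y| - a)^2."""
    norm = math.sqrt(sum(c * c for c in y))
    return sigma * (norm - a) ** 2

def hessian(func, y0, h=1e-4):
    """Central-difference approximation to the matrix of second partials."""
    k = len(y0)
    H = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(k):
            def shifted(di, dj):
                y = list(y0)
                y[i] += di * h
                y[j] += dj * h
                return func(y)
            H[i][j] = (shifted(1, 1) - shifted(1, -1)
                       - shifted(-1, 1) + shifted(-1, -1)) / (4 * h * h)
    return H

sigma, a = 1.5, 0.8          # illustrative values with 0 < a < 1
e1 = [1.0, 0.0, 0.0]         # k = 3
H = hessian(lambda y: f(y, sigma, a), e1)

# Claimed value 2*sigma*(I - a*P), with P the projection onto the complement of e1.
expected = [[2 * sigma, 0.0, 0.0],
            [0.0, 2 * sigma * (1 - a), 0.0],
            [0.0, 0.0, 2 * sigma * (1 - a)]]
max_error = max(abs(H[i][j] - expected[i][j]) for i in range(3) for j in range(3))
print("largest deviation from 2*sigma*(I - a*P):", max_error)
```

The diagonal entries are the coefficients that appear in the decoupled longitudinal and transverse equations (1.22) and (1.23).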

For this linear equation, we can write $w = w^b + w^\#$, where $w^b$ is parallel to
$e_1$ and $w^\#$ is orthogonal to $e_1$. The equation (1.21) decouples, and we have

(1.22) $m w^b_{tt} - 2\sigma w^b_{xx} = 0$

as the equation for the longitudinal wave $w^b$ and

(1.23) $m w^\#_{tt} - 2\sigma(1 - a)\,w^\#_{xx} = 0,$

as the equation for the transverse wave $w^\#$. Both of these equations are cases
(with different values of $c$) of the wave equation

(1.24) $v_{tt} - c^2 v_{xx} = 0.$


Here $c$ is identified with the propagation speed for solutions to (1.24), for the
following reason. For any $C^2$-functions $f_j$ of one variable,

(1.25) $v(t,x) = f_1(x + ct) + f_2(x - ct)$

is a solution to (1.24). Conversely, the general solution to (1.24) on $(t,x) \in \mathbb{R} \times \mathbb{R}$,
satisfying the initial conditions

(1.26) $v(0,x) = g(x), \quad v_t(0,x) = h(x),$


can be expressed in the form (1.25). Indeed, a solution to (1.24) in the form (1.25)
satisfies these initial conditions if and only if

(1.27) $f_1(x) + f_2(x) = g(x)$ and $c f_1'(x) - c f_2'(x) = h(x).$

This implies $f_1'(x) + f_2'(x) = g'(x)$, so we can solve algebraically for $f_1'$ and $f_2'$;
thus we can set

(1.28) $f_1(x) = \frac{1}{2} g(x) + \frac{1}{2c}\int_0^x h(s)\,ds, \quad f_2(x) = \frac{1}{2} g(x) - \frac{1}{2c}\int_0^x h(s)\,ds.$

That the solution (1.25) so produced is the only solution to (1.24) satisfying the
initial conditions (1.26) is a special case of a uniqueness result proved in §5.
One can arrange that the boundary condition

(1.29) $v(t,0) = v(t,L) = 0$
be satisfied by taking g and h that satisfy 


(1.30) $g(s) = g(s + 2L) = -g(-s), \quad h(s) = h(s + 2L) = -h(-s).$


This is a special case of the method of images, discussed further in Chap. 3, §7. 
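
As an illustration of (1.25)-(1.30), the following sketch builds $v$ from the formulas (1.28) for one particular choice of odd, $2L$-periodic data and checks, by finite differences, that it satisfies (1.24), (1.26), and (1.29). The data `g`, `h` and the values of `L` and `c` are assumptions chosen for the example, not taken from the text.

```python
import math

L = 1.0   # string length (assumed value)
c = 2.0   # propagation speed (assumed value)

def g(x):
    """Initial displacement: odd and 2L-periodic, as (1.30) requires."""
    return math.sin(math.pi * x / L)

def h(x):
    """Initial velocity: also odd and 2L-periodic."""
    return 0.5 * math.sin(2 * math.pi * x / L)

def H(x):
    """Integral of h from 0 to x, the integral appearing in (1.28)."""
    return 0.5 * L / (2 * math.pi) * (1 - math.cos(2 * math.pi * x / L))

def f1(x):
    return 0.5 * g(x) + H(x) / (2 * c)

def f2(x):
    return 0.5 * g(x) - H(x) / (2 * c)

def v(t, x):
    """Solution in the form (1.25)."""
    return f1(x + c * t) + f2(x - c * t)

def pde_residual(t, x, d=1e-3):
    """Finite-difference approximation to v_tt - c^2 v_xx."""
    v_tt = (v(t + d, x) - 2 * v(t, x) + v(t - d, x)) / d ** 2
    v_xx = (v(t, x + d) - 2 * v(t, x) + v(t, x - d)) / d ** 2
    return v_tt - c * c * v_xx

times = [0.0, 0.13, 0.4, 1.7]
max_residual = max(abs(pde_residual(t, x)) for t in times for x in (0.2, 0.5, 0.9))
max_boundary = max(max(abs(v(t, 0.0)), abs(v(t, L))) for t in times)
velocity_error = abs((v(1e-6, 0.3) - v(-1e-6, 0.3)) / 2e-6 - h(0.3))
print(max_residual, max_boundary, velocity_error)
```

The boundary values vanish because $g$ is odd about $0$ and about $L$, while the integral of $h$ is even and $2L$-periodic, so the two terms in (1.25) cancel at $x = 0$ and $x = L$.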

Whenever one has the linear equation (1.14), if A is a positive-definite matrix, 
one can diagonalize A and construct solutions as above. Constructing solutions 
for the equation (1.12), or (1.18) in the nonlinear case, is much more difficult; 
Chap. 16 gives some results for this problem. 
Now we look at the higher-dimensional case, of a vibrating membrane. Let
$\Omega$ be some open region in $\mathbb{R}^n$. We consider vibrations of $\Omega$ in $\mathbb{R}^k$, with $k \ge n$.
Define the inclusion $j : \mathbb{R}^n \to \mathbb{R}^k$ by

$j(x_1, \dots, x_n) = (x_1, \dots, x_n, 0, \dots, 0).$

This time suppose the boundary of $\Omega$ is tied down. The motion of the membrane
is described by a function $u = u(t,x)$, $t \in \mathbb{R}$, $x \in \Omega$, taking values in $\mathbb{R}^k$ and
satisfying $u(t,x) = j(x)$ for $x \in \partial\Omega$. We suppose the membrane is of a uniform
substance, with mass density $m$. The kinetic energy at a given time $t$ is then

(1.31) $T(t) = \frac{m}{2}\int_\Omega |u_t(t,x)|^2\,dx,$


parallel to (1.1), and the integral $\int_I T(t)\,dt = J_0(u)$ is again given by (1.2), with
$\Omega$ now an $n$-dimensional domain. As for the potential energy, we will again work
under the hypothesis that it is a function of the “stretching” of the membrane, of
the form

(1.32) $V(t) = \int_\Omega f(u_x(t,x))\,dx,$

where, for each $(t,x) \in \mathbb{R} \times \Omega$,

(1.33) $u_x(t,x) \in \mathcal{L}(T_x\Omega, \mathbb{R}^k) \approx \mathcal{L}(\mathbb{R}^n, \mathbb{R}^k)$

is the $x$-derivative, and

(1.34) $f : \mathcal{L}(\mathbb{R}^n, \mathbb{R}^k) \to \mathbb{R}$

is a given smooth function. Again $\int_I V(t)\,dt = J_1(u)$ is given by (1.6), the
stationary action principle takes the form (1.7), and the variation of $J_0(u)$ is given
by (1.8). The variation of $J_1(u)$ is also given by a formula of the form (1.11).
More precisely, if we set 


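(1.35) $f = f(y), \quad y = (y_{\mu j}) \in \mathcal{L}(\mathbb{R}^n, \mathbb{R}^k),$

then (1.11) holds, with the interpretation

(1.36) $f''(u_x)\,u_{xx} \cdot v = \sum_{\mu,\nu=1}^{k} \sum_{i,j=1}^{n} \frac{\partial^2 f(u_x)}{\partial y_{\mu i}\,\partial y_{\nu j}}\, \frac{\partial^2 u^\nu}{\partial x_i\,\partial x_j}\, v^\mu,$

where $u = (u^1, \dots, u^k)$, $v = (v^1, \dots, v^k) \in \mathbb{R}^k$. With this notation, the PDE
obtained for $u$ is again of the form (1.12).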
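As in (1.15)-(1.17), we can concentrate on the deviation of $u$ from the map
$j : \Omega \to \mathbb{R}^k$. Set

(1.37) $u(t,x) = j(x) + w(t,x),$

so the boundary condition becomes $w(t,x) = 0$ for $x \in \partial\Omega$; then the PDE for $w$
is of the form (1.18), again interpreted as in (1.36), with

(1.38) $\varphi(y) = f(j + y),$

for $y \in \mathcal{L}(\mathbb{R}^n, \mathbb{R}^k)$. As before, we have the linearized PDE

(1.39) $m w_{tt} - A w_{xx} = 0, \quad A = \varphi''(0),$

where, for $w = (w^1, \dots, w^k)$,

(1.40) $(A w_{xx})^\mu = \sum_{\nu=1}^{k} \sum_{i,j=1}^{n} \frac{\partial^2 \varphi(0)}{\partial y_{\mu i}\,\partial y_{\nu j}}\, \frac{\partial^2 w^\nu}{\partial x_i\,\partial x_j}.$

We can regard $A$ as defining a symmetric bilinear map

(1.41) $A : \mathcal{L}(\mathbb{R}^n, \mathbb{R}^k) \times \mathcal{L}(\mathbb{R}^n, \mathbb{R}^k) \to \mathbb{R}.$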


There are a number of different forms the potential energy function $f(y)$ can
take, depending on the physical properties of the membrane. In a number of models,
one has $f(y) = \nu(y^*y)$, a function invariant under conjugating $y^*y$ by an
orthogonal $n \times n$ matrix. These models have the form

(1.42) $f(y) = \Psi(\mathrm{Tr}\, g_1(y^*y), \dots, \mathrm{Tr}\, g_K(y^*y)),$

where $g_\ell : \mathbb{R} \to \mathbb{R}$ is smooth and, for a self-adjoint matrix $z = y^*y$, $g_\ell(z)$
is defined by the spectral representation: $g_\ell(z)v_j = g_\ell(\lambda_j)v_j$ for $v_j$ in the
$\lambda_j$-eigenspace of $z$. There is no loss in generality in assuming $g_\ell(1) = 0$.

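To compute the linearized PDE when $f(y)$ is given by (1.42), start with

(1.43) $g_\ell((j + y)^*(j + y)) = g_\ell(I + j^*y + y^*j + y^*y) = g_\ell(1) I + g_\ell'(1)(j^*y + y^*j + y^*y) + \frac{1}{2} g_\ell''(1)(j^*y + y^*j)^2 + O(\|y\|^3).$

If $(1/2)\tau = \mathrm{Tr}\, j^*y = \mathrm{Tr}\, y^*j$, $\sigma = \mathrm{Tr}\, y^*y$, and $\gamma = \mathrm{Tr}(j^*y + y^*j)^2$, we obtain

(1.44) $\varphi(y) = f(j + y) = \Psi(0) + \sum_\ell \partial_\ell \Psi(0)\Big[g_\ell'(1)(\tau + \sigma) + \frac{1}{2} g_\ell''(1)\gamma\Big] + \frac{1}{2}\sum_{\ell,m} \partial_\ell \partial_m \Psi(0)\, g_\ell'(1)\, g_m'(1)\, \tau^2 + O(\|y\|^3).$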


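Thus the purely quadratic part, which yields the linearized PDE, is

(1.45) $\varphi_0(y) = \sum_\ell \partial_\ell \Psi(0)\Big[g_\ell'(1)\,\mathrm{Tr}\, y^*y + \frac{1}{2} g_\ell''(1)\,\mathrm{Tr}(j^*y + y^*j)^2\Big] + \frac{1}{2}\sum_{\ell,m} \partial_\ell \partial_m \Psi(0)\, g_\ell'(1)\, g_m'(1)\,\big(\mathrm{Tr}(j^*y + y^*j)\big)^2$
$\qquad = A\,\mathrm{Tr}\, y^*y + B\,\mathrm{Tr}(j^*y + y^*j)^2 + C\big(\mathrm{Tr}(j^*y + y^*j)\big)^2.$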


As in the case of the linearized equations of the vibrating string, the resulting
linear PDE decouples into an equation for the components of $w$ orthogonal to the
space $\mathbb{R}^n \subset \mathbb{R}^k$ in which $\Omega$ sits and an equation for the components of $w$ parallel
to this space. For the orthogonal component $w^\#$, since $j^*w^\# = 0$ in this case, we
can replace $\varphi_0(y)$ by

(1.46) $\psi(y) = A\,\mathrm{Tr}\, y^*y, \quad y \in \mathcal{L}(\mathbb{R}^n, \mathbb{R}^{k-n}).$

In this case, we have

(1.47) $\frac{\partial^2 \psi}{\partial y_{\mu i}\,\partial y_{\nu j}} = 2A\,\delta_{ij}\,\delta_{\mu\nu}.$

Hence the linearized equation for the orthogonal (or transverse) wave is

(1.48) $m w^\#_{tt} - 2A\,\Delta w^\# = 0,$

where $\Delta$ is the Laplace operator on $\mathbb{R}^n$:

(1.49) $\Delta v(x) = \sum_{j=1}^{n} \frac{\partial^2 v}{\partial x_j^2}.$

If $A > 0$, we can rewrite (1.48) in the form

(1.50) $v_{tt} - c^2 \Delta v = 0.$


The equation (1.50) is typically called “the wave equation.” As in (1.24), $c$ is the
propagation speed for waves satisfying (1.50); we will discuss this further in §6.
The construction of solutions to (1.50), satisfying initial conditions of the form
(1.26), is not as elementary for $n > 1$ as the construction for $n = 1$ given by
(1.25)-(1.28). In Chap. 3, we will give a construction, valid for $\Omega = \mathbb{R}^n$, using
Fourier analysis. A symmetry trick similar to (1.30) will work if $\Omega$ is a rectangular
solid in $\mathbb{R}^n$, though not for general bounded regions $\Omega$. The existence and
uniqueness of solutions to the wave equation (1.50) for such more general $\Omega$ are
proven in Chap. 6.

The equation for the components of $w$ parallel to the plane $\mathbb{R}^n$ of $\Omega \subset \mathbb{R}^k$, in
this case, has a somewhat different form, as we now compute. Note that this case
is the same as considering the entire linearized PDE for the case $k = n$. Then
$j$ is the identity map, so the linearization is of the form (1.39)-(1.40), with $\varphi(y)$
replaced by

(1.51) $\psi(y) = A\,\mathrm{Tr}\, y^*y + B\,\mathrm{Tr}(y + y^*)^2 + C\big(\mathrm{Tr}(y + y^*)\big)^2 = (A + 2B)\,\mathrm{Tr}\, y^*y + 2B\,\mathrm{Tr}\, y^2 + 4C(\mathrm{Tr}\, y)^2,$


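since $\mathrm{Tr}\, y^*y = \mathrm{Tr}\, yy^*$ and $\mathrm{Tr}\, y^2 = \mathrm{Tr}\,(y^*)^2$, for a real $n \times n$ matrix $y$. If we
denote the sum of the three terms on the last line in (1.51) by

$\psi_0(y) + \psi_1(y) + \psi_2(y),$

then, as in (1.47),

(1.52) $\frac{\partial^2 \psi_0}{\partial y_{\mu i}\,\partial y_{\nu j}} = (2A + 4B)\,\delta_{ij}\,\delta_{\mu\nu}.$

Also, a brief computation gives

(1.53) $\frac{\partial^2 \psi_1}{\partial y_{\mu i}\,\partial y_{\nu j}} = 4B\,\delta_{\mu j}\,\delta_{\nu i}$

and

(1.54) $\frac{\partial^2 \psi_2}{\partial y_{\mu i}\,\partial y_{\nu j}} = 8C\,\delta_{\mu i}\,\delta_{\nu j}.$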


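Now, when $\varphi$ is replaced by $\psi_0$, the differential operator of the form (1.40) is
$(2A + 4B)\Delta$, similar to the computation giving (1.48). When $\varphi$ is replaced by
$\psi_1 + \psi_2$, the differential operator becomes

(1.55) $(Lw)^\mu = 4B \sum_{\nu,i,j=1}^{n} \delta_{\mu j}\,\delta_{\nu i}\, \frac{\partial^2 w^\nu}{\partial x_i\,\partial x_j} + 8C \sum_{\nu,i,j=1}^{n} \delta_{\mu i}\,\delta_{\nu j}\, \frac{\partial^2 w^\nu}{\partial x_i\,\partial x_j} = (4B + 8C) \sum_{j=1}^{n} \frac{\partial^2 w^j}{\partial x_\mu\,\partial x_j}.$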


We can write this as 


(1.56) Lw = (4B + 8C) grad div w, 
where the divergence of the vector field $w = (w^1, \dots, w^n)$ is

(1.57) $\mathrm{div}\, w = \sum_{j=1}^{n} \frac{\partial w^j}{\partial x_j},$

and, as before, the gradient of a real-valued function on $\mathbb{R}^n$ is

(1.58) $\mathrm{grad}\, u = \Big(\frac{\partial u}{\partial x_1}, \dots, \frac{\partial u}{\partial x_n}\Big).$

Thus the linearized PDE for vibration in the plane of $\Omega$ is

(1.59) $m w_{tt} - (2A + 4B)\Delta w - (4B + 8C)\,\mathrm{grad}\,\mathrm{div}\, w = 0.$


The situation where $k = n$ represents a vibrating elastic solid, and the equation
(1.59) is known as the equation of linear elasticity.

In linear elasticity it is common to linearize about an unstrained state. One
writes (1.59) as

$m w_{tt} - \mu \Delta w - (\lambda + \mu)\,\mathrm{grad}\,\mathrm{div}\, w = 0;$

$\mu = 2A + 4B$ and $\lambda = 8C$ are called Lamé constants. For more on this, see [MH].


We will concentrate primarily on linear equations in this chapter, indeed,
on scalar equations like (1.50). Methods of Chap. 16 will yield results on
nonlinear equations of the form (1.12), in any number of $x$-variables, under a
“hyperbolicity” assumption, which is that, for some $C > 0$,

(1.60) $\sum_{\mu,\nu=1}^{k} \sum_{i,j=1}^{n} \frac{\partial^2 f(y)}{\partial y_{\mu i}\,\partial y_{\nu j}}\, \xi_i \xi_j \tau_\mu \tau_\nu \ge C|\xi|^2 |\tau|^2,$

for $\xi \in \mathbb{R}^n$, $\tau \in \mathbb{R}^k$. A sufficient, though not necessary, condition for this to hold
is that $f$ be a strongly convex function of $y$. For example (in the case $k = n$),
(1.60) holds for

(1.61) $f(y) = a\,\mathrm{Tr}\, y^*y + b\,\mathrm{Tr}\, y^2$

whenever $a > \max(0, -b)$, but such $f$ is strongly convex only if $a > |b|$.
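
The gap between (1.60) and strong convexity can be seen in a small numerical experiment. For the function (1.61), the second-derivative form applied to a matrix $\eta$ is $2a\,\mathrm{Tr}\,\eta^*\eta + 2b\,\mathrm{Tr}\,\eta^2$, and the left side of (1.60) is this form evaluated on the rank-one matrix $\eta_{\mu i} = \tau_\mu \xi_i$. The sketch below, with the illustrative (assumed) values $a = 1$, $b = 2$, $n = 3$, samples rank-one matrices and then tests one skew-symmetric matrix.

```python
import math
import random

def hessian_form(a, b, eta):
    """Second-derivative form of f(y) = a Tr y*y + b Tr y^2, applied to (eta, eta)."""
    n = len(eta)
    norm_sq = sum(eta[m][i] ** 2 for m in range(n) for i in range(n))
    trace_sq = sum(eta[m][i] * eta[i][m] for m in range(n) for i in range(n))
    return 2 * a * norm_sq + 2 * b * trace_sq

def unit_vector(n, rng):
    """Random unit vector in R^n."""
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]
    length = math.sqrt(sum(c * c for c in v))
    return [c / length for c in v]

a, b, n = 1.0, 2.0, 3        # a > max(0, -b) holds, a > |b| fails
rng = random.Random(0)

# Left side of (1.60) on rank-one matrices tau (x) xi with |tau| = |xi| = 1.
samples = []
for _ in range(2000):
    tau, xi = unit_vector(n, rng), unit_vector(n, rng)
    eta = [[t * x for x in xi] for t in tau]
    samples.append(hessian_form(a, b, eta))
rank_one_min = min(samples)

# A skew-symmetric direction, along which convexity fails when a < |b|.
skew = [[0.0, 1.0, 0.0], [-1.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
skew_value = hessian_form(a, b, skew)

print("smallest sampled value of (1.60):", rank_one_min)
print("value on a skew-symmetric matrix:", skew_value)
```

On rank-one matrices the form equals $2a|\xi|^2|\tau|^2 + 2b(\xi \cdot \tau)^2$, which stays positive here, while the skew-symmetric direction gives a negative value, so this $f$ satisfies (1.60) without being convex.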

The notions of divergence, gradient, and Laplacian given above are for the case
of Euclidean space $\mathbb{R}^n$. All these notions extend to more general Riemannian
manifolds. The Laplacian will be defined in such a way as to generalize the
identity

(1.62) $\int_{\mathbb{R}^n} (\Delta u)\,v\,dx = -\int_{\mathbb{R}^n} \mathrm{grad}\, u \cdot \mathrm{grad}\, v\,dx,$

for $u, v \in C_0^\infty(\mathbb{R}^n)$, which follows from the definition (1.49) by integration by
parts. A further identity that generalizes to the case of Riemannian manifolds is

(1.63) $\Delta u = \mathrm{div}\,\mathrm{grad}\, u,$

which for a real-valued function on $\mathbb{R}^n$ follows immediately from the definitions
of div, grad, and $\Delta$ given above.

We will discuss extensions of these concepts to Riemannian manifolds in the 
next few sections, starting with the notion of divergence in §2. Then we will derive 
a number of properties of solutions to wave equations, in §§5-8, and also discuss
an extension of the wave equation (1.50) from the case $\mathbb{R} \times \mathbb{R}^n$ to Lorentz
manifolds. The problem of proving existence of solutions will be tackled only in later
chapters. 

We will state here more precisely what the basic existence problem is. In the 
case of one of the wave equations produced above, say 


(1.64) $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0,$

we desire to find $u$ satisfying this PDE, given initial conditions

(1.65) $u(0,x) = f(x), \quad u_t(0,x) = g(x).$

If $\partial\Omega \ne \emptyset$, we also need to impose a boundary condition. There is in particular
the Dirichlet condition

(1.66) $u(t,x) = 0, \quad \text{for } x \in \partial\Omega,$


in the case of a membrane tied down along $\partial\Omega$, as discussed above. There are other
boundary conditions that arise in other situations, such as the Neumann boundary 
condition described in §5, and others mentioned in subsequent chapters. We also 
can replace (1.64) and (1.66) by nonhomogeneous equations, that is, replace the 
zeros on the right by given functions. 

In this section we have concentrated on evolution equations, involving motion 
with the passage of time. It is also of interest to study stationary problems, where 
there is no time dependence. In other words, one looks for stationary points for 


(1.67) $J(u) = \int_\Omega f(u_x(x))\,dx.$

Thus one obtains a PDE of the form

(1.68) $f''(u_x)\,u_{xx} = 0,$

interpreted via (1.36), as the stationary condition for $J(u)$. In the case $f(u_x) = |u_x|^2$,
this becomes the Laplace equation

(1.69) $\Delta u = 0.$

A typical boundary condition is the nonhomogeneous Dirichlet condition

(1.70) $u = \psi$ on $\partial\Omega$.


The existence of a solution to this will follow from results of Chap. 5. 


Exercises 


1. Compare the formulas (1.22) and (1.23) for longitudinal and transverse waves. For a 
piano wire, a is very close to 1. What does this imply about the relative propagation 
speeds of longitudinal and transverse waves along a piano wire? Which type of waves 
produce audible sounds? 

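2. For a function $f$ appearing in (1.60), to be strongly convex means

(1.71) $\sum_{\mu,\nu,i,j} \frac{\partial^2 f(y)}{\partial y_{\mu i}\,\partial y_{\nu j}}\, \eta_{\mu i}\,\eta_{\nu j} \ge C|\eta|^2, \quad C > 0,$

where $|\eta|^2 = \sum_{\mu,i} |\eta_{\mu i}|^2$. Show that this estimate implies (1.60). Prove the statements
made about $f(y) = a\,\mathrm{Tr}\, y^*y + b\,\mathrm{Tr}\, y^2$ after (1.61).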

3. Suppose more generally that $f(y) = a\,\mathrm{Tr}\, y^*y + b\,\mathrm{Tr}\, y^2 + c(\mathrm{Tr}\, y)^2$. For what values of
$a$, $b$, and $c$ is $f$ strongly convex? For what values of $a$, $b$, and $c$ does one have the strong
ellipticity condition (1.60)?

4. The following exercise relates to the choice of the word “linearization” in describing the
relation between (1.12) and (1.20). For $\Omega \subset \mathbb{R}^n$, bounded with smooth boundary,
define

$F : C^2(\overline{\Omega}, \mathbb{R}^k) \to C(\overline{\Omega}, \mathbb{R}^k)$

by

$F(u) = f''(u_x)\,u_{xx},$

the right side defined by (1.36). Assume $f$ is $C^\infty$. Show that $F$ is differentiable, as a
map between Banach spaces, and that

$DF(j)w = Lw,$

where $Lw = A w_{xx}$, $A = f''(j)$, as defined by (1.40).
5. If $u = u(t,x)$ is a real-valued function on $\mathbb{R} \times \Omega$, show that the PDE for $u$ giving the
stationary condition for the functional (1.67) can be written in the form

(1.72) $\mathrm{div}\, f_p(u_x) = 0,$

where, if $f = f(p) = f(p_1, \dots, p_n)$, then $f_p(u_x)$ is the vector field with components
$(\partial f/\partial p_j)(u_x)$. Compare (5.39).


2. The divergence of a vector field 


Let $M$ be an $n$-dimensional manifold, provided with a volume form $\omega \in \Lambda^n M$.
Let $X$ be a vector field on $M$. Then the divergence of $X$, denoted div $X$, is a
function on $M$ that measures the rate of change of the volume form under the
flow generated by $X$. Thus it is defined by

(2.1) $\mathcal{L}_X \omega = (\mathrm{div}\, X)\,\omega.$

Here, $\mathcal{L}_X$ denotes the Lie derivative. In view of the general formula $\mathcal{L}_X \alpha =
d\alpha \rfloor X + d(\alpha \rfloor X)$, derived in Chap. 1, since $d\omega = 0$ for any $n$-form $\omega$ on $M$, we
have

(2.2) $(\mathrm{div}\, X)\,\omega = d(\omega \rfloor X).$


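If $M = \mathbb{R}^n$, with the standard volume element

(2.3) $\omega = dx_1 \wedge \cdots \wedge dx_n,$

and if

(2.4) $X = \sum_j X^j(x)\,\frac{\partial}{\partial x_j},$

then

(2.5) $\omega \rfloor X = \sum_{j=1}^{n} (-1)^{j-1} X^j(x)\, dx_1 \wedge \cdots \wedge \widehat{dx_j} \wedge \cdots \wedge dx_n.$

Hence, in this case, (2.2) yields the formula used in (1.57):

(2.6) $\mathrm{div}\, X = \sum_{j=1}^{n} \partial_j X^j,$

where we use the notation

(2.7) $\partial_j f = \frac{\partial f}{\partial x_j}.$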
Suppose now that $M$ is an oriented manifold endowed with a Riemannian
metric $g_{jk}(x)$. Then $M$ carries a natural volume element $\omega$, determined by the
condition that, if one has a coordinate system in which $g_{jk}(p_0) = \delta_{jk}$, then
$\omega(p_0) = dx_1 \wedge \cdots \wedge dx_n$. This condition produces the following formula, in
any oriented coordinate system:

(2.8) $\omega = \sqrt{g}\; dx_1 \wedge \cdots \wedge dx_n,$

where

(2.9) $g = \det(g_{jk}).$


In order to derive (2.8), note that if coordinates $y$ are related to $x$ linearly, that is,
$y_j = \sum_k A_{jk} x_k$, then

$\sum_j dy_j^2 = \sum_{j,k,\ell} A_{jk} A_{j\ell}\, dx_k\, dx_\ell = \sum_{k,\ell} g_{k\ell}\, dx_k\, dx_\ell,$

with

$g_{k\ell} = \sum_j A_{kj} A_{j\ell},$

provided $A = (A_{jk})$ is symmetric. Now construct $A$ as the positive-definite
square root of the positive-definite matrix $G = (g_{jk}(x_0))$. In other words, if
$\{v_j\}$ is an orthonormal basis of $\mathbb{R}^n$ with $G v_j = c_j v_j$, set $A v_j = c_j^{1/2} v_j$. The
transformation law for $\Lambda^n A$ on $\Lambda^n \mathbb{R}^n$ gives

$dy_1 \wedge \cdots \wedge dy_n = (\det A)\, dx_1 \wedge \cdots \wedge dx_n = \sqrt{g(x_0)}\; dx_1 \wedge \cdots \wedge dx_n,$

from which the formula (2.8) follows.
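We now compute div $X$ when the volume element on $M$ is given by (2.8). We
have

(2.10) $\omega \rfloor X = \sum_j (-1)^{j-1} X^j \sqrt{g}\; dx_1 \wedge \cdots \wedge \widehat{dx_j} \wedge \cdots \wedge dx_n$

and hence

(2.11) $d(\omega \rfloor X) = \partial_j(\sqrt{g}\, X^j)\; dx_1 \wedge \cdots \wedge dx_n.$

Here, as below, we use the summation convention. Hence the formula (2.2) gives

(2.12) $\mathrm{div}\, X = g^{-1/2}\,\partial_j(g^{1/2} X^j).$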
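
As a concrete instance of (2.12), consider polar coordinates $(r, \theta)$ on the plane, where the Euclidean metric is $dr^2 + r^2\,d\theta^2$ and so $\sqrt{g} = r$. The sketch below takes the vector field with Cartesian components $(x^2, xy)$, an arbitrary choice made for illustration, converts it to polar components, and evaluates (2.12) by central differences; the result should agree with the Cartesian divergence $3x$ from (2.6).

```python
import math

def cartesian_field(x, y):
    """Example field X = x^2 d/dx + x*y d/dy, with Cartesian divergence 3x."""
    return (x * x, x * y)

def polar_components(r, theta):
    """Components (X^r, X^theta) of the same field in polar coordinates."""
    x, y = r * math.cos(theta), r * math.sin(theta)
    X1, X2 = cartesian_field(x, y)
    return ((x * X1 + y * X2) / r, (-y * X1 + x * X2) / (r * r))

def sqrt_g(r, theta):
    """Square root of det(g_jk) for the metric dr^2 + r^2 dtheta^2."""
    return r

def divergence(r, theta, h=1e-5):
    """Formula (2.12): g^{-1/2} d_j (g^{1/2} X^j), via central differences."""
    def flux(j, r_, theta_):
        return sqrt_g(r_, theta_) * polar_components(r_, theta_)[j]
    d_r = (flux(0, r + h, theta) - flux(0, r - h, theta)) / (2 * h)
    d_theta = (flux(1, r, theta + h) - flux(1, r, theta - h)) / (2 * h)
    return (d_r + d_theta) / sqrt_g(r, theta)

r0, theta0 = 1.3, 0.7
polar_value = divergence(r0, theta0)
cartesian_value = 3 * r0 * math.cos(theta0)
print(polar_value, cartesian_value)
```

Here $X^r = r^2\cos\theta$ and $X^\theta = 0$, so (2.12) reduces to $r^{-1}\partial_r(r^3\cos\theta) = 3r\cos\theta$, which is the value the finite-difference computation approximates.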
We next derive a result known as the divergence theorem, as a consequence
of Stokes’ formula, proved in Chap. 1. Recall that Stokes’ formula for differential
forms is

(2.13) $\int_M d\alpha = \int_{\partial M} \alpha,$

for an $(n - 1)$-form $\alpha$ on $M$, assumed to be a smooth, compact, oriented manifold
with boundary. If $\alpha = \omega \rfloor X$, the formula (2.2) gives

(2.14) $\int_M (\mathrm{div}\, X)\,\omega = \int_{\partial M} \omega \rfloor X.$


This is one form of the divergence theorem. We will produce an alternative expres- 
sion for the integrand on the right before stating the result formally. 

Given that $\omega$ is the volume form for $M$ determined by a Riemannian metric,
we can write the interior product $\omega \rfloor X$ in terms of the volume element $\omega_\partial$ on
$\partial M$, with its induced Riemannian metric, as follows. Pick normal coordinates
on $M$, centered at $p_0 \in \partial M$, such that $\partial M$ is tangent to the hyperplane $\{x_n = 0\}$
at $p_0 = 0$. Then it is clear that, at $p_0$,

(2.15) $j^*(\omega \rfloor X) = \langle X, \nu\rangle\,\omega_\partial,$

where $\nu$ is the unit vector normal to $\partial M$, pointing out of $M$, and $j : \partial M \hookrightarrow M$
is the natural inclusion. The two sides of (2.15), which are both defined in a
coordinate-independent fashion, are hence equal on $\partial M$, and the identity (2.14)
becomes

(2.16) $\int_M (\mathrm{div}\, X)\,\omega = \int_{\partial M} \langle X, \nu\rangle\,\omega_\partial.$

Finally, we adopt the following common notation: we denote the volume element
on $M$ by $dV$ and that on $\partial M$ by $dS$, obtaining the divergence theorem:

Theorem 2.1. If $M$ is a compact manifold with boundary, $X$ a smooth vector
field on $M$, then

(2.17) $\int_M (\mathrm{div}\, X)\,dV = \int_{\partial M} \langle X, \nu\rangle\,dS,$

where $\nu$ is the unit outward-pointing normal to $\partial M$.


The only point left to mention here is that $M$ need not be orientable. Indeed, we
can treat dV and dS as measures and note that all objects in (2.17) are indepen- 
dent of a choice of orientation. To prove the general case, just use a partition of 
unity supported on orientable pieces. 
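
A quick numerical check of (2.17) can be made in the simplest setting, the closed unit disk in $\mathbb{R}^2$ with the Euclidean metric. The field $X = (x^3, y^3)$ and the grid sizes below are illustrative assumptions; both sides of (2.17) are approximated with the midpoint rule, and both should be close to $3\pi/2$.

```python
import math

def field(x, y):
    """Example vector field X = (x^3, y^3) on the unit disk."""
    return (x ** 3, y ** 3)

def div_field(x, y):
    """Euclidean divergence of X, computed by hand from (2.6)."""
    return 3 * x * x + 3 * y * y

def interior_integral(n_r=200, n_theta=200):
    """Midpoint rule for the integral of div X over the disk, dV = r dr dtheta."""
    dr, dth = 1.0 / n_r, 2 * math.pi / n_theta
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        for k in range(n_theta):
            th = (k + 0.5) * dth
            total += div_field(r * math.cos(th), r * math.sin(th)) * r * dr * dth
    return total

def boundary_integral(n_theta=200):
    """Midpoint rule for the integral of <X, nu> over the unit circle, dS = dtheta."""
    dth = 2 * math.pi / n_theta
    total = 0.0
    for k in range(n_theta):
        th = (k + 0.5) * dth
        nu = (math.cos(th), math.sin(th))      # outward unit normal
        X = field(nu[0], nu[1])                # field evaluated on the boundary
        total += (X[0] * nu[0] + X[1] * nu[1]) * dth
    return total

lhs = interior_integral()
rhs = boundary_integral()
print(lhs, rhs, 1.5 * math.pi)
```

Analytically, the left side is $\int_0^{2\pi}\!\int_0^1 3r^3\,dr\,d\theta = 3\pi/2$ and the right side is $\int_0^{2\pi} (\cos^4\theta + \sin^4\theta)\,d\theta = 3\pi/2$.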
The definition of the divergence of a vector field given by (2.1), in terms of
how the flow generated by the vector field magnifies or diminishes volumes, is a 
good geometrical characterization, explaining the use of the term “divergence.” 
There are other characterizations of the divergence operation, of a more analytical 
flavor, which are also quite useful. Here is one. 


Proposition 2.2. The divergence operation is the negative of the adjoint of the
gradient operation on vector fields; if $X$ is a vector field and $u$ a function on $M$,
one compactly supported on the interior of $M$, then

(2.18) $(X, \mathrm{grad}\, u)_{L^2(M)} = -(\mathrm{div}\, X, u)_{L^2(M)}.$

The asserted integral identity here is

$\int_M \langle X, \mathrm{grad}\, u\rangle\,dV(x) = -\int_M (\mathrm{div}\, X)\,u\,dV(x),$

provided either $u$ or $X$ has compact support in the interior of $M$. Note that

$\langle X, \mathrm{grad}\, u\rangle = \langle X, du\rangle = Xu.$

In fact, we will use the divergence theorem to obtain a more general result, in
which neither $u$ nor $X$ is required to vanish on $\partial M$. We apply (2.17) with $X$
replaced by $uX$. We have the following “derivation” identity:

(2.19) $\mathrm{div}\, uX = u\,\mathrm{div}\, X + \langle du, X\rangle = u\,\mathrm{div}\, X + Xu,$


which follows easily from the formula (2.12). The divergence theorem immedi- 
ately gives the following result. 


Proposition 2.3. If $M$ is a smooth, compact manifold with boundary, $u$ a smooth
function, $X$ a smooth vector field on $M$, then

(2.20) $\int_M (\mathrm{div}\, X)\,u\,dV + \int_M Xu\,dV = \int_{\partial M} \langle X, \nu\rangle\,u\,dS.$

We can also express the adjoint of the differential operator $X$, defined by

(2.21) $\int_M (X^*u)\,\overline{v}\,dV = \int_M u\,(\overline{Xv})\,dV,$

for $v \in C_0^\infty(M)$, using the divergence, as follows:

Proposition 2.4. If $X$ is a smooth vector field on $M$, then

(2.22) $X^*u = -Xu - (\mathrm{div}\, X)\,u.$
This is equivalent to the statement that

(2.23) $\int_M \big[(Xu)v + u(Xv)\big]\,dV = -\int_M (\mathrm{div}\, X)\,uv\,dV,$

for $u, v \in C_0^\infty(M)$. In fact, from (2.20) we can obtain the following more general
result.

Proposition 2.5. If $u$ and $v$ are smooth functions and $X$ a smooth vector field on
a compact manifold $M$ with boundary, then

(2.24) $\int_M \big[(Xu)v + u(Xv)\big]\,dV = -\int_M (\mathrm{div}\, X)\,uv\,dV + \int_{\partial M} \langle X, \nu\rangle\,uv\,dS.$
! —— 


Proof. Replace u by uv in (2.20) and use the derivation identity X(uv) = 
(Xu)v + u(Xv). 


Exercises 


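1. Given a Hamiltonian vector field

$H_f = \sum_{j=1}^{n} \Big[\frac{\partial f}{\partial \xi_j}\frac{\partial}{\partial x_j} - \frac{\partial f}{\partial x_j}\frac{\partial}{\partial \xi_j}\Big],$

calculate div $H_f$ directly from (2.6).
2. If $M$ is a smooth domain in $\mathbb{R}^2$, apply the divergence theorem (2.17) to the vector field
$X = g\,\partial/\partial x - f\,\partial/\partial y$ to deduce Green’s formula:

$\int_{\partial M} (f\,dx + g\,dy) = \iint_M \Big(\frac{\partial g}{\partial x} - \frac{\partial f}{\partial y}\Big)\,dx\,dy.$

3. Show that the identity (2.19) for div $(uX)$ follows from (2.2) and

$du \wedge (\omega \rfloor X) = (Xu)\,\omega.$

Prove this identity, for any $n$-form $\omega$ on $M^n$. What happens if $\omega$ is replaced by a $k$-form,
$k < n$?
4. Relate Exercise 3 to the calculations

(2.25) $\mathcal{L}_{uX}\alpha = u\,\mathcal{L}_X\alpha + du \wedge (\iota_X \alpha)$

and

(2.26) $du \wedge (\iota_X \alpha) = -\iota_X(du \wedge \alpha) + (Xu)\,\alpha,$

valid for any $k$-form $\alpha$. The last identity follows from (13.37) of Chap. 1; compare with
formula (10.27) of this chapter.
5. Show that
$\mathrm{div}\,[X, Y] = X(\mathrm{div}\, Y) - Y(\mathrm{div}\, X).$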


3. The covariant derivative and divergence of tensor fields 


The covariant derivative of a vector field on a Riemannian manifold was intro- 
duced in Chap. 1, §11, in connection with the study of geodesics. We will briefly 
recall this concept here and relate the divergence of a vector field to the covari- 
ant derivative, before generalizing these notions to apply to more general ten- 
sor fields. A still more general setting for covariant derivatives is discussed in 
Appendix C. 

If $X$ and $Y$ are vector fields on a Riemannian manifold $M$, then $\nabla_X Y$ is a
vector field on $M$, the covariant derivative of $Y$ with respect to $X$. We have the
properties

(3.1) $\nabla_{fX} Y = f\,\nabla_X Y$

and

(3.2) $\nabla_X (fY) = f\,\nabla_X Y + (Xf)\,Y,$

the latter being the derivation property. Also, $\nabla$ is related to the metric on $M$ by

(3.3) $Z\langle X, Y\rangle = \langle \nabla_Z X, Y\rangle + \langle X, \nabla_Z Y\rangle,$

where $\langle X, Y\rangle = g_{jk} X^j Y^k$ is the inner product on tangent vectors. The Levi-Civita
connection on $M$ is uniquely specified by (3.1)-(3.3) and the torsion-free
property:

(3.4) $\nabla_X Y - \nabla_Y X = [X, Y].$
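There is the explicit defining formula (derived already in (11.22) of Chap. 1)

(3.5) $2\langle \nabla_X Y, Z\rangle = X\langle Y, Z\rangle + Y\langle X, Z\rangle - Z\langle X, Y\rangle + \langle [X, Y], Z\rangle - \langle [X, Z], Y\rangle - \langle [Y, Z], X\rangle,$

which follows from cyclically permuting $X$, $Y$, and $Z$ in (3.3) and combining the
results, exploiting (3.4) to cancel out all covariant derivatives but one. Another
way of writing this is the following. If

(3.6) $X = X^k D_k, \quad D_k = \frac{\partial}{\partial x_k}$ (summation convention),

then

(3.7) $\nabla_{D_j} X = X^k{}_{;j}\, D_k,$

with

(3.8) $X^k{}_{;j} = \partial_j X^k + \sum_\ell \Gamma^k{}_{\ell j} X^\ell,$

where the “connection coefficients” are given by the formula

(3.9) $\Gamma^\ell{}_{jk} = \frac{1}{2}\, g^{\ell\mu}\Big[\frac{\partial g_{j\mu}}{\partial x_k} + \frac{\partial g_{k\mu}}{\partial x_j} - \frac{\partial g_{jk}}{\partial x_\mu}\Big],$

equivalent to (3.5). We also recall that $\partial g_{\ell m}/\partial x_j$ can be recovered from $\Gamma^\ell{}_{jk}$:

(3.10) $\frac{\partial g_{\ell m}}{\partial x_j} = g_{mk}\,\Gamma^k{}_{\ell j} + g_{\ell k}\,\Gamma^k{}_{mj}.$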


The divergence of a vector field has an important expression in terms of the 
covariant derivative. 


Proposition 3.1. Given a vector field $X$ with components $X^k$ as in (3.6),

(3.11) $\mathrm{div}\, X = X^j{}_{;j}.$

Proof. This can be deduced from our previous formula for div $X$,

(3.12) $\mathrm{div}\, X = g^{-1/2}\,\partial_j(g^{1/2} X^j) = \partial_j X^j + (\partial_j \log g^{1/2})\,X^j.$

One way to see this is the following. We can think of $\nabla X$ as defining a tensor
field of type (1, 1):

(3.13) $(\nabla X)(Y) = \nabla_Y X.$

Then the right side of (3.11) is the trace of such a tensor field:

(3.14) $X^j{}_{;j} = \mathrm{Tr}\, \nabla X.$

This is clearly defined independently of any choice of coordinate system. If
we choose an exponential coordinate system centered at a point $p \in M$, then
$g_{jk}(p) = \delta_{jk}$ and $\partial g_{jk}/\partial x_\ell = 0$ at $p$, so (3.12) gives div $X = \partial_j X^j$ at $p$, in this
coordinate system, while the right side of (3.11) is equal to $\partial_j X^j + \Gamma^j{}_{\ell j} X^\ell =
\partial_j X^j$ at $p$. This proves the identity (3.11).
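
The identity (3.11) can also be tested directly in a coordinate system that is not exponential. The sketch below uses polar coordinates on the plane, with metric $dr^2 + r^2\,d\theta^2$ (an illustrative choice), computes the connection coefficients from (3.9) by central differences, and evaluates $X^j{}_{;j}$ via (3.8) for the field with polar components $(r^2\cos\theta, 0)$. The helper names are hypothetical.

```python
import math

def metric(p):
    """Metric g_jk of the Euclidean plane in polar coordinates p = (r, theta)."""
    r, theta = p
    return [[1.0, 0.0], [0.0, r * r]]

def inverse2(m):
    """Inverse of a 2 x 2 matrix."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det], [-m[1][0] / det, m[0][0] / det]]

def d_metric(p, k, h=1e-5):
    """Central difference of g_ij in coordinate direction k."""
    plus, minus = list(p), list(p)
    plus[k] += h
    minus[k] -= h
    gp, gm = metric(plus), metric(minus)
    return [[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(2)] for i in range(2)]

def christoffel(p):
    """G[l][j][k] approximates Gamma^l_{jk}, following formula (3.9)."""
    ginv = inverse2(metric(p))
    dg = [d_metric(p, k) for k in range(2)]   # dg[k][i][j] = d g_ij / d x_k
    return [[[0.5 * sum(ginv[l][m] * (dg[k][j][m] + dg[j][k][m] - dg[m][j][k])
                        for m in range(2))
              for k in range(2)] for j in range(2)] for l in range(2)]

def X(p):
    """Polar components (X^r, X^theta) of the example field."""
    r, theta = p
    return (r * r * math.cos(theta), 0.0)

def covariant_divergence(field, p, h=1e-5):
    """X^j_{;j} = d_j X^j + Gamma^j_{lj} X^l, combining (3.8) and (3.11)."""
    G = christoffel(p)
    total = 0.0
    for j in range(2):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        total += (field(plus)[j] - field(minus)[j]) / (2 * h)
        total += sum(G[j][l][j] * field(p)[l] for l in range(2))
    return total

p0 = (1.3, 0.7)
G0 = christoffel(p0)
contracted = G0[0][0][0] + G0[1][0][1]     # Gamma^j_{rj}, which should equal 1/r
div_value = covariant_divergence(X, p0)
print(G0[0][1][1], G0[1][0][1], contracted, div_value)
```

For this metric the nonzero coefficients are $\Gamma^r{}_{\theta\theta} = -r$ and $\Gamma^\theta{}_{r\theta} = \Gamma^\theta{}_{\theta r} = 1/r$, and the computed divergence should match the value $3r\cos\theta$ given by (3.12) with $g^{1/2} = r$.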
The covariant derivative can be applied to forms, and other tensors, by requiring
$\nabla$ to be a derivation. On scalar functions, set

(3.15) $\nabla_X u = Xu.$

For a 1-form $\alpha$, $\nabla_X \alpha$ is characterized by the identity

(3.16) $\langle Y, \nabla_X \alpha\rangle = X\langle Y, \alpha\rangle - \langle \nabla_X Y, \alpha\rangle.$

Denote by $\mathfrak{X}(M)$ the space of smooth vector fields on $M$, and by $\Lambda^1(M)$ the
space of smooth 1-forms; each of these is a module over $C^\infty(M)$. Generally, a
tensor field of type $(k, j)$ defines a map (with $j$ factors of $\mathfrak{X}(M)$ and $k$ of $\Lambda^1(M)$)

(3.17) $F : \mathfrak{X}(M) \times \cdots \times \mathfrak{X}(M) \times \Lambda^1(M) \times \cdots \times \Lambda^1(M) \to C^\infty(M),$

which is linear in each factor, over the ring $C^\infty(M)$. A vector field is of type
(1, 0) and a 1-form is of type (0, 1). The covariant derivative $\nabla_X F$ is a tensor of
the same type, defined by

(3.18) $(\nabla_X F)(Y_1, \dots, Y_j, \alpha_1, \dots, \alpha_k) = X \cdot \big(F(Y_1, \dots, Y_j, \alpha_1, \dots, \alpha_k)\big)$
$\qquad - \sum_{\ell=1}^{j} F(Y_1, \dots, \nabla_X Y_\ell, \dots, Y_j, \alpha_1, \dots, \alpha_k)$
$\qquad - \sum_{\ell=1}^{k} F(Y_1, \dots, Y_j, \alpha_1, \dots, \nabla_X \alpha_\ell, \dots, \alpha_k),$

where $\nabla_X \alpha_\ell$ is uniquely defined by (3.16). We can naturally consider $\nabla F$ as a
tensor field of type $(k, j + 1)$:

(3.19) $(\nabla F)(X, Y_1, \dots, Y_j, \alpha_1, \dots, \alpha_k) = (\nabla_X F)(Y_1, \dots, Y_j, \alpha_1, \dots, \alpha_k).$

For example, if $Z$ is a vector field, $\nabla Z$ is a tensor field of type (1, 1), as already
anticipated in (3.13). Hence it makes sense to consider the tensor field $\nabla(\nabla Z)$,
of type (1, 2). For vector fields $X$ and $Y$, we define the Hessian $\nabla^2_{(X,Y)} Z$ to be
the vector field characterized by

(3.20) $\langle \nabla^2_{(X,Y)} Z, \alpha\rangle = (\nabla\nabla Z)(X, Y, \alpha).$

Since, by (3.19), if $F = \nabla Z$, we have

(3.21) $F(Y, \alpha) = \langle \nabla_Y Z, \alpha\rangle,$

and, by (3.18), 
(3.22) $(\nabla_X F)(Y, \alpha) = X \cdot (F(Y, \alpha)) - F(\nabla_X Y, \alpha) - F(Y, \nabla_X \alpha),$

it follows by substituting (3.21) into (3.22) and using (3.16) that

(3.23) $\nabla^2_{(X,Y)} Z = \nabla_X \nabla_Y Z - \nabla_{(\nabla_X Y)} Z;$

this is a useful formula for the Hessian of a vector field.
More generally, for any tensor field $F$, of type $(j, k)$, the Hessian $\nabla^2_{(X,Y)} F$,
also of type $(j, k)$, is defined in terms of the tensor field $\nabla^2 F = \nabla(\nabla F)$, of type
$(j, k + 2)$, by the same type of formula as (3.20), and we have

(3.24) $\nabla^2_{(X,Y)} F = \nabla_X(\nabla_Y F) - \nabla_{(\nabla_X Y)} F,$

by an argument similar to that for (3.23).
The metric tensor $g$ is of type (0, 2), and the identity (3.3) is equivalent to

(3.25) $\nabla_X g = 0$

for all vector fields $X$ (i.e., to $\nabla g = 0$). In index notation, this means

(3.26) $g_{jk;\ell} = 0$ or, equivalently, $g^{jk}{}_{;\ell} = 0.$

We also note that the zero torsion condition (3.4) implies

(3.27) $u_{;j;k} = u_{;k;j}$

when $u$ is a smooth scalar function, with second covariant derivative $\nabla\nabla u$, a
tensor field of type (0,2). It turns out that analogous second-order derivatives 
of a vector field differ by a term arising from the curvature tensor; this point is 
discussed in Appendix C, Connections and Curvature. 

We have seen an expression for the divergence of a vector field in terms of the 
covariant derivative. We can use this latter characterization to provide a general 
notion of divergence of a tensor field. If $T$ is a tensor field of type $(k, j)$, with
components

(3.28) $T_\alpha{}^\beta = T_{\alpha_1 \cdots \alpha_j}{}^{\beta_1 \cdots \beta_k}$

in a given coordinate system, then div $T$ is a tensor field of type $(k - 1, j)$, with
components

(3.29) $T_{\alpha_1 \cdots \alpha_j}{}^{\beta_1 \cdots \beta_{k-1}\ell}{}_{;\ell}.$

In view of the special role played by the last index, the divergence of a tensor
field $T$ is mainly interesting when $T$ has some symmetry property. In §7 we will
introduce the stress-energy tensor, a symmetric second-order covariant tensor;
raising indices produces a symmetric second-order tensor field of type (2, 0),
whose divergence is an important object.

In view of (3.11), we know that a vector field $X$ generates a volume-preserving
flow if and only if $X^j{}_{;j} = 0$. Complementing this, we investigate the condition
that the flow generated by $X$ consists of isometries, that is, the flow leaves the
metric $g$ invariant, or equivalently

(3.30) $\mathcal{L}_X g = 0.$

For vector fields $U$ and $V$, we have

(3.31) $(\mathcal{L}_X g)(U, V) = -\langle \mathcal{L}_X U, V\rangle - \langle U, \mathcal{L}_X V\rangle + X\langle U, V\rangle$
$\qquad = \langle \nabla_X U - \mathcal{L}_X U, V\rangle + \langle U, \nabla_X V - \mathcal{L}_X V\rangle$
$\qquad = \langle \nabla_U X, V\rangle + \langle U, \nabla_V X\rangle,$

where the first identity follows from the derivation property of $\mathcal{L}_X$, the second
from the metric property (3.3) expressing $X\langle U, V\rangle$ in terms of covariant derivatives,
and the third from the zero torsion condition (3.4). If $U$ and $V$ are coordinate
vector fields $D_j = \partial/\partial x_j$, we can write this identity as

(3.32) $(\mathcal{L}_X g)(D_j, D_k) = g_{k\ell} X^\ell{}_{;j} + g_{j\ell} X^\ell{}_{;k}.$

Thus $X$ generates a group of isometries (one says $X$ is a Killing field) if and
only if

(3.33) $g_{k\ell} X^\ell{}_{;j} + g_{j\ell} X^\ell{}_{;k} = 0.$

This takes a slightly shorter form for the covariant field

(3.34) $X_j = g_{jk} X^k.$

We state formally the consequence, which follows immediately from (3.33) and
the vanishing of the covariant derivatives of the metric tensor.

Proposition 3.2. $X$ is a Killing vector field if and only if

(3.35) $X_{j;k} + X_{k;j} = 0.$


Generally, half the quantity on the left side of (3.35) is called the deformation
tensor of $X$. If we denote by $\xi$ the 1-form $\xi = \sum X_j\,dx_j$, the deformation tensor
is the symmetric part of $\nabla\xi$, a tensor field of type (0, 2). It is also useful to identify
the antisymmetric part, which is naturally regarded as a 2-form.

Proposition 3.3. We have

(3.36) $d\xi = \frac{1}{2}\sum_{j,k} (X_{j;k} - X_{k;j})\,dx_k \wedge dx_j.$

Proof. By definition,

(3.37) $d\xi = \frac{1}{2}\sum_{j,k} (\partial_k X_j - \partial_j X_k)\,dx_k \wedge dx_j,$

and the identity with the right side of (3.36) follows from the symmetry $\Gamma^\ell{}_{jk} =
\Gamma^\ell{}_{kj}$.

There is a useful generalization of the concept of a Killing field, namely a conformal
Killing field, which is a vector field $X$ whose flow $F_t$ consists of conformal
diffeomorphisms of $M$, that is, preserves the metric tensor up to a scalar factor:

(3.38) $F_t^* g = \alpha(t,x)\,g \Longrightarrow \mathcal{L}_X g = \lambda(x)\,g.$

Note that the trace of $\mathcal{L}_X g$ is 2 div $X$, by (3.32), so the last identity in (3.38) is
equivalent to $\mathcal{L}_X g = (2/n)(\mathrm{div}\, X)\,g$ or, with $(1/2)\mathcal{L}_X g = \mathrm{Def}\, X$,

(3.39) $\mathrm{Def}\, X - \frac{1}{n}(\mathrm{div}\, X)\,g = 0$

is the equation of a conformal Killing field.
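
In Euclidean space with its standard coordinates the connection coefficients vanish, so $X_{j;k} = \partial_k X_j$ and the conditions (3.35) and (3.39) can be checked by differentiating components. The sketch below does this by central differences for three example fields on $\mathbb{R}^2$ (a rotation, a dilation, and a shear, all chosen for illustration); the function names are hypothetical.

```python
def jacobian(field, p, h=1e-6):
    """J[j][k] approximates d X_j / d x_k by central differences."""
    n = len(p)
    J = [[0.0] * n for _ in range(n)]
    for k in range(n):
        plus, minus = list(p), list(p)
        plus[k] += h
        minus[k] -= h
        fp, fm = field(plus), field(minus)
        for j in range(n):
            J[j][k] = (fp[j] - fm[j]) / (2 * h)
    return J

def deformation(field, p):
    """Deformation tensor: half of X_{j;k} + X_{k;j}, with flat metric."""
    J = jacobian(field, p)
    n = len(p)
    return [[0.5 * (J[j][k] + J[k][j]) for k in range(n)] for j in range(n)]

def conformal_defect(field, p):
    """Def X - (1/n)(div X) g with g the identity: the left side of (3.39)."""
    D = deformation(field, p)
    n = len(p)
    div = sum(D[j][j] for j in range(n))
    return [[D[j][k] - (div / n if j == k else 0.0) for k in range(n)]
            for j in range(n)]

def max_abs(matrix):
    return max(abs(entry) for row in matrix for entry in row)

def rotation(p):
    return (-p[1], p[0])     # generates rotations about the origin

def dilation(p):
    return (p[0], p[1])      # generates dilations x -> e^t x

def shear(p):
    return (p[1], 0.0)       # generates shears, which distort angles

point = (0.4, -1.1)
rotation_def = max_abs(deformation(rotation, point))
dilation_def = max_abs(deformation(dilation, point))
dilation_conf = max_abs(conformal_defect(dilation, point))
shear_conf = max_abs(conformal_defect(shear, point))
print(rotation_def, dilation_def, dilation_conf, shear_conf)
```

The rotation field has vanishing deformation tensor, so it is a Killing field; the dilation field is not Killing but satisfies (3.39); the shear satisfies neither condition.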

To end this section, and prepare for subsequent material, we note that concepts 
developed so far for Riemannian manifolds, that is, manifolds with positive- 
definite metric tensors, have extensions to indefinite metric tensors, including 
Lorentz metrics. 

A Riemannian metric tensor produces a symmetric isomorphism

(3.40) $G : T_x M \to T_x^* M,$

which is positive. More generally, a symmetric isomorphism (3.40) corresponds
to a nondegenerate metric tensor. Such a tensor has a well-defined signature
$(j, k)$, $j + k = n = \dim M$; at each $x \in M$, $T_x M$ has a basis $\{e_1, \dots, e_n\}$
of mutually orthogonal vectors such that $\langle e_1, e_1\rangle = \cdots = \langle e_j, e_j\rangle = 1$, while
$\langle e_{j+1}, e_{j+1}\rangle = \cdots = \langle e_n, e_n\rangle = -1$. If $j = 1$ (or $k = 1$), we say $M$ has a
Lorentz metric.

The concepts discussed in this section in the Riemannian case, such as the 
covariant derivative, all extend with little change to the general nondegenerate 
case. We will see this in use, in the Lorentz case, in §7. 
Exercises


1. Let $\varphi$ be a tensor field of type $(0, k)$ on a Riemannian manifold, endowed with its
Levi-Civita connection. Show that

$(\mathcal{L}_X \varphi - \nabla_X \varphi)(U_1, \dots, U_k) = \sum_j \varphi(U_1, \dots, \nabla_{U_j} X, \dots, U_k).$

How does this generalize (3.31)?
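2. Recall the formula (13.56) of Chap. 1, when $\omega$ is a $k$-form:

$(d\omega)(X_0, \dots, X_k) = \sum_{j=0}^{k} (-1)^j X_j \cdot \omega(X_0, \dots, \widehat{X}_j, \dots, X_k) + \sum_{0 \le \ell < j \le k} (-1)^{j+\ell}\, \omega([X_\ell, X_j], X_0, \dots, \widehat{X}_\ell, \dots, \widehat{X}_j, \dots, X_k).$

Show that the last double sum can be replaced by

$-\sum_{\ell < j} (-1)^j \omega(X_0, \dots, \nabla_{X_j} X_\ell, \dots, \widehat{X}_j, \dots, X_k) - \sum_{\ell > j} (-1)^j \omega(X_0, \dots, \widehat{X}_j, \dots, \nabla_{X_j} X_\ell, \dots, X_k).$

3. Using Exercise 2 and the expansion of $(\nabla_{X_j}\omega)(X_0, \dots, \widehat{X}_j, \dots, X_k)$ via the derivation
property, show that

(3.41) $(d\omega)(X_0, \dots, X_k) = \sum_{j=0}^{k} (-1)^j (\nabla_{X_j}\omega)(X_0, \dots, \widehat{X}_j, \dots, X_k).$

Note that this generalizes Proposition 3.3.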
4. Prove the identity

$\frac{\partial \log \sqrt{g}}{\partial x_j} = \sum_\ell \Gamma^\ell{}_{\ell j}.$

Use either the identity (3.11), involving the divergence, or the formula (3.9) for $\Gamma^\ell{}_{jk}$.
Which is easier?

5. Show that the characterization (3.17) of a tensor field of type $(k, j)$ is equivalent to the
condition that $F$ be a section of the vector bundle $(\otimes^j T^*) \otimes (\otimes^k T)$ or, equivalently,
of the bundle $\mathrm{Hom}(\otimes^j T, \otimes^k T)$. Think of other variants.

6. The operation $X_j = g_{jk} X^k$ is called lowering indices. It produces a 1-form (section
of $T^*M$) from a vector field (section of $TM$), implementing the isomorphism (3.40).
Similarly, one can raise indices:

$Y^j = g^{jk} Y_k,$

producing a vector field from a 1-form, that is, implementing the inverse isomorphism.
Define more general operations raising and lowering indices, passing from tensor fields
of type $(j, k)$ to other tensor fields, of type $(\ell, m)$, with $\ell + m = j + k$. One says that
these tensor fields are associated to each other via the metric tensor.

7. Using (3.16), show that if $\alpha = a_k(x)\,dx_k$ (summation convention), then
$\nabla_{D_j}\alpha = a_{k;j}\,dx_k$, with

$a_{k;j} = \partial_j a_k - \sum_\ell \Gamma^\ell{}_{kj}\,a_\ell$.

Compare with (3.8). Use this to verify that (3.36) and (3.37) are equal.
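8. Expanding on the previous exercise, work out a corresponding formula for $\nabla_{D_j}T$
when $T$ is a tensor field of type $(j,k)$, as in (3.28). Show that

$T^{\alpha}{}_{\beta;\gamma} = \partial_\gamma T^{\alpha}{}_{\beta} + \Gamma^{\alpha}{}_{i\gamma}T^{i}{}_{\beta} - \Gamma^{i}{}_{\beta\gamma}T^{\alpha}{}_{i}$.

More generally,

$T^{\alpha_1\cdots\alpha_j}{}_{\beta_1\cdots\beta_k;\gamma} = \partial_\gamma T^{\alpha_1\cdots\alpha_j}{}_{\beta_1\cdots\beta_k} + \sum_s T^{\alpha_1\cdots i\cdots\alpha_j}{}_{\beta_1\cdots\beta_k}\,\Gamma^{\alpha_s}{}_{i\gamma}$
$\qquad - \sum_t T^{\alpha_1\cdots\alpha_j}{}_{\beta_1\cdots i\cdots\beta_k}\,\Gamma^{i}{}_{\beta_t\gamma}$,

where in the sum over $s$ the index $i$ replaces $\alpha_s$, and in the sum over $t$ it replaces $\beta_t$.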


9. Using the formula (3.23) for the Hessian, show that, for vector fields $X, Y, Z$ on $M$,

$(\nabla^2_{(X,Y)} - \nabla^2_{(Y,X)})Z = ([\nabla_X,\nabla_Y] - \nabla_{[X,Y]})Z$.

Denoting this by $R(X,Y,Z)$, show that it is linear in each of its three arguments
over the ring $C^\infty(M)$, for example, $R(X,Y,fZ) = fR(X,Y,Z)$ for $f \in C^\infty(M)$.
Discussion of $R(X,Y,Z)$ as the curvature tensor is given in Appendix C, Connections
and Curvature.
10. Verify (3.24). For a function $u$, to show that $\nabla^2_{(X,Y)}u = \nabla^2_{(Y,X)}u$, use the special
case

$\nabla^2_{(X,Y)}u = XYu - (\nabla_XY)\cdot u$

of (3.24). Note that this is an invariant formulation of (3.27). Show that

$\nabla^2_{(X,Y)}u = \frac{1}{2}(\mathcal{L}_Vg)(X,Y), \quad V = \operatorname{grad} u$.


11. Let $\omega$ be the volume form of an oriented Riemannian manifold $M$. Show that $\nabla_X\omega = 0$
for all vector fields $X$.

12. Let $X$ be a vector field on a Riemannian manifold $M$. Show that the formal adjoint of
$\nabla_X$, acting on vector fields, is

(3.42)    $\nabla_X^*Y = -\nabla_XY - (\operatorname{div} X)Y$.

13. Show that the formal adjoint of $\mathcal{L}_X$, acting on vector fields, is

(3.43)    $\mathcal{L}_X^*Y = -\mathcal{L}_XY - (\operatorname{div} X)Y - 2\,\mathrm{Def}(X)Y$,

where $\mathrm{Def}(X)$ is a tensor field of type $(1,1)$, given by

(3.44)    $\frac{1}{2}(\mathcal{L}_Xg)(Z,Y) = g(Z,\mathrm{Def}(X)Y)$,

$g$ being the metric tensor.
14. With div defined by (3.29) for tensor fields, show that

(3.45)    $\operatorname{div}(X\otimes Y) = (\operatorname{div} Y)X + \nabla_YX$.

15. If $X$, $Y$, and $Z$ have compact support, show that

$(Z, \operatorname{div}(X\otimes Y))_{L^2} = -(\nabla_YZ, X)_{L^2}$.




16. If $\gamma(s)$ is a unit-speed geodesic on a Riemannian manifold $M$, $\gamma'(s) = T(s)$, and $X$
is a vector field on $M$, show that

(3.46)    $\dfrac{d}{ds}\langle T(s), X(\gamma(s))\rangle = \frac{1}{2}(\mathcal{L}_Xg)(T,T)$.

Deduce that if $X$ is a Killing field, then $\langle T, X\rangle$ is constant on $\gamma$. Relate this to the
conservation law for geodesic flow on a surface of revolution, discussed in Chap. 1,
§16. (Hint: Show that the left side of (3.46) is equal to $\langle T, \nabla_TX\rangle$.)
17. If we define $\mathrm{Def}: C^\infty(M,T) \to C^\infty(M,S^2T^*)$ by $\mathrm{Def}(X) = (1/2)\mathcal{L}_Xg$, show
that

$\mathrm{Def}^*u = -\operatorname{div} u$,

where $(\operatorname{div} u)^j = u^{jk}{}_{;k}$, as in (3.29).


4. The Laplace operator on a Riemannian manifold 


We define the Laplace operator on a Riemannian manifold $M$, with metric $g_{jk}$, in
a way that naturally generalizes the characterizations of the Laplace operator on
Euclidean space, given by (1.49), (1.62), and (1.63). Taking (1.62) as fundamental,
we define the Laplace operator $\Delta$ on $M$ to be the second-order differential
operator satisfying

(4.1)    $-(\Delta u, v) = (du, dv) = (\operatorname{grad} u, \operatorname{grad} v)$,

for $u, v \in C_0^\infty(M)$. Here the left side is

(4.2)    $-\int_M (\Delta u)\,\overline{v}\,dV$,

where $dV$ is the natural volume element, given in local coordinates by $\sqrt{g}\,dx_1\cdots dx_n$.
The right side of (4.1), for $u$ and $v$ supported in a coordinate patch, is

(4.3)    $\int \langle du, dv\rangle\,dV = \int g^{jk}(\partial_k u)(\partial_j\overline{v})\,g^{1/2}\,dx = -\int \overline{v}\,\partial_j\bigl(g^{1/2}g^{jk}\partial_k u\bigr)\,g^{-1/2}\,g^{1/2}\,dx$,

integrating by parts, so we see that $\Delta$ is given in local coordinates by

(4.4)    $\Delta u = g^{-1/2}\,\partial_j\bigl(g^{jk}g^{1/2}\,\partial_k u\bigr)$.


Soon we will see how to modify (4.1) when $u$ and $v$ do not vanish on $\partial M$, in case
$M$ is a compact Riemannian manifold with boundary.
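As a quick sanity check of the local coordinate formula (4.4), the following sketch evaluates it by central differences for the flat metric written in polar coordinates, $g = dr^2 + r^2\,d\theta^2$, so that $g^{1/2} = r$. The function names, the step size `h`, and the two test functions are illustrative choices, not part of the text; the outputs should be close to 4 for $u = x^2 + y^2$ and close to 0 for the harmonic function $u = x^2 - y^2$.

```python
import math

def laplace_polar(u, r, theta, h=1e-3):
    """Approximate (4.4) for g = dr^2 + r^2 dtheta^2, where sqrt(g) = r."""
    def flux_r(rr):
        # sqrt(g) * g^{rr} * du/dr at (rr, theta); here g^{rr} = 1
        return rr * (u(rr + h, theta) - u(rr - h, theta)) / (2 * h)

    def flux_theta(tt):
        # sqrt(g) * g^{theta theta} * du/dtheta at (r, tt); g^{theta theta} = 1/r^2
        return (1.0 / r) * (u(r, tt + h) - u(r, tt - h)) / (2 * h)

    divergence = ((flux_r(r + h) - flux_r(r - h))
                  + (flux_theta(theta + h) - flux_theta(theta - h))) / (2 * h)
    return divergence / r  # multiply by g^{-1/2} = 1/r


def u_quadratic(r, theta):
    return r * r                         # x^2 + y^2 in polar form

def u_harmonic(r, theta):
    return r * r * math.cos(2 * theta)   # x^2 - y^2 in polar form

print(laplace_polar(u_quadratic, 1.3, 0.7))
print(laplace_polar(u_harmonic, 1.3, 0.7))
```

The same pattern (form the flux $g^{1/2}g^{jk}\partial_k u$, difference it, divide by $g^{1/2}$) applies to any metric for which $g^{jk}$ and $g$ are available in coordinates.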
We now show that (1.63) generalizes, that is, we have

(4.5)    $\Delta u = \operatorname{div}\operatorname{grad} u$.

In fact, in view of the formula

$\operatorname{div} X = g^{-1/2}\,\partial_j(g^{1/2}X^j)$

derived in (2.12), together with

$X^j = g^{jk}\,\partial_k u$, for $X = \operatorname{grad} u$,


we see that (4.5) follows directly from the local coordinate formula (4.4). Note
that the identity

(4.6)    $(X, \operatorname{grad} v)_{L^2} = -(\operatorname{div} X, v)_{L^2}$,

proved in (2.18), when applied to $X = \operatorname{grad} u$, also gives (4.5) directly.
Applying the refinement (2.20) of (4.6) gives us important identities due to
Green. Let us use the notation

(4.7)    $\dfrac{\partial u}{\partial\nu} = \langle \operatorname{grad} u, \nu\rangle$

for the normal component of $\operatorname{grad} u$; $\partial u/\partial\nu$ is called the normal derivative of $u$.
If we exploit (2.20) with $X = \operatorname{grad}\overline{v}$, we get the identity (4.8) below; if we
interchange $u$ and $v$ and subtract the resulting expression from (4.8), we obtain (4.9).
This provides a proof of Green’s identities:


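Proposition 4.1. If $M$ is a compact Riemannian manifold with boundary, then
for $u, v \in C^\infty(\overline{M})$, we have

(4.8)    $-(u, \Delta v) = (du, dv) - \int_{\partial M} u\,\dfrac{\partial\overline{v}}{\partial\nu}\,dS$

and

(4.9)    $(\Delta u, v) - (u, \Delta v) = \int_{\partial M} \Bigl[\dfrac{\partial u}{\partial\nu}\,\overline{v} - u\,\dfrac{\partial\overline{v}}{\partial\nu}\Bigr]\,dS$.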


Next we express the Laplace operator in terms of covariant derivatives. As we
have seen,

$\operatorname{div} X = X^j{}_{;j}$.

If we set $X = \operatorname{grad} u$, we obtain

(4.10)    $\Delta u = g^{jk}u_{;j;k}$,

using the fact that $g^{jk}{}_{;\ell} = 0$. Here, $\sum u_{;j;k}\,dx_k\otimes dx_j$ is a tensor field of type
$(0,2)$, which is the same as $\nabla^2u$. Recall that $\nabla^2F$ is a tensor field of type $(j,k+2)$
whenever $F$ is a tensor field of type $(j,k)$. The formula (4.10) can be rewritten as

(4.11)    $\Delta u = \operatorname{Tr}_g\nabla^2u$,

where $\operatorname{Tr}_g$ denotes the trace of $\nabla^2u(x)$, as a quadratic form on $T_xM$, in terms of
the quadratic form given by the metric tensor $g$. In other words, we can define a
tensor field $H(u)$, of type $(1,1)$, by

(4.12)    $\langle H(u)X, Y\rangle = (\nabla^2u)(X,Y)$,

and $\operatorname{Tr}_g\nabla^2u = \operatorname{Tr} H(u)$.

Since the Laplace operator is defined in a coordinate-independent manner on
a Riemannian manifold, it is clear that if $F : M \to M$ is a diffeomorphism and
$F^* : C^\infty(M) \to C^\infty(M)$ is defined by $F^*u(x) = u(F(x))$, then $F^*$ commutes
with the Laplace operator provided $F$ is an isometry. Thus, if $X$ is a vector field
on $M$, $X$ commutes with $\Delta$ provided the flow $F_X^t$ generated by $X$ consists of
isometries. This result has a converse.


Proposition 4.2. A vector field $X$ commutes with $\Delta$ if and only if $X$ generates a
group of isometries.


The proof rests on a computation of independent interest. In fact, a manipulation
of (4.10), which we leave to the reader, yields the general identity

(4.13)    $[\Delta, X]u = (X^{j;k} + X^{k;j})\,u_{;j;k} + (X^{j;k} + X^{k;j})_{;j}\,u_{;k}$
$\qquad\qquad = g^{-1/2}\,\partial_j\bigl(g^{1/2}(X^{j;k} + X^{k;j})\,\partial_k u\bigr)$.

Thus $[\Delta, X] = 0$ if and only if $X^{j;k} + X^{k;j} = 0$, which is equivalent to the
condition (3.35) for a Killing field.


Exercises 


1. If $u \in C^\infty(M)$, $X = \operatorname{grad} u$, the condition that $X$ generates a volume-preserving flow
is that $\Delta u = 0$. What PDE on $u$ is equivalent to the statement that $X$ is a Killing field?
2. Verify formula (4.13) for $[\Delta, X]$. Show that it has the invariant formulation

(4.14)    $\frac{1}{2}[\Delta, X]u = \langle \mathrm{Def}(X), \nabla^2u\rangle + \langle \operatorname{div}\mathrm{Def}(X), du\rangle = \operatorname{div}(\mathrm{Def}(X)\cdot du)$,

in terms of the deformation tensor $\mathrm{Def}(X)$, with components $(1/2)(X^{j;k} + X^{k;j})$, that
is, the type $(2,0)$ analogue of the tensor field of type $(1,1)$ given by (3.42), or the tensor
field of type $(0,2)$ equal to half of (3.35).

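3. Show that the Laplace operator $\Delta = \partial^2/\partial x_1^2 + \cdots + \partial^2/\partial x_n^2$ on $\mathbb{R}^n$ has the following
expressions in various coordinate systems:
(a) Polar coordinates on $\mathbb{R}^2$: $x_1 = r\cos\theta$, $x_2 = r\sin\theta$.

(4.15)    $\Delta = \dfrac{\partial^2}{\partial r^2} + \dfrac{1}{r}\dfrac{\partial}{\partial r} + \dfrac{1}{r^2}\dfrac{\partial^2}{\partial\theta^2}$.

(b) Spherical polar coordinates on $\mathbb{R}^3$: $x_1 = \rho\sin\varphi\sin\theta$, $x_2 = \rho\sin\varphi\cos\theta$,
$x_3 = \rho\cos\varphi$.

(4.16)    $\Delta = \dfrac{\partial^2}{\partial\rho^2} + \dfrac{2}{\rho}\dfrac{\partial}{\partial\rho} + \dfrac{1}{\rho^2\sin^2\varphi}\dfrac{\partial^2}{\partial\theta^2} + \dfrac{1}{\rho^2\sin\varphi}\Bigl(\sin\varphi\,\dfrac{\partial^2}{\partial\varphi^2} + \cos\varphi\,\dfrac{\partial}{\partial\varphi}\Bigr)$.

(c) Spherical polar coordinates on $\mathbb{R}^n$: $x = r\omega$, $\omega \in S^{n-1}$.

(4.17)    $\Delta = \dfrac{\partial^2}{\partial r^2} + \dfrac{n-1}{r}\dfrac{\partial}{\partial r} + \dfrac{1}{r^2}\Delta_S$,

where $\Delta_S$ is the Laplace operator on the unit sphere $S^{n-1}$. (Compare (4.19) below.)
(Hint: Express the Euclidean metric tensor $ds^2 = dx_1^2 + \cdots + dx_n^2$ in these coordinates.)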

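4. Let $N$ be a Riemannian manifold, of dimension $n - 1$. Denote by $C(N)$ the cone with
base $N$, that is, the space $\mathbb{R}^+ \times N$, with Riemannian metric

(4.18)    $g = dr^2 + r^2g_N$.

Show that the Laplace operator on $C(N)$ is of the form

(4.19)    $\Delta = \dfrac{\partial^2}{\partial r^2} + \dfrac{n-1}{r}\dfrac{\partial}{\partial r} + \dfrac{1}{r^2}\Delta_N$,

where $\Delta_N$ is the Laplace operator on the base $N$. Apply this to the expression of the
Laplace operator $\Delta$ on $\mathbb{R}^n$, in polar coordinates, with $N = S^{n-1}$.
5. Show that, in local coordinates,

$\Delta u = g^{jk}\,\partial_j\partial_k u - g^{jk}\,\Gamma^\ell{}_{jk}\,\partial_\ell u$.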


5. The wave equation on a product manifold 
and energy conservation 


The analysis of vibrating membranes in Euclidean space has important extensions 
to studies of vibrating manifolds. We will start with a fairly general situation, 
specializing quickly to models that give rise to “the wave equation” 


(5.1)    $\dfrac{\partial^2u}{\partial t^2} - \Delta u = 0$,


for $u = u(t,x)$, a scalar function on $\mathbb{R}\times M$, where $\Delta$ is the Laplace operator on
$M$ defined in §4.

We consider vibrations of one manifold $M$ within another, $N$. Suppose these
manifolds are endowed with Riemannian metric tensors $g$ and $h$, respectively. The
vibration is described by a map

(5.2)    $u : \mathbb{R}\times M \to N$.

In §1 we dealt with the special case where $M$ is a bounded region in $\mathbb{R}^n$ and
$N = \mathbb{R}^k$. Now we allow $M$ to be a compact manifold with boundary. We again
use a stationary action principle to produce equations governing the vibration. The
appropriate expression for “kinetic energy” is


(5.3)    $T(t) = \frac{1}{2}\int_M m(x)\,|u_t(t,x)|^2\,dV$,

where $dV$ is the natural volume element on $M$ and $m(x) > 0$ is a given “mass
density.” The velocity $u_t(t,x)$ takes values in $T_yN$, with $y = u(t,x)$, and the
square-norm in the integrand in (5.3) is given by the metric tensor $h$;

(5.4)    $|u_t|^2 = h(u; u_t, u_t)$

if $h(y; v, w)$ denotes the inner product of $v$ and $w$ in $T_yN$.


The form that we will consider for the potential energy is the following
generalization of (1.3):

(5.5)    $V(t) = \int_M f(x, u(t,x), u_x(t,x))\,dV$,

where

(5.6)    $u_x(t,x) \in \mathcal{L}(T_xM, T_yN)$,

and $f$ is a smooth, real-valued function defined on the bundle $\mathcal{L}$ over $M\times N$ with
fiber over $(x,y)$ given by $\mathcal{L}(T_xM, T_yN)$:

(5.7)    $f = f(x, y, A), \quad A \in \mathcal{L}(T_xM, T_yN)$.


In particular, one has examples analogous to (1.42), that is, 


where $A^* \in \mathcal{L}(T_yN, T_xM)$ is the adjoint of $A$, defined using the inner products
on $T_xM$ and $T_yN$ defined by their Riemannian metrics. The $q_j(A^*A)$ are
defined as described below (1.42). Many interesting cases of this sort arise naturally,
including

(5.9)    $f(x, y, A) = \operatorname{Tr} A^*A$.
Applying the stationary action principle will yield for $u$ a second-order system
of PDE of a form that generalizes (1.12). We look here at the details for a
special case.

Namely, take $N = \mathbb{R}$, and suppose $f(x, y, A)$ is independent of $y \in \mathbb{R}$. In
other words, we consider a potential energy of the form

(5.10)    $V(t) = \int_M f(x, u_x(t,x))\,dV$,

where $u_x(t,x) \in T_x^*M$ and $f = f(x,\xi)$ is a smooth, real-valued function defined
on $T^*M$, or perhaps on some open subset. In that case, the stationary condition for
$(J_0 - J_1)(u) = \int_{t_0}^{t_1} [T(t) - V(t)]\,dt$ is derived from the following calculations.
First, as in (1.8),

(5.11)    $\dfrac{d}{ds}J_0(u + sv)\Big|_{s=0} = -\iint m\,u_{tt}\,v\,dV\,dt$,

provided $v \in C_0^\infty(I\times\mathring{M})$, $I = (t_0, t_1)$. Here $\mathring{M}$ denotes the interior of $M$.
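Furthermore, for such $v$,

(5.12)    $\dfrac{d}{ds}J_1(u + sv)\Big|_{s=0} = \iint f_\xi(x, u_x)\cdot v_x\,dV\,dt$,

where, in local coordinates,

(5.13)    $f_\xi(x, u_x)\cdot v_x = \sum_j \dfrac{\partial f}{\partial\xi_j}\,\dfrac{\partial v}{\partial x_j}$.

If $v$ is supported in a coordinate patch, in which $dV = \sqrt{g}\,dx$, we can integrate
by parts and write

(5.14)    $\dfrac{d}{ds}J_1(u + sv)\Big|_{s=0} = -\iint v\,g^{-1/2}\,\partial_{x_j}\bigl(g^{1/2}f_{\xi_j}(x, u_x)\bigr)\,\sqrt{g}\,dx\,dt$.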


Thus we get the following PDE for $u$, in a local coordinate system:

(5.15)    $m\,u_{tt} - g^{-1/2}\,\partial_{x_j}\bigl(g^{1/2}f_{\xi_j}(x, u_x)\bigr) = 0$,

using the summation convention. Written out more fully, this is

(5.16)    $m\,u_{tt} - \Bigl[f_{\xi_j\xi_k}(x, u_x)\,u_{x_jx_k} + f_{\xi_jx_j}(x, u_x) + \frac{1}{2}g^{-1}(\partial_{x_j}g)\,f_{\xi_j}(x, u_x)\Bigr] = 0$.


An invariant formulation of this PDE is given in the exercises. 
The choice of $f(x,\xi)$ that produces a wave equation of the form (5.1) is that
of a constant times the Riemannian metric on covariant vectors:

(5.17)    $f(x,\xi) = \sigma\,g(x;\xi,\xi) = \sigma\,g^{jk}(x)\,\xi_j\xi_k$,

with $\sigma$ a positive constant. In that case, (5.15) becomes

(5.18)    $m\,u_{tt} - 2\sigma\Delta u = 0$

in view of the local coordinate formula

(5.19)    $\Delta u = g^{-1/2}\,\partial_j\bigl(g^{jk}g^{1/2}\,\partial_k u\bigr)$

derived in §4. If $m$ is a constant, this is of the form (5.1) provided $2\sigma = m$, which
could be arranged by a rescaling of the $t$-variable.
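For the reader who wants the intermediate step from (5.15) to (5.18) spelled out, here is a short worked computation. It uses only the symmetry of $g^{jk}$ and the formulas (5.15), (5.17), and (5.19); nothing beyond the text is assumed.

```latex
f(x,\xi) = \sigma\, g^{jk}(x)\,\xi_j \xi_k
\;\Longrightarrow\;
f_{\xi_j}(x,\xi) = 2\sigma\, g^{jk}(x)\,\xi_k ,
\qquad\text{so with } \xi = u_x :
\quad
g^{-1/2}\,\partial_{x_j}\bigl( g^{1/2} f_{\xi_j}(x,u_x) \bigr)
  = 2\sigma\, g^{-1/2}\,\partial_j \bigl( g^{1/2} g^{jk}\,\partial_k u \bigr)
  = 2\sigma\,\Delta u .
```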

Other choices of $f(x,\xi)$ arise naturally in the study of vibrating membranes,
choices that lead to nonlinear PDE. We will return to this in Chap. 16, but for now
we concentrate on the linear case (5.18), until the very end of this section where
we make a few brief comments on nonlinear problems.

Let us redo the calculation of the variation of $J_1(u)$ in an invariant fashion,
when $f(x,\xi)$ is given by (5.17), so

(5.20)    $J_1(u) = \sigma\iint_{I\times M} |d_xu|^2\,dV\,dt$.

We have, for $v \in C_0^\infty(I\times\mathring{M})$,

(5.21)    $\dfrac{d}{ds}J_1(u + sv)\Big|_{s=0} = 2\sigma\iint \langle d_xu, d_xv\rangle\,dV\,dt$,

and Green’s formula (4.8) shows that this is equal to

(5.22)    $-2\sigma\iint (\Delta u)\,v\,dV\,dt$,

since the boundary integral vanishes in this case. Again the stationary condition
for $(J_0 - J_1)(u)$ is seen to be the wave equation (5.18).
As in (1.26), it is typical to specify initial conditions, of the form

(5.23)    $u(0,x) = f(x), \quad u_t(0,x) = g(x)$.

If $\partial M \ne \emptyset$, we also need to specify a boundary condition for $u$. One typical
condition is

(5.24)    $u(t,x) = 0$, for $x \in \partial M$.
This is known as the Dirichlet boundary condition for $u$. It models a vibrating
drum head that is firmly attached to its boundary. Tying down the boundary
provides a justification for considering only variations $v$ that vanish on $I\times\partial M$ in the
specification of the stationary condition above. Another natural physical problem
is to describe vibrations of $M$ when the boundary is allowed to move freely. Then
we should allow any $v \in C^\infty(I\times\overline{M})$ that vanishes at $t = t_0$ and $t = t_1$, as a
variation. The formula (5.11) for the variation of $J_0(u)$ continues to hold, and so
does (5.21), but an application of Green’s formula to (5.21) now yields

(5.25)    $\dfrac{d}{ds}J_1(u + sv)\Big|_{s=0} = -2\sigma\iint_{I\times M} (\Delta u)\,v\,dV\,dt + 2\sigma\iint_{I\times\partial M} v\,\dfrac{\partial u}{\partial\nu}\,dS\,dt$.


If we do apply this to the subclass of $v \in C_0^\infty(I\times\mathring{M})$, we see that the wave
equation (5.18) must still be satisfied for $u$ to be a stationary point. Now, granted
that $u$ satisfies (5.18), we hence have

(5.26)    $\dfrac{d}{ds}(J_0 - J_1)(u + sv)\Big|_{s=0} = -2\sigma\iint_{I\times\partial M} v\,\dfrac{\partial u}{\partial\nu}\,dS\,dt$,

for all $v \in C^\infty(I\times\overline{M})$ that vanish at $t = t_0$ and at $t = t_1$. This yields the
following boundary condition for freely vibrating $M$:

(5.27)    $\dfrac{\partial u}{\partial\nu} = 0$, for $x \in \partial M$.


This is known as the Neumann boundary condition for u. Another situation it 
models is the propagation of small-amplitude sound waves in a region bounded 
by a hard wall. 

Since we have introduced the kinetic energy and the potential energy, we
should look at the total energy. In the case when (5.17) gives the potential energy,
if we take $m = 1$ and $\sigma = 1/2$, the total energy is

(5.28)    $E(t) = \frac{1}{2}\int_M \bigl[|u_t(t,x)|^2 + \langle d_xu, d_xu\rangle\bigr]\,dV(x)$.

We aim to establish the energy conservation law

(5.29)    $E(t) = \text{const.}$

whenever $u$ is a sufficiently smooth solution to the wave equation (5.1), assuming
that $u$ satisfies either the Dirichlet condition (5.24) or the Neumann condition
(5.27) on $\partial M$. In fact, we have

(5.30)    $\dfrac{dE}{dt} = \int_M \bigl[u_tu_{tt} + \langle d_xu, d_xu_t\rangle\bigr]\,dV$.
M 
We want to factor $u_t$ out of the integrand, so we integrate by parts the last term in
(5.30), using Green’s identity to get

(5.31)    $\dfrac{dE}{dt} = \int_M u_t\,(u_{tt} - \Delta u)\,dV + \int_{\partial M} u_t\,\dfrac{\partial u}{\partial\nu}\,dS$.

The right side of (5.31) vanishes provided $u$ satisfies the wave equation and either
the Dirichlet or Neumann boundary condition. This proves the energy conservation
law (5.29), equivalent to

(5.32)    $\int_M \bigl[|u_t(t,x)|^2 + \langle d_xu, d_xu\rangle\bigr]\,dV = \int_M \bigl[|g(x)|^2 + \langle d_xf, d_xf\rangle\bigr]\,dV$,

given the initial conditions (5.23).
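The conservation law (5.29) has a well-known discrete counterpart, which the following sketch illustrates for $u_{tt} - u_{xx} = 0$ on $M = [0,1]$ with the Dirichlet condition. Everything here is an illustrative assumption rather than part of the text: the leapfrog discretization, the grid sizes, the initial data $f(x) = \sin\pi x$, $g = 0$, and the particular discrete energy (kinetic term at the half step plus the product of the two neighboring discrete gradients), which the leapfrog scheme conserves up to rounding.

```python
import math

def simulate_energy(n=200, steps=400, cfl=0.5):
    """Leapfrog for u_tt = u_xx on [0,1], u = 0 at both ends; returns discrete energies."""
    dx = 1.0 / n
    dt = cfl * dx
    u_prev = [math.sin(math.pi * j * dx) for j in range(n + 1)]  # u(0, x) = f(x)
    u_prev[0] = u_prev[n] = 0.0                                  # Dirichlet condition
    # First time step from a Taylor expansion, using u_t(0, x) = g(x) = 0.
    u = u_prev[:]
    for j in range(1, n):
        u[j] = u_prev[j] + 0.5 * cfl**2 * (u_prev[j + 1] - 2 * u_prev[j] + u_prev[j - 1])
    energies = []
    for _ in range(steps):
        u_next = [0.0] * (n + 1)
        for j in range(1, n):
            u_next[j] = 2 * u[j] - u_prev[j] + cfl**2 * (u[j + 1] - 2 * u[j] + u[j - 1])
        # Discrete analogue of (5.28), evaluated between the levels u and u_next.
        kinetic = sum(((u_next[j] - u[j]) / dt) ** 2 for j in range(n + 1))
        potential = sum(((u_next[j + 1] - u_next[j]) / dx) * ((u[j + 1] - u[j]) / dx)
                        for j in range(n))
        energies.append(0.5 * dx * (kinetic + potential))
        u_prev, u = u, u_next
    return energies

energies = simulate_energy()
print(min(energies), max(energies), math.pi**2 / 4)
```

For these data the continuum energy is $\pi^2/4$, and the printed minimum and maximum should agree with each other to rounding error and with $\pi^2/4$ to within the discretization error.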

We continue briefly the discussion of stationary problems from the end of §1.
These problems do not involve $t$-dependence, that is, they arise via describing
critical points for a function

(5.33)    $J(u) = \int_M f(x, u(x), u_x(x))\,dV$,

with

(5.34)    $f = f(x, y, A), \quad A \in \mathcal{L}(T_xM, T_yN)$.

If $N = \mathbb{R}$ and $f(x, y, \xi) = f(x, \xi)$ is given by (5.17), then the PDE obtained as
the stationary condition for $J(u)$ is

(5.35)    $\Delta u = 0$,

involving the Laplace operator (5.19). A typical boundary condition is the
nonhomogeneous Dirichlet condition

(5.36)    $u = \psi$ on $\partial M$.

Another is the nonhomogeneous Neumann condition

(5.37)    $\dfrac{\partial u}{\partial\nu} = \varphi$ on $\partial M$.

These will be studied in Chap. 5.
There are also very important nonlinear problems arising from the problem of
finding stationary points, particularly extrema, of (5.33). We mention in particular
the choice (5.9) for $f(x, y, A)$, namely, $\operatorname{Tr} A^*A$. Maps $u : M \to N$ critical
for such $J(u)$ are called harmonic maps. In case $N = \mathbb{R}^k$, these are just functions
whose components are harmonic in the sense of (5.35), but for a nonflat
Riemannian manifold $N$, one gets a nonlinear problem. For example, as seen in
Chap. 1, for $M = I \subset \mathbb{R}$, one gets the geodesic equation. Harmonic maps will
be studied in Chap. 14, by variational methods, and in Chap. 15, via techniques
involving nonlinear parabolic PDE.


Exercises 


1. For $J_1(u) = \int_M f(x, u_x)\,dV$ as in (5.10), $f : T^*M \to \mathbb{R}$, demonstrate the invariant
formula

$\dfrac{d}{ds}J_1(u + sv)\Big|_{s=0} = \int_M \langle \Lambda_f(x, u_x), v_x\rangle\,dV$,

where $\Lambda_f : T^*M \to TM$ is given by

(5.38)    $\Lambda_f(x,\xi) = D\pi(x,\xi)H_f$,

$H_f$ being the Hamiltonian vector field of $f$, and $\pi : T^*M \to M$ the natural projection.
For fixed $t$, $u_x = d_xu$ is a 1-form on $M$. Consequently, $\Lambda_f(x, u_x)$ is a vector field on
$M$.

2. In the context of Exercise 1, show that the resulting PDE (5.15) has the invariant
description

(5.39)    $m\,u_{tt} - \operatorname{div}\Lambda_f(x, u_x) = 0$.

Compare (1.72).
3. Show that (under an appropriate nondegeneracy hypothesis) maps of the form $\Lambda_f$ invert
Legendre transformations $\Lambda : TM \to T^*M$, discussed in §12 of Chap. 1.
(Hint: Using (12.9)–(12.18), consider the Legendre transform associated to the function
$F(x, v)$ on $TM$ defined implicitly by

$F(x, f_\xi(x,\xi)) = f(x,\xi) - \xi\cdot f_\xi(x,\xi)$

or, in the notation used above,

$F(\Lambda_f(x,\xi)) = f(x,\xi) - \langle \Lambda_f(x,\xi), \xi\rangle$.)


6. Uniqueness and finite propagation speed 

We study some properties of solutions to the wave equation on $\mathbb{R}\times M$:

(6.1)    $u_{tt} - \Delta u = 0$,

with initial conditions

(6.2)    $u(0,x) = f(x), \quad u_t(0,x) = g(x)$,

and boundary condition either the Dirichlet condition or the Neumann condition,
if $\partial M \ne \emptyset$. We leave aside for the present the issue of the existence of solutions,
for arbitrarily given $f$ and $g$. We examine the uniqueness; $u$ is assumed sufficiently
smooth. If $u_1$ and $u_2$ solve (6.1) with initial data $f_j$, $g_j$, then $u_1 - u_2$ solves (6.1)
with initial data $f = f_1 - f_2$, $g = g_1 - g_2$. To establish uniqueness, it suffices
to show that if $f = g = 0$, then the solution $u = 0$ for all $t$. But by energy
conservation, we have, for all $t$,

(6.3)    $\int_M \bigl[u_t^2 + \langle d_xu, d_xu\rangle\bigr]\,dV = \int_M \bigl[g^2 + \langle d_xf, d_xf\rangle\bigr]\,dV = 0$.

Thus $u$ is constant. Since $u(0,x) = 0$, we conclude that $u = 0$ everywhere. This
establishes uniqueness.

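A closer look at how Green’s formula enters into this argument will produce
both a generalization of the notion of energy conservation and a localization of this
uniqueness theorem to a result implying finite propagation speed for solutions to
the wave equation. Note that the identity (5.31) can be written as

(6.4)    $\int_M u_t\,(u_{tt} - \Delta u)\,dV = \dfrac{1}{2}\dfrac{d}{dt}\int_M \bigl[u_t^2 + |d_xu|^2\bigr]\,dV - \int_{\partial M} u_t\,\dfrac{\partial u}{\partial\nu}\,dS$.

In particular, for $u$ satisfying either the Dirichlet or Neumann condition on $\partial M$,
with $\Omega = [t_1, t_2]\times M$, we have

(6.5)    $\int_\Omega u_t\,(u_{tt} - \Delta u)\,dV\,dt = \dfrac{1}{2}\int_{\{t=t_2\}} \bigl[|u_t|^2 + |d_xu|^2\bigr]\,dV - \dfrac{1}{2}\int_{\{t=t_1\}} \bigl[|u_t|^2 + |d_xu|^2\bigr]\,dV$.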


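Next we want to look at the left side of (6.5) when $\Omega$ is a more general sort of
region in $\mathbb{R}\times M$ than a product region $[t_1, t_2]\times M$.

First, we assume for simplicity that $\Omega$ does not intersect $\mathbb{R}\times\partial M$. We suppose
$\partial\Omega$ consists of two smooth surfaces, $\Sigma_1$ and $\Sigma_2$, as indicated in Fig. 6.1. We
denote by $\Omega_t$ the intersection of $\Omega$ with $\{t\}\times M \subset \mathbb{R}\times M$. Now, making use of
formula (2.19), we have

(6.6)    $\int_\Omega u_t\,(u_{tt} - \Delta u)\,dV\,dt = \dfrac{1}{2}\int_\Omega \dfrac{\partial}{\partial t}(u_t^2)\,dV\,dt + \int_\Omega \langle d_xu_t, d_xu\rangle\,dV\,dt - \int_\Omega \operatorname{div}(u_t\operatorname{grad}_xu)\,dV\,dt$.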
FIGURE 6.1 Spacelike Bounded Region ($\Sigma_1\cup\Sigma_2 = \partial\Omega$)


Note that

(6.7)    $\langle d_xu_t, d_xu\rangle = \dfrac{1}{2}\dfrac{\partial}{\partial t}\langle d_xu, d_xu\rangle$.


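Applying the fundamental theorem of calculus to the first two integrals on the
right side of (6.6), and the divergence theorem to the last integral, we get

(6.8)    $\int_\Omega u_t\,(u_{tt} - \Delta u)\,dV\,dt = \dfrac{1}{2}\int_{\partial\Omega} \bigl[u_t^2 + \langle d_xu, d_xu\rangle\bigr]\,\omega - \iint_{\partial\Omega_t} u_t\,\dfrac{\partial u}{\partial\nu_x}\,dS_t\,dt$.

Both integrals on the right side of (6.8) are integrals over $\partial\Omega$. Here $\omega$ is the volume
form on $M$, thought of as an $n$-form on $\mathbb{R}\times M$, pulled back to $\partial\Omega$, and $dS_t$ is
the natural surface measure on $\partial\Omega_t$, thought of as a surface in $M$. We want to
express both $\omega$ and $dS_t\,dt$ in terms of the natural surface measure $dS$ on $\partial\Omega$, induced
from the inclusion $\partial\Omega \subset \mathbb{R}\times M$, endowed with the natural product Riemannian
metric. Indeed, we easily obtain

(6.9)    $\omega = N_t\,dS, \quad dS_t\,dt = |N_x|\,dS$,

where $N = (N_t, N_x)$ is the outward unit normal to $\partial\Omega \subset \mathbb{R}\times M$. Hence (6.8)
becomes

(6.10)    $\int_\Omega u_t\,(u_{tt} - \Delta u)\,dV\,dt = \dfrac{1}{2}\int_{\partial\Omega} \Bigl\{\bigl[u_t^2 + |d_xu|^2\bigr]N_t - 2u_t\,\dfrac{\partial u}{\partial\nu_x}\,|N_x|\Bigr\}\,dS$.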
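Thus, if $u$ satisfies the wave equation in $\Omega$, we see that

(6.11)    $\int_{\Sigma_2} \Bigl\{\bigl[u_t^2 + |d_xu|^2\bigr]|N_t| - 2u_t\,\dfrac{\partial u}{\partial\nu_x}\,|N_x|\Bigr\}\,dS = \int_{\Sigma_1} \Bigl\{\bigl[u_t^2 + |d_xu|^2\bigr]|N_t| + 2u_t\,\dfrac{\partial u}{\partial\nu_x}\,|N_x|\Bigr\}\,dS$.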
M1 
FIGURE 6.2 Spacelike Sweep (surfaces $\Sigma_2(s)$ sweeping out $\Omega$ from $\Sigma_1$)


This is a useful “energy identity” provided the integrands are positive-definite
quadratic forms in $du = (u_t, d_xu)$. Note that Cauchy’s inequality implies

(6.12)    $2\Bigl|u_t\,\dfrac{\partial u}{\partial\nu_x}\Bigr| \le u_t^2 + |d_xu|^2$.

Thus the integrands have the desired property, provided

(6.13)    $|N_x| < |N_t|$.


Definition. A surface $\Sigma \subset \mathbb{R}\times M$ is called spacelike provided its normal $N =
(N_t, N_x)$ satisfies (6.13). A vector satisfying (6.13) is called timelike.
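To make the role of (6.13) explicit, the following short estimate combines (6.12) with the form of the integrands in (6.11). It is a sketch of the step the text summarizes as "the integrands have the desired property"; it uses only $|\partial u/\partial\nu_x| \le |d_xu|$ and $2|ab| \le a^2 + b^2$.

```latex
\bigl[u_t^2 + |d_x u|^2\bigr]\,|N_t| \;\pm\; 2\,u_t\,\frac{\partial u}{\partial \nu_x}\,|N_x|
\;\ge\; \bigl[u_t^2 + |d_x u|^2\bigr]\,|N_t| - \bigl[u_t^2 + |d_x u|^2\bigr]\,|N_x|
\;=\; \bigl(|N_t| - |N_x|\bigr)\bigl(u_t^2 + |d_x u|^2\bigr),
```

which is strictly positive for $du \ne 0$ exactly when $|N_x| < |N_t|$.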


Clearly any surface $t = \text{const.}$ is spacelike, as is a small perturbation of such a
surface. Suppose $\Omega \subset \mathbb{R}\times M$ is bounded by spacelike surfaces $\Sigma_1$ and $\Sigma_2$ and
furthermore is swept out by spacelike surfaces $\Sigma_2(s)$, as in Fig. 6.2. We call $\Omega$ a
domain of influence for its lower boundary $\Sigma_1$.


Theorem 6.1. Suppose $\Omega \subset \mathbb{R}\times M$ is a domain of influence for its lower boundary
$\Sigma_1$. If $u$ solves the wave equation $u_{tt} - \Delta u = 0$ on $\mathbb{R}\times M$, and if $u$ and
$du = (u_t, d_xu)$ vanish on $\Sigma_1$, then $u$ vanishes throughout $\Omega$.


Proof. The energy identity implies that $du$ vanishes on each $\Sigma_2(s)$; hence $du$
vanishes on $\Omega$, so $u$ is constant on $\Omega$. Since $u = 0$ on $\Sigma_1$, this constant is 0.


One interpretation of this theorem is that it shows that signals propagate at
speed at most 1. In other words, in the special case $\Sigma_1 = \{t = 0\}$, if $u(0,x) =
f(x)$ and $u_t(0,x) = g(x)$ vanish on some open set $\mathcal{O} \subset M$, then the solution to
the wave equation vanishes on $\{(t,x) : x \in \mathcal{O},\ \operatorname{dist}(x, \partial\mathcal{O}) > |t|\}$.
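A one-dimensional illustration of this statement: on $M = \mathbb{R}$ with $g = 0$, d'Alembert's formula gives $u(t,x) = \frac12[f(x+t) + f(x-t)]$. The sketch below takes a bump function $f$ supported in $[-1,1]$ (an arbitrary illustrative choice, as are the function names), so the data vanish on $\mathcal{O} = \{|x| > 1\}$, and the solution should vanish wherever $|x| - 1 > |t|$.

```python
import math

def bump(x):
    """Smooth initial displacement f, supported in [-1, 1]."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def u(t, x):
    """d'Alembert solution of u_tt - u_xx = 0 with u(0,x) = bump(x), u_t(0,x) = 0."""
    return 0.5 * (bump(x + t) + bump(x - t))

t = 2.0
for x in (3.5, 3.01, 2.5, 0.0, -2.5, -3.5):
    outside_cone = abs(x) - 1.0 > abs(t)   # x in O with dist(x, boundary of O) > |t|
    print(x, u(t, x), "must vanish" if outside_cone else "may be nonzero")
```

At $t = 2$ the signal has reached $x = \pm 2.5$ but not $x = \pm 3.01$ or $\pm 3.5$, consistent with propagation speed at most 1.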

A slight variation of the argument above treats the case when $\partial\Omega$ consists of
three parts, $\Sigma_1$ and $\Sigma_2$, both spacelike as above, and a part in $\mathbb{R}\times\partial M$, provided
the solution $u$ to $u_{tt} - \Delta u = 0$ satisfies the Dirichlet or Neumann boundary
condition.
Exercises


1. Use (1.24)–(1.28) to write out the explicit solution to the initial value problem (6.1)–
(6.2) in case $\Delta = \partial^2/\partial x^2$ on $\mathbb{R}$, and explicitly observe finite propagation speed in this
case.

2. Extend the finite propagation speed argument of Theorem 6.1 to the case where $M$ has
a boundary, on which either the Dirichlet or Neumann boundary condition is imposed.

3. Consider the equations of linear elasticity, derived in (1.59), $Lu = 0$, where

$Lu = m\,u_{tt} - \mu\Delta u - (\lambda + \mu)\operatorname{grad}\operatorname{div} u$.

Suppose $\mu > 0$, $\lambda + 2\mu > 0$, $m > 0$. For each $(t,x) \in \mathbb{R}\times M$, $u(t,x) \in T_xM$. Take
$M = \mathbb{R}^n$. Let $\Omega$ be a region in $\mathbb{R}\times M$ of the form depicted in Fig. 6.1. Perform an
integration by parts of

$\int_\Omega u_t\cdot Lu\,dV\,dt$,

along the lines of (6.6)–(6.10), to derive an identity similar to (6.11). What geometrical
conditions should be placed on $\Sigma_1$ and $\Sigma_2$, replacing the “spacelike” condition (6.13),
in order to ensure that the resulting integrands are positive-definite quadratic forms in
$\nabla u = (u_t, \nabla_xu)$? Derive a finite propagation speed result.


7. Lorentz manifolds and stress-energy tensors 


The analysis of the wave equation in the last section made strong use of the fact
that we were working with $\partial^2/\partial t^2 - \Delta$ on a product $\mathbb{R}\times M$. We will take a deeper
look at the notion of energy, which will produce concepts that are important in the
study of the wave equation on more general Lorentz manifolds.

For starters, we will stick with the product case $\mathbb{R}\times M$, $M$ a Riemannian
manifold. This has a natural structure of a Lorentz manifold, with metric

(7.1)    $h = -dt^2 + g$.

Contrast this with the Riemannian metric $dt^2 + g$ on $\mathbb{R}\times M$ we considered in the
last section. In coordinates, $h_{jk}$ has the form

(7.2)    $(h_{jk}) = \begin{pmatrix} -1 & \\ & g_{jk} \end{pmatrix}$.


The stress-energy tensor $T$ associated with $u$ is supposed to be a symmetric,
second order tensor such that, if $Z$ is a unit timelike vector (representing the
“world line” of an observer), then $T(Z,Z)$ gives the observed energy density. The
energy density $(1/2)u_t^2 + (1/2)\langle d_xu, d_xu\rangle$ encountered before specifies

(7.3)    $T_{00} = \frac{1}{2}u_t^2 + \frac{1}{2}\langle d_xu, d_xu\rangle = u_t^2 + \frac{1}{2}\langle du, du\rangle$,

where

(7.4)    $\langle du, du\rangle = h^{jk}\,\partial_ju\,\partial_ku$

is the Lorentz square-length of $du$. If we expect that $T$ is constructed in a “natural”
manner from $du$ and the metric tensor $h$, we are led to require

$T(Z,Z) = \langle Z, du\rangle^2 + \frac{1}{2}\langle du, du\rangle$ whenever $\langle Z, Z\rangle = -1$.

If $\langle Z, Z\rangle = -z^2$, this leads to $T(Z,Z) = \langle Z, du\rangle^2 - (1/2)\langle du, du\rangle\langle Z, Z\rangle$, and
polarizing this identity gives

(7.5)    $T(Z,W) = \langle Z, du\rangle\langle W, du\rangle - \frac{1}{2}\langle du, du\rangle\langle Z, W\rangle$.

This should hold for all vectors $Z$, $W$. Equivalently, we write

(7.6)    $T = du\otimes du - \frac{1}{2}\langle du, du\rangle\,h$.

We call (7.6) the stress-energy tensor associated to a wave $u = u(t,x)$. See the
exercises for more on the construction of $T$.

More generally, let $\Omega$ be any Lorentz manifold, with metric tensor, of signature
$(n,1)$, denoted $h$. The “Laplacian” in this metric is defined by

(7.7)    $\Box u = |h|^{-1/2}\,\partial_j\bigl(h^{jk}|h|^{1/2}\,\partial_ku\bigr) = h^{jk}u_{;j;k}$,

in analogy with the formula for the Laplace operator on a Riemannian manifold.
Here, $|h| = |\det(h_{jk})|$. The wave equation on a general Lorentz manifold is

(7.8)    $\Box u = 0$.

In this more general context, it is still meaningful to assign to $u$ the tensor $T$,
defined by (7.5) and (7.6). We continue to call $T$ the stress-energy tensor. We
have the following important result.


Proposition 7.1. For a solution to (7.8) on a general Lorentz manifold $\Omega$, the
stress-energy tensor has vanishing divergence, that is,

(7.9)    $T^{jk}{}_{;k} = 0$.

More generally, for any $u$,

(7.10)    $T^{jk}{}_{;k} = u^{;j}\,\Box u$.


Proof. This is a straightforward calculation. We have

(7.11)    $T^{jk} = u^{;j}u^{;k} - \frac{1}{2}h^{jk}\,u^{;\mu}u_{;\mu}$,

where $u^{;j} = h^{jk}u_{;k}$ denotes the gradient. Hence, using $h^{jk}{}_{;\ell} = 0$, we obtain

$T^{jk}{}_{;k} = u^{;j}{}_{;k}u^{;k} + u^{;j}u^{;k}{}_{;k} - \frac{1}{2}h^{jk}u^{;\mu}{}_{;k}u_{;\mu} - \frac{1}{2}h^{jk}u^{;\mu}u_{;\mu;k}$
$\qquad = u^{;j}\,\Box u + u^{;j}{}_{;k}u^{;k} - h^{jk}u^{;\mu}u_{;\mu;k}$
$\qquad = u^{;j}\,\Box u + u^{;j}{}_{;k}u^{;k} - u^{;\mu}u_{;\mu}{}^{;j}$.

Since, as we have seen, $u_{;j;k} = u_{;k;j}$, we obtain (7.10), and the proposition
follows.
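As a concrete check of (7.10), consider the flat case $\mathbb{R}\times\mathbb{R}$ with $h = -dt^2 + dx^2$, where covariant derivatives are ordinary partial derivatives, $u^{;0} = -u_t$, $u^{;1} = u_x$, and $\Box u = -u_{tt} + u_{xx}$. The worked computation below is an illustration under these assumptions, not part of the original proof.

```latex
T^{00} = \tfrac12\bigl(u_t^2 + u_x^2\bigr), \qquad
T^{01} = T^{10} = -u_t u_x, \qquad
T^{11} = \tfrac12\bigl(u_t^2 + u_x^2\bigr),
\\[4pt]
\partial_t T^{00} + \partial_x T^{01}
  = u_t u_{tt} + u_x u_{xt} - u_{tx} u_x - u_t u_{xx}
  = (-u_t)\,(-u_{tt} + u_{xx}) = u^{;0}\,\Box u,
\\[4pt]
\partial_t T^{10} + \partial_x T^{11}
  = -u_{tt} u_x - u_t u_{xt} + u_t u_{tx} + u_x u_{xx}
  = u_x\,(-u_{tt} + u_{xx}) = u^{;1}\,\Box u .
```

The first line is the familiar statement that the energy density $\frac12(u_t^2 + u_x^2)$ and the flux $-u_tu_x$ satisfy a conservation law when $\Box u = 0$.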


We have seen that the divergence theorem applies to reduce the integral
$\int_{\mathcal{O}} (\operatorname{div} X)\,dV$ to a boundary integral, when $X$ is a vector field; in particular, when
$X$ is a divergence-free vector field, it yields that a certain boundary integral is zero
or, equivalently, that integrals over two parts of $\partial\mathcal{O}$ are equal in magnitude. However,
$T$ is not a divergence-free vector field; it is a second-order tensor field. In
general vanishing of $\operatorname{div} T$ will not lead to integral conservation laws. It will,
however, in the following case.

Suppose a Lorentz manifold $\Omega$ has a timelike Killing field $Z$, that is, a timelike
vector field whose flow preserves the metric tensor $h$. As derived in the Riemannian
case, the condition for the metric to be preserved is

(7.12)    $Z_{j;k} + Z_{k;j} = 0, \quad Z_j = h_{jk}Z^k$.

Here, “timelike” means that $h(Z,Z) < 0$. This means $Z$ lies inside the light cone
determined by the Lorentz metric.


Lemma 7.2. If $T^{jk}$ is divergence free and $Z^k$ is a Killing field, then

(7.13)    $X^j = T^{jk}Z_k$ is divergence free.


Proof. We have

$X^j{}_{;j} = T^{jk}{}_{;j}Z_k + T^{jk}Z_{k;j}$.

Now the symmetry of $T^{jk}$ implies $T^{jk}{}_{;j} = 0$ and

$T^{jk}Z_{k;j} = \frac{1}{2}T^{jk}(Z_{k;j} + Z_{j;k}) = 0$,

assuming (7.12) holds. This proves the lemma.


We denote the vector (7.13) by

(7.14)    $X = TZ$.
FIGURE 7.1 Timelike Curves


Suppose $\mathcal{O}$ is a region in the Lorentz manifold $\Omega$, bounded by two surfaces $\Sigma_1$
and $\Sigma_2$, as in Fig. 7.1.
By (2.14), we have

(7.15)    $0 = \int_{\mathcal{O}} (\operatorname{div} TZ)\,dV = \int_{\partial\mathcal{O}} \omega\rfloor(TZ) = \int_{\Sigma_1} \langle TZ, \nu_1\rangle\,dS - \int_{\Sigma_2} \langle TZ, \nu_2\rangle\,dS$,

where $\nu_j$ is the unit vector, normal to $\Sigma_j$, with respect to the Lorentz metric $h$,
pointing in the same “forward” direction as $Z$. The last identity in (7.15) holds
in analogy with (2.15). We make the hypothesis as before, that $\Sigma_1$ and $\Sigma_2$ are
spacelike (i.e., $\nu_j$ are timelike), so it makes sense to specify that they lie inside
the forward light cone. Equation (7.15) is equivalent to

(7.16)    $\int_{\Sigma_2} T(Z, \nu_2)\,dS = \int_{\Sigma_1} T(Z, \nu_1)\,dS$.

The volume element $dS$ on $\Sigma_j$ is determined here by the Riemannian metric on
$\Sigma_j$, induced by restricting the Lorentz metric $h$ to tangent vectors to $\Sigma_j$.
Again we seek to guarantee that the integrand in (7.16), which is a quadratic
form in $du$ for $T$ given by (7.5), is positive-definite. In order to check this at a
point $p_0 \in \partial\mathcal{O}$, choose a coordinate system such that

(7.17)    $(h_{jk}(p_0)) = \begin{pmatrix} -1 & \\ & I \end{pmatrix}, \quad \nu(p_0) = (1, 0, \dots, 0)^t \ \bigl(= \partial/\partial x_0\bigr)$,

which is always possible. Suppose $Z(p_0) = (Z^0, Z^1, \dots, Z^n)$. The condition that
$Z(p_0)$ belong to the forward light cone is

(7.18)    $Z^0 > 0, \quad (Z^0)^2 > (Z^1)^2 + \cdots + (Z^n)^2$.

Now, if we set $M = T\nu$, then, at $p_0$,

(7.19)    $M^0 = -\frac{1}{2}\bigl[(\partial_0u)^2 + (\partial_1u)^2 + \cdots + (\partial_nu)^2\bigr], \quad M^j = (\partial_0u)(\partial_ju)$,

if $T$ is given by (7.5). Consequently, at $p_0$,

(7.20)    $T(Z, \nu) = \langle Z, M\rangle = -Z^0M^0 + \sum_{j=1}^n Z^jM^j$
$\qquad\qquad = \frac{1}{2}Z^0\bigl[(\partial_0u)^2 + \cdots + (\partial_nu)^2\bigr] + \sum_{j=1}^n Z^j(\partial_0u)(\partial_ju)$.

The positive definiteness of this quadratic form in $(\partial_0u, \dots, \partial_nu)$ follows immediately
from Cauchy’s inequality, granted (7.18). This definiteness calculation
does not use the hypothesis that $Z$ is a Killing field, of course. For positive
definiteness of $T(Z, \nu)$ in $du$, it suffices that $Z$ and $\nu$ both be nonzero timelike vectors
inside the forward light cone.

In order to emphasize that the dependence of $T(Z, \nu)$ on $du$ has fundamental
significance, we adopt the following notation. Set

(7.21)    $E_{Z,\nu}(du) = T(Z, \nu)$
$\qquad\qquad = \bigl(du\otimes du - \frac{1}{2}\langle du, du\rangle h\bigr)(Z, \nu)$
$\qquad\qquad = \langle du, Z\rangle\langle du, \nu\rangle - \frac{1}{2}\langle Z, \nu\rangle\langle du, du\rangle$.

The calculation above establishes the following result.


Lemma 7.3. If $Z$ and $\nu$ are nonzero timelike vectors pointing inside the forward
light cone, then
$E_{Z,\nu}(du)$ is positive-definite in $du$.
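Lemma 7.3 can be probed numerically in Minkowski space. The sketch below, which assumes the flat metric $h = \operatorname{diag}(-1, 1, \dots, 1)$ and uses hypothetical helper names, samples random forward timelike $Z$, $\nu$ and random covectors $du$, evaluates the last expression in (7.21), and reports the smallest value found. It also shows that positivity can fail when $Z$ is spacelike.

```python
import math
import random

def minkowski(v, w):
    """Lorentz inner product -v0*w0 + v1*w1 + ... for h = diag(-1, 1, ..., 1)."""
    return -v[0] * w[0] + sum(a * b for a, b in zip(v[1:], w[1:]))

def energy_form(du, Z, nu):
    """E_{Z,nu}(du) = <du,Z><du,nu> - (1/2)<Z,nu><du,du>, as in (7.21)."""
    pairing_Z = sum(a * b for a, b in zip(du, Z))    # covector du applied to Z
    pairing_nu = sum(a * b for a, b in zip(du, nu))  # covector du applied to nu
    du_sq = minkowski(du, du)  # h^{jk} has the same matrix as h_{jk} here
    return pairing_Z * pairing_nu - 0.5 * minkowski(Z, nu) * du_sq

def random_forward_timelike(n, rng):
    """Random vector with v0 > |spatial part|, i.e. inside the forward light cone."""
    spatial = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    length = math.sqrt(sum(s * s for s in spatial))
    return [length + rng.uniform(0.1, 1.0)] + spatial

def smallest_sample(n=3, trials=2000, seed=0):
    rng = random.Random(seed)
    values = []
    for _ in range(trials):
        Z = random_forward_timelike(n, rng)
        nu = random_forward_timelike(n, rng)
        du = [rng.uniform(-1.0, 1.0) for _ in range(n + 1)]
        values.append(energy_form(du, Z, nu))
    return min(values)

print(smallest_sample())                                      # positive
print(energy_form([1, -1, 0, 0], [0, 1, 0, 0], [1, 0, 0, 0])) # spacelike Z: negative
```

A random search is of course no substitute for the Cauchy inequality argument; it only illustrates that the timelike hypothesis on both vectors is what drives positivity.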


Note that the identity (7.16) is

(7.22)    $\int_{\Sigma_2} E_{Z,\nu_2}(du)\,dS = \int_{\Sigma_1} E_{Z,\nu_1}(du)\,dS$.

It follows that if $\mathcal{O}$, as in Fig. 7.1, is swept out by spacelike surfaces, as in Fig. 7.2,
then the same argument as given in §6 leads to the uniqueness result: $\Box u = 0$ in
$\mathcal{O}$, $u$ and $du = 0$ on $\Sigma_1$ imply $u = 0$ in $\mathcal{O}$, provided $\Omega$ has a timelike Killing
field $Z$. This gives finite propagation speed for solutions to the wave equation on
such a Lorentz manifold.

FIGURE 7.2 Spacelike Surfaces (surfaces $\Sigma_2(s)$)


If a Lorentz manifold $\Omega$ has no timelike Killing field, which is typical, then
natural energy identities such as (7.22) do not arise. However, there are inequalities
involving the stress-energy tensor that are powerful enough to imply the local
uniqueness (finite propagation speed) of solutions to the wave equation $\Box u = 0$
on a general Lorentz manifold. In the next section we will establish this as a special
case of a more general result on hyperbolic equations.


Exercises 


1. If M is a Lorentz manifold, S C M a hypersurface (codimension 1), show that S 
is spacelike if and only if the metric tensor restricted to S is positive-definite. In the 
product case (7.1), show that the definitions of “spacelike” given in this section and the 
previous one are equivalent. 

2. On $\mathbb{R}^{n+1}$, with coordinates $(x_0, \dots, x_n)$, place the Lorentz inner product

   $\langle u, v\rangle = -u_0 v_0 + u_1 v_1 + \cdots + u_n v_n.$

   Show that $A : \mathbb{R}^{n+1} \to \mathbb{R}^{n+1}$, defined by

   $A(u_0, u_1, u_2, \dots, u_n) = (u_1, u_0, u_2, \dots, u_n),$

   is skew-adjoint for the Lorentz metric (i.e., $\langle Au, v\rangle = -\langle u, Av\rangle$), and hence the group
   $F(t) = e^{tA}$ preserves the Lorentz metric.

3. Consider the hyperboloids

   $M = M_s = \{x \in \mathbb{R}^{n+1} : \langle x, x\rangle = s\}.$

   Show that $M_s$ is spacelike if and only if $s < 0$.

4. If $s > 0$ and $M_s$ is as in Exercise 3, show that $M_s$ gets a Lorentz metric, induced from
   $\mathbb{R}^{n+1}$. Show that the group $F(t)$ of Exercise 2 leaves $M_s$ invariant and its generator is
   a timelike Killing field on $M_s$.

5. We consider a general approach to constructing a second-order tensor of the form

   $T^{jk} = A^{jk\ell m} u_{;\ell} u_{;m},$

   where $A^{jk\ell m}$ is a tensor field of type (4, 0), such that the conclusion (7.10) of
   Proposition 7.1 holds. Let us assume that $\nabla A = 0$. Show that

   $T^{jk}{}_{;k} = B^{jk\ell m} u_{;k;\ell} u_{;m},$

   where

   $B = P^{23} P^{34} A.$

   Here, $P^{\mu\nu}$ denotes the operation on tensors of type (4, 0) of symmetrizing with respect
   to the $\mu$th and $\nu$th indices, for example, $(P^{23}C)^{jk\ell m} = (1/2)[C^{jk\ell m} + C^{j\ell k m}]$.
   Consequently, (7.10) holds provided

   $P^{23} P^{34} A = H, \quad H^{jk\ell m} = h^{jm} h^{k\ell}.$

6. Show that $P^{\mu\nu}$ are all projections of the same rank and $H$ belongs to the range of $P^{23}$.
   Show that $\operatorname{Ker} P^{23} \cap R(P^{34}) = 0$ and hence

   $P^{23} : R(P^{34}) \to R(P^{23})$ is an isomorphism.

   (Hint: If $B \in \operatorname{Ker} P^{23} \cap R(P^{34})$, show that $B^{jk\ell m} = -B^{jmk\ell}$. Now $(k\,\ell\,m) \mapsto (m\,k\,\ell)$
   is a cyclic permutation of order 3, so apply this transformation three times.)

7. Deduce that the equation $P^{23} P^{34} A = H$ has a solution $A$, given uniquely, mod $\operatorname{Ker} P^{34}$,
   and hence that the tensor $T^{jk} = A^{jk\ell m} u_{;\ell} u_{;m}$ is uniquely determined by the
   conditions set in Exercise 5.

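8. Show that, for general smooth scalar $u$, with $T$ defined by (7.6),

   (7.23)   $\operatorname{div} TZ = (Zu)\,\Box u + \langle T, \operatorname{Def}(Z)\rangle,$

   where $\operatorname{Def}(Z)$ is the deformation tensor of $Z$, with components $(1/2)(Z_{j;k} + Z_{k;j})$, and
   $\langle T, V\rangle = T^{jk} V_{jk}$. This implies Lemma 7.2. Show that (7.23) follows from the general
   identity

   (7.24)   $\operatorname{div}(TZ) = \langle Z, \operatorname{div} T\rangle + \langle T, \operatorname{Def} Z\rangle.$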


8. More general hyperbolic equations; energy estimates 


In this section we derive estimates for a solution to a nonhomogeneous hyperbolic 
equation of the form 


(8.1)   $Lu = f$ in $\Omega$,

where $L$ is given in local coordinates by

(8.2)   $Lu = h^{jk}(x)\, \partial_j \partial_k u + b^j(x)\, \partial_j u + c(x) u.$

By definition, to say $L$ is hyperbolic is to say that $(h^{jk})$ is a symmetric matrix of
signature $(n, 1)$, if $\dim \Omega = n + 1$. One can then use the inverse matrix $(h_{jk})$
to define a Lorentz metric on $\Omega$, and in view of the formula (7.7), we can write
(8.2) as

(8.3)   $Lu = \Box u + Xu,$

for some first-order differential operator $X$ on $\Omega$.
Suppose $\mathcal{O} \subset \Omega$ is bounded by two surfaces $\Sigma_1$ and $\Sigma_2$, both spacelike. As at
the end of §7, we suppose that $\mathcal{O}$ is swept out by spacelike surfaces. Specifically,
we suppose that there is a smooth function on a neighborhood of $\mathcal{O}$, which in fact
we denote by $t$, such that $dt$ is timelike, and set

$\mathcal{O}(s) = \mathcal{O} \cap \{t \le s\}, \quad \Sigma_2(s) = \mathcal{O} \cap \{t = s\}.$

We suppose $\mathcal{O}$ is swept out by $\Sigma_2(s)$, $s_0 \le s \le s_1$, as illustrated in Fig. 8.1, with
$\Sigma_2 = \Sigma_2(s_1)$. Also set

$\Sigma_1^b(s) = \Sigma_1 \cap \{t \le s\}.$

FIGURE 8.1 Spacelike Bounded Regions


As in (7.15), the divergence theorem implies

(8.4)   $\int_{\Sigma_2(s)} E_{Z,\nu_2}(du)\, dS = \int_{\Sigma_1^b(s)} E_{Z,\nu_1}(du)\, dS - \int_{\mathcal{O}(s)} (\operatorname{div} TZ)\, dV,$

where $E_{Z,\nu_j}(du)$ is defined by (7.21) and $T$ by (7.5), though at this point it is not
physically meaningful in general to think of $T$ as the stress-energy tensor. Here $\nu_1$
is the forward-pointing unit normal to $\Sigma_1$, with respect to the Lorentz metric, and
$\nu_2$ is the normalization of $\operatorname{grad} t$, the vector field obtained from $dt$ via the Lorentz
metric. $Z$ is any timelike vector field; we will set $Z = \nu_2$. Note that Lemma 7.3
applies to the integrands $E_{Z,\nu_j}(du)$.

We no longer have $\operatorname{div} TZ = 0$, but we can estimate this quantity, as follows.
First,

(8.5)   $\operatorname{div} TZ = T^{jk}{}_{;k}\, h_{j\ell} Z^\ell + T^{jk} h_{j\ell} Z^\ell{}_{;k} = \langle \operatorname{div} T, Z\rangle + \langle T, \nabla Z\rangle.$

The term $\langle T, \nabla Z\rangle$ is a quadratic form in $du$, and hence, by Lemma 7.3, we have
an estimate

(8.6)   $|\langle T, \nabla Z\rangle| \le K\, E_{Z,Z}(du).$

As for the first term on the right side of (8.5), (7.10) implies

(8.7)   $\operatorname{div} T = (\operatorname{grad} u)\, \Box u.$

If $u$ satisfies $Lu = f$, this implies

(8.8)   $\operatorname{div} T = (\operatorname{grad} u)(f - Xu).$

Cauchy’s inequality together with Lemma 7.3 gives an estimate

(8.9)   $|\langle \operatorname{div} T, Z\rangle| \le K\, E_{Z,Z}(du) + K|u|^2 + K|f|^2.$


Consequently, (8.4) yields the estimate

(8.10)   $\int_{\Sigma_2(s)} E_{Z,Z}(du)\, dS \le \int_{\Sigma_1^b(s)} E_{Z,\nu_1}(du)\, dS + K \int_{\mathcal{O}(s)} \bigl[E_{Z,Z}(du) + |u|^2 + |f|^2\bigr]\, dV.$

Suppose that $u$ satisfies the following initial conditions on $\Sigma_1$:

(8.11)   $u = g, \quad du = w$ on $\Sigma_1$.


We want to estimate the left side of (8.10) in terms of $f$, $g$, and $w$. Our first goal
will be to derive a variant of (8.10) without the $|u|^2$ term. We can work on the term
$\int_{\mathcal{O}(s)} |u|^2\, dV$ on the right side of (8.10) as follows. An easy consequence of the
fundamental theorem of calculus, Cauchy’s inequality, and Lemma 7.3 gives

(8.12)   $\int_{\mathcal{O}(s)} |u|^2\, dV \le C \int_{\Sigma_1^b(s)} |u|^2\, dS + C \int_{\mathcal{O}(s)} E_{Z,Z}(du)\, dV,$

which can be applied to (8.10).
At this point, it is convenient to set

(8.13)   $E(s) = \int_{\mathcal{O}(s)} E_{Z,Z}(du)\, dV.$

We will want to estimate the rate of change of $E(s)$. Clearly,

(8.14)   $\dfrac{dE}{ds} \le C \int_{\Sigma_2(s)} E_{Z,Z}(du)\, dS,$
and hence, by (8.10)–(8.12), we have an estimate of the form

(8.15)   $\dfrac{dE}{ds} \le C E(s) + F(s),$

where

(8.16)   $F(s) = C \int_{\Sigma_1^b(s)} \bigl[E_{Z,\nu_1}(w) + |g|^2\bigr]\, dS + C \int_{\mathcal{O}(s)} |f|^2\, dV.$

Note that (8.15) is equivalent to

(8.17)   $\dfrac{d}{ds}\bigl(e^{-Cs} E(s)\bigr) \le e^{-Cs} F(s),$

and since $E(s_0) = 0$, we have

(8.18)   $e^{-Cs} E(s) \le \int_{s_0}^{s} e^{-Cr} F(r)\, dr.$
In view of (8.16), this establishes the following “energy estimate.” 


Proposition 8.1. If $u$ solves the hyperbolic equation $Lu = f$ of the form (8.3),
with initial data (8.11) on $\Sigma_1$, and if $\mathcal{O}(s)$ satisfies the geometrical hypotheses
made above and illustrated in Fig. 8.1, then

(8.19)   $\int_{\mathcal{O}(s)} E_{Z,Z}(du)\, dV \le C(s - s_0) \int_{\Sigma_1} \bigl[|g|^2 + |w|^2\bigr]\, dS + C \int_{\mathcal{O}(s)} |f|^2\, dV,$

for $s \in [s_0, s_1]$.


In particular, if $g$ and $w$ vanish on $\Sigma_1$ and $f$ vanishes on $\mathcal{O}$, then (8.19) implies
$du = 0$ on $\mathcal{O}$, so $u$ is constant on $\mathcal{O}$, that constant being $g = 0$. This gives
the local uniqueness (finite propagation speed) for solutions to the homogeneous
hyperbolic equation $Lu = 0$, extending the result of §7.
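The passage from the differential inequality (8.15) to the integrated bound (8.18) is a Gronwall-type step, and it can be illustrated numerically. The sketch below makes assumptions of our own: a sample constant $C = 2$, a sample forcing $F(s) = 1 + s$, $s_0 = 0$, and a hypothetical nonnegative "slack" term that turns the equality $E' = CE + F$ into a strict inequality. It integrates both ODEs with a classical Runge–Kutta scheme and compares them with $\int_{s_0}^{s} e^{C(s-r)} F(r)\, dr$, which is (8.18) multiplied through by $e^{Cs}$.

```python
import math

def rk4_solve(rhs, s0, s1, e0, steps):
    """Integrate E'(s) = rhs(s, E) from s0 to s1 with classical RK4."""
    h = (s1 - s0) / steps
    s, e = s0, e0
    for _ in range(steps):
        k1 = rhs(s, e)
        k2 = rhs(s + h / 2, e + h * k1 / 2)
        k3 = rhs(s + h / 2, e + h * k2 / 2)
        k4 = rhs(s + h, e + h * k3)
        e += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        s += h
    return e

def gronwall_bound(C, F, s0, s, steps):
    """Trapezoid-rule value of the integral of e^{C(s-r)} F(r) over [s0, s]."""
    h = (s - s0) / steps
    total = 0.0
    for k in range(steps + 1):
        r = s0 + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * math.exp(C * (s - r)) * F(r)
    return h * total

C = 2.0                                  # sample constant (assumption)

def F(s):
    return 1.0 + s                       # sample forcing term (assumption)

def slack(s):
    return 0.5 * (1.0 + math.sin(s))     # hypothetical nonnegative loss

# Equality case E' = C E + F, and a strict subsolution E' = C E + F - slack.
equality = rk4_solve(lambda s, e: C * e + F(s), 0.0, 1.0, 0.0, 2000)
strict = rk4_solve(lambda s, e: C * e + F(s) - slack(s), 0.0, 1.0, 0.0, 2000)
bound = gronwall_bound(C, F, 0.0, 1.0, 2000)
```

In the equality case the solution coincides with the bound (up to discretization error), which shows that (8.18) cannot be improved without further information; any solution of the strict inequality stays below it.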

We note that, using (8.10) and (8.12), we deduce from (8.19) that

(8.20)   $\int_{\Sigma_2(s)} E_{Z,Z}(du)\, dS \le C \int_{\Sigma_1} \bigl[|g|^2 + |w|^2\bigr]\, dS + C \int_{\mathcal{O}(s)} |f|^2\, dV.$


Exercises 


1. Prove the estimate 


ce 2 |u(s)|? ds < |u(0)|? + c. | ju’ (s)|? ds. 
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What is the best value of C, that will work? 
2. Give a detailed proof of the estimate (8.12). 
3. Sharpen the estimate (8.19) to

   (8.21)   $\int_{\mathcal{O}(s)} E_{Z,Z}(du)\, dV \le C(s - s_0) \int_{\Sigma_1} \bigl[|g|^2 + |w|^2\bigr]\, dS + C(s - s_0) \int_{\mathcal{O}(s)} |f|^2\, dV,$

   under the hypotheses of Proposition 8.1. (Hint: Use (8.18) more carefully.)

4. Work out generalizations of the energy estimates (8.10)–(8.19) when $u$ satisfies the
   semilinear PDE

   (8.22)   $\Box u = f(x, u, du).$

   Formulate and prove a finite propagation speed result in this case.
   (Hint: Given solutions $u_1$ and $u_2$ to (8.22), derive a linear PDE for $w = u_1 - u_2$, to
   get the finite propagation speed result.)


9. The symbol of a differential operator and a general
Green–Stokes formula


Let $P$ be a differential operator of order $m$ on a manifold $M$; $P$ could operate on
sections of a vector bundle. In local coordinates, $P$ has the form

(9.1)   $Pu(x) = \sum_{|\alpha| \le m} p_\alpha(x) D^\alpha u(x),$

where $D^\alpha = D_1^{\alpha_1} \cdots D_n^{\alpha_n}$, $D_j = (1/i)\, \partial/\partial x_j$. The coefficients $p_\alpha(x)$ could be
matrix valued. The homogeneous polynomial in $\xi \in \mathbb{R}^n$ ($n = \dim M$),

(9.2)   $p_m(x, \xi) = \sum_{|\alpha| = m} p_\alpha(x) \xi^\alpha,$

is called the principal symbol (or just the symbol) of $P$. We want to give an
intrinsic characterization, which will show that $p_m(x, \xi)$ is well defined on the
cotangent bundle of $M$. For a smooth function $\psi$, a simple calculation, using the
product rule and chain rule of differentiation, gives

(9.3)   $P\bigl(u(x) e^{i\lambda\psi}\bigr) = \bigl[p_m(x, d\psi) u(x) \lambda^m + r(x, \lambda)\bigr] e^{i\lambda\psi},$

where $r(x, \lambda)$ is a polynomial of degree $\le m - 1$ in $\lambda$. In (9.3), $p_m(x, d\psi)$ is
evaluated by substituting $\xi = (\partial\psi/\partial x_1, \dots, \partial\psi/\partial x_n)$ into (9.2). Thus the formula

(9.4)   $p_m(x, d\psi) u(x) = \lim_{\lambda \to \infty} \lambda^{-m} e^{-i\lambda\psi} P\bigl(u(x) e^{i\lambda\psi}\bigr)$

provides an intrinsic characterization of the symbol of $P$ as a function on $T^*M$.
We also use the notation 


(9.5)   $\sigma_P(x, \xi) = p_m(x, \xi).$

If

(9.6)   $P : C^\infty(M, E_0) \to C^\infty(M, E_1),$

where $E_0$ and $E_1$ are smooth vector bundles over $M$, then, for each $(x, \xi) \in T^*M$,

(9.7)   $p_m(x, \xi) : E_{0x} \to E_{1x}$

is a linear map between fibers. It is easy to verify that if $P_2$ is another differential
operator, mapping $C^\infty(M, E_1)$ to $C^\infty(M, E_2)$, then

(9.8)   $\sigma_{P_2 P}(x, \xi) = \sigma_{P_2}(x, \xi)\, \sigma_P(x, \xi).$
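As an illustration of the characterization (9.3)–(9.4), here is the computation for one concrete operator. The choice of the flat Laplacian on $\mathbb{R}^n$ as the test case is ours; it is meant only as a worked sketch of how the leading power of $\lambda$ isolates the principal symbol.

```latex
% Test case (our choice): P = \Delta = \sum_j \partial_j^2 = -\sum_j D_j^2 on R^n, so m = 2.
\begin{align*}
\partial_j\bigl(u e^{i\lambda\psi}\bigr)
  &= \bigl(\partial_j u + i\lambda\, u\, \partial_j\psi\bigr) e^{i\lambda\psi}, \\
\Delta\bigl(u e^{i\lambda\psi}\bigr)
  &= \Bigl[-\lambda^2 |d\psi|^2 u
     + i\lambda\bigl(2\langle d\psi, du\rangle + u\,\Delta\psi\bigr)
     + \Delta u\Bigr] e^{i\lambda\psi}, \\
\lim_{\lambda\to\infty} \lambda^{-2} e^{-i\lambda\psi}\,
  \Delta\bigl(u e^{i\lambda\psi}\bigr)
  &= -|d\psi|^2 u,
  \qquad\text{so}\qquad \sigma_\Delta(x,\xi) = -|\xi|^2 .
\end{align*}
```

The bracket in the second line has exactly the shape of (9.3): a term of order $\lambda^2$ carrying the symbol, plus a remainder $r(x,\lambda)$ of degree 1 in $\lambda$. The result agrees with the direct evaluation of (9.2), since $\Delta = -\sum_j D_j^2$ gives $p_2(x,\xi) = -\sum_j \xi_j^2$.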


If $M$ has a Riemannian metric, and the vector bundles $E_j$ have metrics, then the
formal adjoint $P^t$ of a differential operator of order $m$ like (9.6) is a differential
operator of order $m$:

$P^t : C^\infty(M, E_1) \to C^\infty(M, E_0),$

defined by the condition that

(9.9)   $(Pu, v) = (u, P^t v)$

if $u$ and $v$ are smooth, compactly supported sections of the bundles $E_0$ and $E_1$. If $u$
and $v$ are supported on a coordinate patch $\mathcal{O}$ on $M$, over which $E_j$ are trivialized,
so $u$ and $v$ have components $u^\sigma$, $v^\sigma$, and if the metrics on $E_0$ and $E_1$ are denoted
$h_{0\sigma\tau}$, $h_{1\sigma\tau}$, respectively, while the Riemannian metric is $g_{jk}$, then we have

(9.10)   $(Pu, v) = \int_{\mathcal{O}} h_{1\sigma\tau}(x)\, (Pu)^\sigma\, \overline{v}^\tau \sqrt{g(x)}\, dx.$

Substituting (9.1) and integrating by parts produces an expression for $P^t$, of the
form

(9.11)   $P^t v(x) = \sum_{|\alpha| \le m} p_\alpha^t(x) D^\alpha v(x).$

In particular, one sees that the principal symbol of $P^t$ is given by

(9.12)   $\sigma_{P^t}(x, \xi) = \sigma_P(x, \xi)^t.$

Compare this with the specific formula (2.22) for the formal adjoint of a real
vector field, which has a purely imaginary symbol.

Now suppose $M$ is a compact, smooth manifold with smooth boundary. We
want to obtain a generalization of formula (2.24), that is,

(9.13)   $(Xu, v) - (u, X^t v) = \int_{\partial M} \langle \nu, X\rangle\, u \overline{v}\, dS,$

to the case where $P$ is a general first-order differential operator, acting on sections
of a vector bundle as in (9.6). Using a partition of unity, we can suppose that $u$
and $v$ are supported in a coordinate patch $\mathcal{O}$ in $M$. If the patch is disjoint from
$\partial M$, then of course (9.9) holds. Otherwise, suppose $\mathcal{O}$ is a patch in $\mathbb{R}^n_+$. If the
first-order operator $P$ has the form

(9.14)   $Pu = \sum_{j=1}^{n} a_j(x) \dfrac{\partial u}{\partial x_j} + b(x) u,$

then

(9.15)   $\int_{\mathcal{O}} \langle Pu, v\rangle \sqrt{g}\, dx = \int_{\mathcal{O}} \Bigl[\sum_{j=1}^{n} \Bigl\langle a_j(x) \dfrac{\partial u}{\partial x_j}, v\Bigr\rangle + \langle b(x) u, v\rangle\Bigr] \sqrt{g}\, dx.$

If we apply the fundamental theorem of calculus, the only boundary integral
comes from the term involving $\partial u/\partial x_n$. Thus we have

(9.16)   $\int_{\mathcal{O}} \langle Pu, v\rangle \sqrt{g}\, dx = \int_{\mathcal{O}} \langle u, P^t v\rangle \sqrt{g}\, dx - \int_{\mathbb{R}^{n-1}} \langle a_n(x', 0) u, v\rangle \sqrt{g(x', 0)}\, dx',$

where $dx' = dx_1 \cdots dx_{n-1}$. If we pick the coordinate patch so that $\partial/\partial x_n$ is the
unit inward normal at $\partial M$, then $\sqrt{g(x', 0)}\, dx'$ is the volume element on $\partial M$, and
we are ready to establish the following Green–Stokes formula:


Proposition 9.1. If $M$ is a smooth, compact manifold with boundary and $P$ is a
first-order differential operator (acting on sections of a vector bundle), then

(9.17)   $(Pu, v) - (u, P^t v) = \dfrac{1}{i} \int_{\partial M} \langle \sigma_P(x, \nu) u, v\rangle\, dS.$

Proof. The formula (9.17), which arose via a choice of local coordinate chart, is
invariant and hence valid independent of choices.

As in (9.13), $\nu$ denotes the outward-pointing unit normal to $\partial M$; we use the
Riemannian metric on $M$ to identify tangent vectors and cotangent vectors.

We will see an important application of (9.17) in the next section, where we
consider the Laplace operator on k-forms.
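A one-dimensional sanity check of (9.17) may help fix the signs. The example is our own choice, not taken from the text: $M = [0,1]$ with its standard metric, $P = d/dx$ acting on scalar functions, with $(u,v) = \int_0^1 u\,\overline{v}\,dx$.

```latex
% Assumed test case: M = [0,1], P = d/dx on complex-valued functions.
% Then P = i D, so \sigma_P(x,\xi) = i\xi, and P^t = -d/dx.
% Outward normal: \nu = +1 at x = 1 and \nu = -1 at x = 0.
\begin{align*}
(Pu, v) - (u, P^t v)
  &= \int_0^1 \bigl(u'\,\overline{v} + u\,\overline{v}'\bigr)\, dx
   = u(1)\overline{v(1)} - u(0)\overline{v(0)}, \\
\frac{1}{i}\int_{\partial M} \langle \sigma_P(x,\nu) u, v\rangle\, dS
  &= \frac{1}{i}\Bigl[\, i\,(+1)\, u(1)\overline{v(1)}
     + i\,(-1)\, u(0)\overline{v(0)} \Bigr]
   = u(1)\overline{v(1)} - u(0)\overline{v(0)} .
\end{align*}
```

Both sides agree, so in this case (9.17) is just the fundamental theorem of calculus applied to $u\overline{v}$; the factor $1/i$ in (9.17) cancels the factor $i$ that the convention $D_j = (1/i)\,\partial/\partial x_j$ puts into the symbol.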


Exercises 


1. Consider the divergence operator acting on (complex-valued) vector fields:

   $\operatorname{div} : C^\infty(\Omega, \mathbb{C}^n) \to C^\infty(\Omega), \quad \Omega \subset \mathbb{R}^n.$

   Show that its symbol is given by

   $\sigma_{\operatorname{div}}(x, \xi) v = i \langle v, \xi\rangle.$

2. Consider the gradient operator acting on (complex-valued) functions:

   $\operatorname{grad} : C^\infty(\Omega) \to C^\infty(\Omega, \mathbb{C}^n), \quad \Omega \subset \mathbb{R}^n.$

   Show that its symbol is

   $\sigma_{\operatorname{grad}}(x, \xi) = i\xi.$

3. Consider the operator

   $L = \operatorname{grad} \operatorname{div} : C^\infty(\Omega, \mathbb{C}^n) \to C^\infty(\Omega, \mathbb{C}^n).$

   Show that its symbol is

   $\sigma_L(x, \xi) = -|\xi|^2 P_\xi,$

   where $P_\xi \in \operatorname{End}(\mathbb{C}^n)$ is the orthogonal projection onto the (complex) linear span of $\xi$.

4. What is the symbol of the operator

   $P = \mu \Delta + (\lambda + \mu) \operatorname{grad} \operatorname{div},$

   which appears in the equation (1.59) of linear elasticity? What are the eigenvalues of
   the symbol, for given $\xi \in \mathbb{R}^n$?

5. Generalize Exercises 1–3 to the case of a Riemannian manifold.

6. Let $L$ be a constant-coefficient, second-order, homogeneous, linear differential operator
   acting on functions on $\mathbb{R}^n$ with values in $\mathbb{C}^k$, of the form

   $Lu = \sum_{|\alpha| = 2} A_\alpha D^\alpha u, \quad A_\alpha \in \operatorname{End}(\mathbb{C}^k).$

   Let $\xi \in \mathbb{R}^n \setminus 0$. A “plane wave” solution to $u_{tt} - Lu = 0$ is a $\mathbb{C}^k$-valued function
   $u(t, x)$ of the form

   $u(t, x) = v(t, x \cdot \xi),$

   with $v(t, y)$ a $\mathbb{C}^k$-valued function on $\mathbb{R} \times \mathbb{R}$. Show that the PDE for $v$ becomes

   $v_{tt} - M v_{yy} = 0,$

   with

   $M = -\sigma_L(x, \xi).$

   In case $\sigma_L(x, \xi)$ is negative-definite with eigenvalues $-c_j^2 = -c_j(\xi)^2$, show that the
   initial-value problem for $v$ can be solved in terms of the formula for the one-dimensional
   wave equation derived in §1.

7. Consider the equation of linear elasticity from (1.59):

   $m w_{tt} - \mu \Delta w - (\lambda + \mu) \operatorname{grad} \operatorname{div} w = 0.$

   Suppose $\mu > 0$, $2\mu + \lambda > 0$. Fix $\xi \in \mathbb{R}^n \setminus 0$. Using the results of Exercises 4 and 6,
   analyze plane wave solutions $w(t, x) = v(t, x \cdot \xi)$. Show that if $n \ge 2$, there are two
   propagation speeds. The faster and slower waves are called “p-waves” (pressure waves)
   and “s-waves” (shear waves), respectively. If $n = 1$, only p-waves arise.


10. The Hodge Laplacian on k-forms 


If $M$ is an $n$-dimensional Riemannian manifold, recall the exterior derivative

(10.1)   $d : \Lambda^k(M) \to \Lambda^{k+1}(M),$

satisfying

(10.2)   $d^2 = 0.$

The Riemannian metric on $M$ gives rise to an inner product on $T_x^*$ for each $x \in M$,
and then to an inner product on $\Lambda^k T_x^*$ via

(10.3)   $\langle v_1 \wedge \cdots \wedge v_k, w_1 \wedge \cdots \wedge w_k\rangle = \sum_\pi (\operatorname{sgn} \pi) \langle v_1, w_{\pi(1)}\rangle \cdots \langle v_k, w_{\pi(k)}\rangle,$

where $\pi$ ranges over the set of permutations of $\{1, \dots, k\}$. Equivalently, if
$\{e_1, \dots, e_n\}$ is an orthonormal basis of $T_x^* M$, then $\{e_{j_1} \wedge \cdots \wedge e_{j_k} : j_1 < j_2 < \cdots < j_k\}$
is an orthonormal basis of $\Lambda^k T_x^* M$. Consequently, there is an inner
product on k-forms (that is, sections of $\Lambda^k$) given by

(10.4)   $(u, v) = \int_M \langle u, v\rangle\, dV(x).$

Thus there is a first-order differential operator

(10.5)   $\delta : \Lambda^{k+1}(M) \to \Lambda^k(M),$

which is the formal adjoint of $d$, that is, $\delta$ is characterized by

(10.6)   $(du, v) = (u, \delta v), \quad u \in \Lambda^k(M),\ v \in \Lambda^{k+1}(M)$, compactly supported.

We set $\delta = 0$ on 0-forms. Of course, (10.2) implies

(10.7)   $\delta^2 = 0.$

There is a useful formula for $\delta$, involving $d$ and the “Hodge star operator,” which
will be derived in Chap. 5, §8.
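The Hodge Laplacian on k-forms,

(10.8)   $\Delta : \Lambda^k(M) \to \Lambda^k(M),$

is defined by

(10.9)   $-\Delta = (d + \delta)^2 = d\delta + \delta d.$

Consequently,

(10.10)   $(-\Delta u, v) = (du, dv) + (\delta u, \delta v)$, for $u, v \in C_0^\infty(M, \Lambda^k)$.

Since $\delta = 0$ on $\Lambda^0(M)$, we have $-\Delta = \delta d$ on $\Lambda^0(M)$.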

We will obtain an analogue of (10.10) for the case where $M$ is a compact
manifold with boundary, so a boundary integral appears. To obtain such a formula,
we specialize the general Green–Stokes formula (9.17) to the cases $P = d$ and
$P = \delta$. First, we compute the symbols of $d$ and $\delta$. Since, for a k-form $u$,

(10.11)   $d(u e^{i\lambda\psi}) = i\lambda e^{i\lambda\psi} (d\psi) \wedge u + e^{i\lambda\psi}\, du,$

we see that

(10.12)   $\dfrac{1}{i} \sigma_d(x, \xi) u = \xi \wedge u.$

As a special case of (9.12), we have

(10.13)   $\sigma_\delta(x, \xi) = \sigma_d(x, \xi)^t.$

The adjoint of the map (10.12) from $\Lambda^k T_x^*$ to $\Lambda^{k+1} T_x^*$ is given by the interior
product

(10.14)   $\iota_\xi u = u \rfloor X,$

where $X \in T_x$ is the vector corresponding to $\xi \in T_x^*$ under the isomorphism
$T_x \approx T_x^*$ given by the Riemannian metric. Consequently,

(10.15)   $\dfrac{1}{i} \sigma_\delta(x, \xi) u = -\iota_\xi u.$


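Now, the Green–Stokes formula (9.17) implies, for $M$ a compact Riemannian
manifold with boundary,

(10.16)   $(du, v) = (u, \delta v) + \dfrac{1}{i} \int_{\partial M} \langle \sigma_d(x, \nu) u, v\rangle\, dS = (u, \delta v) + \int_{\partial M} \langle \nu \wedge u, v\rangle\, dS,$

and

(10.17)   $(\delta u, v) = (u, dv) + \dfrac{1}{i} \int_{\partial M} \langle \sigma_\delta(x, \nu) u, v\rangle\, dS = (u, dv) - \int_{\partial M} \langle \iota_\nu u, v\rangle\, dS.$

Recall that $\nu$ is the outward-pointing unit normal to $\partial M$.
Consequently, our generalization of (10.10), and also of (4.8), is

(10.18)   $-(\Delta u, v) = (du, dv) + (\delta u, \delta v) + \dfrac{1}{i} \int_{\partial M} \bigl[\langle \sigma_d(x, \nu)\, \delta u, v\rangle + \langle \sigma_\delta(x, \nu)\, du, v\rangle\bigr]\, dS$

or, equivalently,

(10.19)   $-(\Delta u, v) = (du, dv) + (\delta u, \delta v) + \int_{\partial M} \bigl[\langle \nu \wedge (\delta u), v\rangle - \langle \iota_\nu (du), v\rangle\bigr]\, dS.$

Taking adjoints of the symbol maps, we can also write

(10.20)   $-(\Delta u, v) = (du, dv) + (\delta u, \delta v) + \int_{\partial M} \bigl[\langle \delta u, \iota_\nu v\rangle - \langle du, \nu \wedge v\rangle\bigr]\, dS.$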

Let us note what the symbol of $\Delta$ is. By (10.12) and (10.15),

(10.21)   $-\sigma_\Delta(x, \xi) u = \iota_\xi\, \xi \wedge u + \xi \wedge \iota_\xi u.$

If we perform the calculation by picking an orthonormal basis for $T_x^*$ of the form
$\{e_1, \dots, e_n\}$ with $\xi = |\xi| e_1$, we see that

(10.22)   $\sigma_\Delta(x, \xi) u = -|\xi|^2 u.$

In other words, in a local coordinate system, we have, for a k-form $u$,

(10.23)   $\Delta u = g^{j\ell}(x)\, \partial_j \partial_\ell u + Y_k u,$

where $Y_k$ is a first-order differential operator.

A differential operator $P : C^\infty(M, E_0) \to C^\infty(M, E_1)$ is said to be elliptic
provided $\sigma_P(x, \xi) : E_{0x} \to E_{1x}$ is invertible for each $x \in M$ and each $\xi \ne 0$. By
(10.22), the Laplace operator on k-forms is elliptic.

Of course, the definition $-\Delta = \delta d$ for the Laplace operator on 0-forms
coincides with the definition given in §4. In this regard, it is useful to note
explicitly the following result about $\delta$ on 1-forms. Let $X$ be a vector field and $\xi$
the 1-form corresponding to $X$ under a given metric:

(10.24)   $g(Y, X) = \langle Y, \xi\rangle.$

Then

(10.25)   $\delta \xi = -\operatorname{div} X.$

This identity is equivalent to (2.18) and the definition of $\delta$ as the formal adjoint
of $d$.

We end this section with some algebraic implications of the symbol formula
(10.21)–(10.22) for the Laplace operator. If we define $\wedge_\xi : \Lambda^k T_x^* \to \Lambda^{k+1} T_x^*$ by
$\wedge_\xi(\omega) = \xi \wedge \omega$, and define $\iota_\xi$ as above, by (10.14), then the content of this
calculation is

(10.26)   $\wedge_\xi \iota_\xi + \iota_\xi \wedge_\xi = |\xi|^2.$

As we have mentioned, this can be established by picking $\xi/|\xi|$ to be the first
member of an orthonormal basis of $T_x^*$. Extending the identity (10.26), we have

(10.27)   $\wedge_\xi \iota_\eta + \iota_\eta \wedge_\xi = \langle \xi, \eta\rangle,$

a result that follows from the formula (13.37) of Chap. 1. Note also the connection
with (2.26).
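The identities (10.26) and (10.27) are purely algebraic, so they can be tested by direct computation in an orthonormal coframe. The sketch below uses a representation of our own devising: a form is a dictionary mapping increasing index tuples to coefficients, and a 1-form is the list of its components. Integer coefficients are used so that the comparison is exact.

```python
import random
from itertools import combinations

def wedge(xi, form):
    """xi ^ form for a 1-form xi (component list) and a form {index tuple: coeff}."""
    out = {}
    for idx, c in form.items():
        for j, xj in enumerate(xi):
            if j in idx or xj == 0:
                continue
            # Moving e_j to its sorted position passes each smaller index once.
            sign = (-1) ** sum(1 for i in idx if i < j)
            key = tuple(sorted(idx + (j,)))
            out[key] = out.get(key, 0) + sign * xj * c
    return out

def interior(xi, form):
    """iota_xi(form): contraction with the vector dual to the 1-form xi."""
    out = {}
    for idx, c in form.items():
        for pos, j in enumerate(idx):
            if xi[j] == 0:
                continue
            key = idx[:pos] + idx[pos + 1:]
            out[key] = out.get(key, 0) + (-1) ** pos * xi[j] * c
    return out

def clean(form):
    """Drop zero coefficients so that dictionaries can be compared directly."""
    return {k: v for k, v in form.items() if v != 0}

def add(a, b):
    out = dict(a)
    for k, v in b.items():
        out[k] = out.get(k, 0) + v
    return out

def scale(t, form):
    return {k: t * v for k, v in form.items()}

def anticommutator(xi, eta, form):
    """(wedge_xi iota_eta + iota_eta wedge_xi) applied to form."""
    return clean(add(wedge(xi, interior(eta, form)), interior(eta, wedge(xi, form))))

def random_form(rng, n, k):
    """Random k-form on an n-dimensional space with small integer coefficients."""
    return {idx: rng.randint(-5, 5) for idx in combinations(range(n), k)}

def inner(xi, eta):
    return sum(a * b for a, b in zip(xi, eta))
```

For example, with `xi = [2, -1, 3, 1]` and any 2-form `w` produced by `random_form`, `anticommutator(xi, xi, w)` equals `clean(scale(15, w))`, since $|\xi|^2 = 15$; this is (10.26), which in turn is the statement (10.22) that $\sigma_\Delta(x,\xi)$ is scalar.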


Exercises 


1. Show that the adjoint of the exterior product operator $\xi \wedge$ is $\iota_\xi$, as asserted in (10.14).

2. If $\alpha = \sum \alpha_{jk}(x)\, dx_j \wedge dx_k$ and $\alpha_j{}^k = g^{k\ell} \alpha_{j\ell}$, relate $\delta\alpha$ to the divergence $\alpha_j{}^k{}_{;k}$, as
   defined in (3.29).

3. Using (10.20), write down an expression for

   $(\Delta u, v) - (u, \Delta v)$

   as a boundary integral, when $u$ and $v$ are k-forms.

4. Relate the characterization (10.3) of the inner product on $\Lambda^k T_x^*$ arising from an inner
   product on $T_x^*$, to that given in the following section, before (11.24).

5. Let $\omega \in \Lambda^n(M)$, $n = \dim M$, be the volume form of an oriented Riemannian manifold
   $M$. Show that $\delta\omega = 0$. (Hint: Compare (10.6) with the special case of Stokes’ formula
   $\int_M du = 0$ for $u \in \Lambda^{n-1}(M)$, compactly supported.)

6. Given the result of Exercise 5, show that Stokes’ formula $\int_M du = \int_{\partial M} u$, for $u \in \Lambda^{n-1}(M)$,
   follows from (10.16).

7. If $f \in C^\infty(M)$ and $u \in \Lambda^k(M)$, show that

   $\delta(fu) = f\, \delta u - \iota_{(df)} u.$

8. For a vector field $u$ on the Riemannian manifold $M$, let $\tilde{u}$ denote the associated 1-form.
   Show that

   $\delta(\tilde{u} \wedge \tilde{v}) = (\operatorname{div} v)\, \tilde{u} - (\operatorname{div} u)\, \tilde{v} - [u, v]^\sim,$

   for $\tilde{u}, \tilde{v} \in \Lambda^1(M)$. Reconsider this problem after reading Chap. 5, §8.


11. Maxwell’s equations 


The equations governing the electromagnetic field are one of the major triumphs
of theoretical physics. We list them here, for the electric field $E$ and the magnetic
field $B$, in a vacuum:

(11.1)   $\operatorname{div} B = 0,$

(11.2)   $\dfrac{\partial B}{\partial t} + \operatorname{curl} E = 0,$

(11.3)   $\operatorname{div} E = 4\pi\rho,$

(11.4)   $\dfrac{\partial E}{\partial t} - \operatorname{curl} B = -4\pi J.$

Here, $\rho$ is the charge density and $J$ the electric current. Units are chosen so that the
speed of light is 1. Here we are glossing over the distinction between two types
of electric field, typically denoted $E$ and $D$, and two types of magnetic field,
typically denoted $B$ and $H$, and their relation via “dielectric constants.” Material
on this may be found in texts on electromagnetism, such as [Ja].
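As a concrete check of (11.1)–(11.4), one can verify numerically that a simple plane wave solves the source-free system. The sketch below rests on assumptions of our own: the fields $E = (0, \cos(x - t), 0)$, $B = (0, 0, \cos(x - t))$, with $\rho = 0$ and $J = 0$, and central finite differences in place of exact derivatives. It returns the left sides of the four equations at a given spacetime point.

```python
import math

def E(t, x, y, z):
    """Electric field of an assumed plane wave travelling in the +x direction."""
    return (0.0, math.cos(x - t), 0.0)

def B(t, x, y, z):
    """Magnetic field of the same wave; unit speed of light, as in the text."""
    return (0.0, 0.0, math.cos(x - t))

def partial(field, comp, var, point, h=1e-4):
    """Central difference of component `comp` in variable `var` (0=t, 1=x, 2=y, 3=z)."""
    up, dn = list(point), list(point)
    up[var] += h
    dn[var] -= h
    return (field(*up)[comp] - field(*dn)[comp]) / (2 * h)

def curl(field, point):
    def d(comp, var):
        return partial(field, comp, var, point)
    # (d_y F_z - d_z F_y, d_z F_x - d_x F_z, d_x F_y - d_y F_x)
    return (d(2, 2) - d(1, 3), d(0, 3) - d(2, 1), d(1, 1) - d(0, 2))

def div(field, point):
    return sum(partial(field, c, c + 1, point) for c in range(3))

def maxwell_residuals(point):
    """Left sides of (11.1)-(11.4) with rho = 0 and J = 0 at a spacetime point."""
    dBdt = [partial(B, c, 0, point) for c in range(3)]
    dEdt = [partial(E, c, 0, point) for c in range(3)]
    cE, cB = curl(E, point), curl(B, point)
    return (
        div(B, point),
        tuple(dBdt[c] + cE[c] for c in range(3)),
        div(E, point),
        tuple(dEdt[c] - cB[c] for c in range(3)),
    )
```

All four residuals vanish up to rounding for this wave. Both fields depend on $(t, x)$ only through $x - t$, so the wave moves with speed 1, in line with the choice of units above.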

Of the four equations above, (11.1) and (11.3) have a relatively elementary
character. Equation (11.3), known as Gauss’ law, follows in the case of stationary
charges from the statement that a charge $e$ at a point $p \in \mathbb{R}^3$ produces an electric
field

$E(x) = e\, \dfrac{x - p}{|x - p|^3},$

which is Coulomb’s law. Equation (11.1) is the statement that there are no magnetic
charges. Both of these laws are well supported by experiments. We note
parenthetically that there is reason to believe that at high energies magnetic 
charges might exist. A theoretical framework for this is provided by a modifi- 
cation of the theory of the electromagnetic field, called the “electroweak theory.” 
But that is a story that we will not try to relate in this book. As one reference, we 
mention [IZ].

The equations (11.2) and (11.4) are more subtle. Equation (11.2), which
implies that a changing magnetic field produces an electric field, is called Faraday’s
law. One implication of (11.4) is that an electric current produces a magnetic
field; this is exploited in electric motors. The first quantitative expression of this
effect written down was

$\operatorname{curl} B = 4\pi J,$

which is valid when all quantities involved are independent of time. It breaks down
when variation with time is allowed. Indeed, the left side must have vanishing
divergence, but in the time-varying case one has, not $\operatorname{div} J = 0$, but rather the
following law of conservation of charge:

(11.5)   $\dfrac{\partial \rho}{\partial t} + \operatorname{div} J = 0.$

Maxwell produced the modification (11.4), which completed the set of equations
for the electromagnetic field.
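The compatibility of Maxwell's modification with conservation of charge can be written out in two lines. This short derivation is added here as a worked check; it uses only (11.3), (11.4), and the identity $\operatorname{div}\operatorname{curl} = 0$.

```latex
% Differentiate (11.3) in t, take the divergence of (11.4), and compare.
\begin{align*}
\operatorname{div}\frac{\partial E}{\partial t}
  &= 4\pi\,\frac{\partial \rho}{\partial t}
  && \text{by (11.3)}, \\
\operatorname{div}\frac{\partial E}{\partial t}
  - \operatorname{div}\operatorname{curl} B
  &= -4\pi \operatorname{div} J
  && \text{by (11.4)}, \\
\text{hence}\qquad
\frac{\partial \rho}{\partial t} + \operatorname{div} J &= 0
  && \text{since } \operatorname{div}\operatorname{curl} B = 0 .
\end{align*}
```

Without the term $\partial E/\partial t$ in (11.4), the same computation would force $\operatorname{div} J = 0$, which is the static constraint that fails in the time-varying case.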

Careful thought about the implications of Maxwell’s equations, together with
the experimental fact that two observers moving with respect to each other would
measure the speed of light to be the same, led to the development of Einstein’s
theory of relativity. We will not discuss how this was done. Rather, following
J. Wheeler, we will reverse the historical order. We will rewrite (11.1)–(11.4)
in an invariant fashion, depending only on the Lorentz metric $-dx_0^2 + dx_1^2 +
dx_2^2 + dx_3^2$ on Minkowski spacetime $\mathbb{R}^4$ rather than a particular Cartesian product
decomposition of $\mathbb{R}^4$ into time $\mathbb{R}$ and space $\mathbb{R}^3$. We can then show that, within
the relativistic framework, the subtle equations (11.2) and (11.4) actually follow
from the “simple” equations (11.1) and (11.3).

We bring in the 2-form (with $t = x_0$)

(11.6)   $F = \sum_{j=1}^{3} E_j\, dx_j \wedge dt + B_1\, dx_2 \wedge dx_3 + B_2\, dx_3 \wedge dx_1 + B_3\, dx_1 \wedge dx_2.$

In §19 of Chap. 1 it was shown how this form arises naturally in the relativistic
expression of how the electromagnetic field acts on a charged particle to make it
move. A calculation of the exterior derivative gives

(11.7)   $dF = \sum_{j=1}^{3} \Bigl(\dfrac{\partial B}{\partial t} + \operatorname{curl} E\Bigr)_j (*dx_j) \wedge dt + (\operatorname{div} B)\, dx_1 \wedge dx_2 \wedge dx_3,$

where, for $1 \le j \le 3$, we set

$*dx_j = dx_k \wedge dx_\ell, \quad (j, k, \ell)$ a cyclic permutation of $(1, 2, 3)$.


Consequently, (11.1) and (11.2) together are equivalent to the equation

(11.8)   $dF = 0.$

On the other hand, (11.1) alone is equivalent to the following. For fixed $T$, define
$\kappa_T : \mathbb{R}^3 \to \mathbb{R}^4$ by $\kappa_T(x') = (T, x')$. Then (11.1) holds at $t = T$ if and only if

(11.9)   $\kappa_T^*\, dF = 0.$

Now, in the relativistic set-up, any physical law that is valid on all surfaces $t =$
const. in $\mathbb{R}^4$ should be valid on all spacelike hyperplanes in $\mathbb{R}^4$. But the following
result is easy to establish.

Lemma 11.1. Let $0 \le k \le 3$, and suppose $\alpha \in \Lambda^k(\mathbb{R}^4)$ has the property that

(11.10)   $\kappa^* \alpha = 0,$

for every inclusion $\kappa : S \to \mathbb{R}^4$ of spacelike hyperplanes in $\mathbb{R}^4$. Then $\alpha = 0$.

Applying this to $\alpha = dF$, we see how (11.1) yields (11.2).
We will be able to rewrite (11.3)–(11.4) using the “adjoint” to $d$:

(11.11)   $d^\star : \Lambda^k(\mathbb{R}^4) \to \Lambda^{k-1}(\mathbb{R}^4),$

defined like $\delta = d^*$ in §10, but using an inner product coming from the Lorentz
metric. Thus, for compactly supported $u$,

(11.12)   $(du, v) = (u, d^\star v),$

for a $(k - 1)$-form $u$ and a k-form $v$, where the inner product of two k-forms $v_j$ is

(11.13)   $(v_1, v_2) = \int \langle v_1, v_2\rangle\, dx_0 \cdots dx_3,$

the integral of the pointwise inner product, characterized as follows.
A form $dx_{j_1} \wedge \cdots \wedge dx_{j_k}$ with distinct $j_\nu$’s has square norm $\epsilon_{j_1} \cdots \epsilon_{j_k}$, where
$\epsilon_0 = -1$, $\epsilon_1 = \epsilon_2 = \epsilon_3 = 1$. Two such forms are orthogonal unless their sets of
indices $\{j_1, \dots, j_k\}$ coincide. A straightforward calculation yields
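(11.14)   $d^\star \bigl(g_{k\ell}(x)\, dx_k \wedge dx_\ell\bigr) = -\sum_{i,j} \epsilon_j\, \epsilon(i, j; k, \ell)\, \dfrac{\partial g_{k\ell}}{\partial x_i}\, dx_j,$

where

(11.15)   $\epsilon(i, j; k, \ell) = \langle dx_i \wedge dx_j, dx_k \wedge dx_\ell\rangle$

is characterized above. This is 0 unless $\{i, j\} = \{k, \ell\}$, and we can rewrite
(11.14) as

(11.16)   $d^\star \bigl(g_{k\ell}(x)\, dx_k \wedge dx_\ell\bigr) = \epsilon(k, \ell; k, \ell) \Bigl[\epsilon_k \dfrac{\partial g_{k\ell}}{\partial x_\ell}\, dx_k - \epsilon_\ell \dfrac{\partial g_{k\ell}}{\partial x_k}\, dx_\ell\Bigr].$

This implies

(11.17)   $d^\star \sum_{j=1}^{3} E_j\, dx_j \wedge dx_0 = -(\operatorname{div} E)\, dx_0 - \sum_{j=1}^{3} \dfrac{\partial E_j}{\partial t}\, dx_j,$

and

(11.18)   $d^\star (B_1\, dx_2 \wedge dx_3 + B_2\, dx_3 \wedge dx_1 + B_3\, dx_1 \wedge dx_2) = \sum_{j=1}^{3} (\operatorname{curl} B)_j\, dx_j.$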
Thus (11.3) and (11.4) together are equivalent to the equation

(11.19)   $d^\star F = 4\pi \mathcal{J}^b,$

where

(11.20)   $\mathcal{J}^b = -\rho\, dt + \sum_{j=1}^{3} J_j\, dx_j.$

Thus $\mathcal{J}^b$ is the 1-form associated via the Lorentz metric to the vector

(11.21)   $\mathcal{J} = (\rho, J),$

called the charge-current 4-vector.
In this case, (11.3) alone is equivalent to the identity

(11.22)   $(d^\star F - 4\pi \mathcal{J}^b) \rfloor \dfrac{\partial}{\partial t} = 0.$

Again, in the relativistic set-up, such a physical law ought to be independent of
the choice of timelike vector field with which to take the interior product. Thus, if
we assume that $F$ has an invariant significance as a 2-form and also that $\mathcal{J}^b$ has
an invariant significance as a 1-form, we are in a position to apply the following.

Lemma 11.2. If $1 \le k \le 4$ and $\alpha \in \Lambda^k(\mathbb{R}^4)$ has the property that

(11.23)   $\alpha \rfloor V = 0$

for all timelike vectors $V$, then $\alpha = 0$.

Applying this to $\alpha = d^\star F - 4\pi \mathcal{J}^b$, we see how (11.3) yields (11.4).
The pair of Maxwell equations (11.8), (11.19) make sense on any Lorentz manifold
of dimension 4 and provide the appropriate equations for an electromagnetic
field in curved spacetime. To define $d^\star$, one uses the formula (11.12), replacing
$dx_0 \cdots dx_3$ by the natural volume element on a general Lorentz manifold $M$ in
(11.13).

This construction defines $d^\star$ for Lorentz manifolds of any dimension. The
inner product in the integrand in (11.13) can be characterized as follows. To the
Lorentz inner product on $V = T_x M$ corresponds an isomorphism $Q : V \to V'$
satisfying $Q^t = Q$ (with $V'' = V$). This induces isomorphisms

$\Lambda^k Q : \Lambda^k V \to \Lambda^k V' \approx (\Lambda^k V)',$

with the same symmetry property, yielding inner products on $\Lambda^k V$, $0 \le k \le
m = \dim M$. Equivalently, if you pick an “orthonormal” basis $\{v_0, \dots, v_{m-1}\}$
of $V$, satisfying $\langle v_0, v_0\rangle = -1$, $\langle v_j, v_j\rangle = 1$ for $1 \le j \le m - 1$, then the
characterization given after (11.13) is easily extended.

In analogy with (10.9), it is of interest to form the second-order operator

(11.24)   $-\Box = (d + d^\star)^2 = d d^\star + d^\star d.$

A calculation similar to (10.23) gives

(11.25)   $\Box u = h^{j\ell}(x)\, \partial_j \partial_\ell u + Y_k u,$

for a k-form $u$, where $(h^{j\ell})$ is formed from the Lorentz metric tensor, as in (7.7),
and $Y_k$ is a first-order differential operator. On 0-forms, this operator is exactly
(7.7). For Minkowski spacetime $\mathbb{R}^4$, $\Box$ is just $-\partial^2/\partial x_0^2 + \sum_{j=1}^{3} \partial^2/\partial x_j^2$, acting
on each component of a k-form.

The equations $dF = 0$, $d^\star F = 4\pi \mathcal{J}^b$ imply that $F$ satisfies the “wave equation”

(11.26)   $\Box F = -4\pi\, d\mathcal{J}^b.$

The results developed in §8 for scalar hyperbolic operators of the type (8.2) are
easily extended to cover the operator $\Box$ constructed here, which by (11.25) has
scalar principal part.

In particular, finite propagation speed arguments apply to solutions to
Maxwell’s equations. Existence of solutions, including propagation of electromagnetic
waves in regions bounded by perfect conductors, is studied in Chap. 6.

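The energy in an electromagnetic field in $\mathbb{R}^4 = \mathbb{R} \times \mathbb{R}^3$ is

(11.27)   $V(t) = \dfrac{1}{8\pi} \int_{\mathbb{R}^3} \bigl[|E(t, x)|^2 + |B(t, x)|^2\bigr]\, dx.$

If (11.1)–(11.4) hold, then

(11.28)   $4\pi \dfrac{dV}{dt} = \Bigl(\dfrac{\partial E}{\partial t}, E\Bigr) + \Bigl(\dfrac{\partial B}{\partial t}, B\Bigr) = (\operatorname{curl} B, E) - (\operatorname{curl} E, B) - 4\pi (J, E).$

If $E(t, x)$ and $B(t, x)$ decrease sufficiently rapidly as $|x| \to \infty$, we have

(11.29)   $(\operatorname{curl} B, E) = (B, \operatorname{curl} E),$

as can be established by integration by parts. Hence

(11.30)   $\dfrac{dV}{dt} = -\int_{\mathbb{R}^3} J(t, x) \cdot E(t, x)\, dx.$

In particular, for $J = 0$ we have conservation of $V(t)$.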

One can construct a stress-energy tensor 7 due to the electromagnetic field, by 
an argument similar to that of §7. First note that, with F given by (11.6), we have 


(11.31) Fy ele 
Equivalently, 
(11.32) Tr F? = 2(|E|? — |B|?), 


where F is the tensor field of type (1, 1) associated to F. Note also that (F?) - = 
|E|?. Thus a natural construction of T giving rise to Joo = (1/87) (|E|?+|B|?) is 


ae 1 7-2 1 72 _ 
(11.33) TS -—_(F — 7(F )1) = 


: Pod) 


1 
An 
where 7 is the tensor field of type (1,1) associated with 7. In index notation, 
1 m 1 mn 
(11.34) T= ay Fim = gliiF Fadel 


where (h; 3) is the Lorentz metric tensor. In this case, in analogy with (7.10), one 
obtains 


(11.35) THR = —FI,F*, 


provided the Maxwell equations (11.8) and (11.19) hold. Equivalently, with T 
denoting the tensor field of type (2, 0) associated with T, 


(11.36)   $\operatorname{div} T = -\widetilde{F} \mathcal{J}.$

If the electromagnetic field $F$ is defined on a Lorentz 4-manifold which is
simply connected, the equation $dF = 0$ implies the existence of a 1-form $A$ such
that $F = dA$. We can define the Lagrangian

(11.37)   $L = -\dfrac{1}{8\pi} \langle F, F\rangle,$

with inner product as in (11.31). The action integral $I(A) = \int L\, dV$ satisfies, for
a compactly supported 1-form $\beta$,

(11.38)   $\dfrac{d}{d\tau} I(A + \tau\beta)\Big|_{\tau=0} = -\dfrac{1}{4\pi} \int \langle d\beta, dA\rangle\, dV = -\dfrac{1}{4\pi} \int \langle \beta, d^\star dA\rangle\, dV,$

so the stationary condition $\delta \int L\, dV = 0$ is equivalent to $d^\star dA = 0$, that
is, to the rest of Maxwell’s equations (11.19), in case $\mathcal{J} = 0$. Thus (11.37)
is the appropriate Lagrangian for the electromagnetic field, in order to produce
Maxwell’s equations in empty space. If the current $\mathcal{J}$ is given (subject to the
condition $d^\star \mathcal{J}^b = 0$), and $F = dA$, then (11.19) is the stationary condition
$\delta \int L\, dV = 0$ for the Lagrangian

(11.39)   $L = -\dfrac{1}{8\pi} \langle F, F\rangle + \langle A, \mathcal{J}\rangle.$

In typical problems the current is not given in advance, but is itself influenced
by the electromagnetic force. The nature of the influence involves the masses of
the substances that carry charges, whose motion produces the current. Then the
Maxwell equations are coupled to other equations, which are often nonlinear. We
describe a model for one example.

Suppose we have a diffuse cloud of electrons, in otherwise empty space. We
model this as a continuous charged substance, whose motion is described by a
4-velocity vector field $u$, satisfying $\langle u, u\rangle = -1$, yielding a current $\mathcal{J} = \sigma u$,
where $\sigma\, dV$ is the charge density, measured by an observer whose velocity is $u$.
Taking a cue from the Lagrangian (19.20) of Chap. 1, derived to reflect the relativistic
Lorentz force law, we use the Lagrangian

(11.40)   $L = -\dfrac{1}{8\pi} \langle F, F\rangle + \langle A, \mathcal{J}\rangle + \dfrac{1}{2} \mu \langle u, u\rangle = L_1 + L_2 + L_3,$

where $\mu\, dV$ is the mass density, measured by an observer whose velocity is $u$. We
are assuming that only one type of matter is present, so $\sigma$ is a constant multiple
of $\mu$. In more general cases there would be additional terms in the Lagrangian.
We look at $I(A, u) = I_1 + I_2 + I_3$. The term $I_3$ is independent of $A$, and as
above we have

(11.41)   $\dfrac{d}{d\tau} I(A + \tau\beta, u)\Big|_{\tau=0} = \int \Bigl[-\dfrac{1}{4\pi} \langle \beta, d^\star dA\rangle + \langle \beta, \mathcal{J}\rangle\Bigr]\, dV.$

The stationary condition this yields is again the Maxwell equation (11.19). Next
we compute $(\partial/\partial\tau) I(A, u(\tau))\big|_{\tau=0}$, where $u(\tau)$ is a one-parameter family of
velocity fields on $M$, obtained by varying the electron trajectories. There is no
variation in $I_1$, so we need to consider $I_2$ and $I_3$.

We first treat the variation of $I_3$, in a manner parallel to the calculations
(11.17)–(11.26) in Chap. 1, leading to the geodesic equations. To do this, we
parameterize the electron trajectories by $X : \Omega \times I \to M$, $X(y, s) = x$, $u =
\partial_s X$. We suppose the mass density is constant in $(y, s)$-coordinates, say $m$,
so $m\, dy\, ds = \mu\, dV$. Since $u = \partial/\partial s$ in $(y, s)$-coordinates, this implies
$\mathcal{L}_u(\mu\, dV) = 0$, or

(11.42)   $\operatorname{div}(\mu u) = 0,$

where div is computed using the Lorentz metric on $M$. Our hypothesis amounts
to the law of conservation of matter. If we vary this map, using $X(y, s, \tau)$, then

(11.43)   $\dfrac{d}{d\tau} \int \dfrac{1}{2} \mu \langle u, u\rangle\, dV = \int \dfrac{1}{2} m\, \mathcal{L}_w \langle u, u\rangle\, dy\, ds = \int m \langle \nabla_w u, u\rangle\, dy\, ds,$

where $\partial_\tau X = w$. Using $[\partial_s, \partial_\tau] = 0$, convert this last integral to

(11.44)   $-\int m \langle w, \nabla_u u\rangle\, dy\, ds + m \int \mathcal{L}_u \langle w, u\rangle\, dy\, ds.$

The last integral here vanishes for a compactly supported perturbation, by the
fundamental theorem of calculus, so

(11.45)   $\dfrac{d}{d\tau} I_3(A, u(\tau))\Big|_{\tau=0} = -\int m \langle w, \nabla_u u\rangle\, dy\, ds = -\int \langle w, \mu \nabla_u u\rangle\, dV.$


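We now treat the variation of $I_2$, also using $(y, s)$-coordinates. Since $\sigma$ is a 
constant multiple of $\mu$, we have $\sigma\, dV = e\, dy\, ds$ for some constant $e$, and, parallel 
to (11.42), we have conservation of electric charge, $\operatorname{div}(\sigma u) = 0$ (i.e., $\operatorname{div} \mathcal{J} = 0$), 
which is equivalent to (11.5) when $M$ is Minkowski space. We have 

(11.46)  $\frac{d}{d\tau}\int \langle A, \mathcal{J}\rangle\, dV = \int e\, \mathcal{L}_w\langle A, u\rangle\, dy\, ds.$

We use the identity $\mathcal{L}_w A = dA\rfloor w + d(A\rfloor w)$ to write 

(11.47)  $\mathcal{L}_w\langle A, u\rangle = -(dA)(u, w) + \langle \mathcal{L}_u A, w\rangle = -(dA)(u, w) + \mathcal{L}_u\langle A, w\rangle - \langle A, \mathcal{L}_u w\rangle.$

Since $dA = F$, $[\partial_s, \partial_\tau] = 0$, and $\mathcal{L}_u\langle A, w\rangle$ integrates to zero, we have 

(11.48)  $\frac{d}{d\tau}\int \langle A, \mathcal{J}\rangle\, dV = \int e\langle Fu, w\rangle\, dy\, ds = \int \langle F\mathcal{J}, w\rangle\, dV.$

Together with (11.45), this gives 

(11.49)  $\frac{\partial}{\partial\tau} I(A, u(\tau))\Big|_{\tau=0} = -\int \langle \mu\nabla_u u - F\mathcal{J}, w\rangle\, dV.$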

Thus the stationary condition for variation of $u$ is 

(11.50)  $\mu\nabla_u u - F\mathcal{J} = 0$  or, equivalently,  $\nabla_u u - \frac{e}{m} Fu = 0,$


which is the Lorentz force law in this context. 
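
The force law (11.50) can be explored numerically in the simplest setting. The following is a minimal sketch, assuming Minkowski space with $c = 1$, signature $(-,+,+,+)$, constant fields $E$ and $B$, and one particular convention for the matrix of the type $(1,1)$ tensor $F$ in terms of $E$ and $B$ (this matrix, the helper names, and the numerical values are illustrative assumptions, not taken from the text). It integrates $du/d\tau = (e/m)Fu$ with a fourth-order Runge-Kutta step and checks that $\langle u, u\rangle = -1$ is preserved, as it must be since $\langle Fu, u\rangle = 0$.

```python
import numpy as np

def field_matrix(E, B):
    """Matrix of the mixed tensor F (assumed convention, c = 1):
    time row gives E.u, space rows give E*u^0 + (u x B)."""
    E1, E2, E3 = E
    B1, B2, B3 = B
    return np.array([
        [0.0, E1,  E2,  E3],
        [E1,  0.0, B3, -B2],
        [E2, -B3,  0.0, B1],
        [E3,  B2, -B1,  0.0],
    ])

def minkowski(u, v):
    """Lorentz inner product with signature (-,+,+,+)."""
    return -u[0] * v[0] + np.dot(u[1:], v[1:])

def integrate_velocity(u0, F, e_over_m, dtau, steps):
    """Classical RK4 for the linear system du/dtau = (e/m) F u."""
    def rhs(u):
        return e_over_m * (F @ u)
    u = np.array(u0, dtype=float)
    for _ in range(steps):
        k1 = rhs(u)
        k2 = rhs(u + 0.5 * dtau * k1)
        k3 = rhs(u + 0.5 * dtau * k2)
        k4 = rhs(u + dtau * k3)
        u = u + (dtau / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return u

# Initial 4-velocity (gamma, gamma*v) for speed 0.6 along the x-axis.
gamma = 1.0 / np.sqrt(1.0 - 0.6**2)
u_start = np.array([gamma, 0.6 * gamma, 0.0, 0.0])

F = field_matrix(E=(0.0, 0.2, 0.0), B=(0.0, 0.0, 1.0))
u_end = integrate_velocity(u_start, F, e_over_m=1.0, dtau=0.01, steps=1000)

print(minkowski(u_start, u_start), minkowski(u_end, u_end))
```

Both printed values should be close to $-1$; a different sign convention for $F$ would change the direction of the motion but not this invariance.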

It is useful to consider what the stress-energy tensor should be when we have 
the Lagrangian (11.40). It is reasonable to take it to be the sum of the stress-energy 
tensor $T_e$ for the electromagnetic field, given by (11.34), and a stress-energy 
tensor $T_m$, associated with the “matter field.” If we want $T_m(Z, Z)\, dV$ to 
be the mass-energy density of the electrons observed by one moving with velocity 
$Z$, then it is natural to set 

(11.51)  $T_m = \mu\, u \otimes u$

(i.e., $T_m^{jk} = \mu u^j u^k$). Then the total stress-energy tensor is given by 

(11.52)  $T^{jk} = \frac{1}{4\pi}\Bigl(F^{j\ell}F^{k}{}_{\ell} - \frac{1}{4}\eta^{jk}F^{i\ell}F_{i\ell}\Bigr) + \mu u^j u^k.$

The divergence of $T_e$ is given by (11.36), provided the Maxwell equation (11.19) 
holds. Furthermore, $(\mu u^j u^k)_{;k} = (\mu u^k)_{;k}u^j + \mu u^k u^j{}_{;k}$, so 

(11.53)  $\operatorname{div} T_m = \operatorname{div}(\mu u)\, u + \mu\nabla_u u.$

Thus, for $T = T_e + T_m$, we have (granted (11.19)) 

(11.54)  $\operatorname{div} T = \operatorname{div}(\mu u)\, u + \mu\nabla_u u - F\mathcal{J}.$

We have the conservation law $\operatorname{div} T = 0$ for a solution to the coupled Maxwell-Lorentz 
equations. Indeed, the vanishing of the first term on the right side of 
(11.54) is equivalent to the matter conservation law (11.42), and the vanishing of 
the sum of the other terms on the right side of (11.54) is equivalent to the Lorentz 
force law (11.50). 


Exercises 


1. Demonstrate Lemmas 11.1 and 11.2. 
2. Verify the calculations (11.14)-(11.18). 
3. Show that the inner product of forms defined after (11.13) depends only on the Lorentz 
metric on $\mathbb{R}^4$, not on the coordinate representation. 
4. Show that div curl = 0 is a special case of $dd = 0$. 
5. Show that (11.3)-(11.4) imply the “conservation law” (11.5). 
(Hint: Apply $\partial/\partial t$ to (11.3) and div to (11.4); use div curl = 0.) 
Show that (11.5) is equivalent to $d^*\mathcal{J}^b = 0$. 
6. Verify the identity (11.29), for any compactly supported vector fields $E(x)$ and $B(x)$ 
on $\mathbb{R}^3$. 
7. Prove the conservation law (11.36), as a consequence of Maxwell’s equations. 
8. Show that the identity $dF = 0$ is equivalent to 
$F_{jk;\ell} + F_{k\ell;j} + F_{\ell j;k} = 0.$
9. Show that the identity $d^*F = 4\pi\mathcal{J}^b$ is equivalent to 
$F^{jk}{}_{;k} = 4\pi J^j.$
10. The equation $dF = 0$ on $\mathbb{R}^4$ implies $F = dA$ for some 1-form $A$ on $\mathbb{R}^4$. $A$ is not 
unique, as any 1-form $du$ can be added. Show that $A$ can be picked to satisfy $d^*A = 0$ 
and that, for such $A$, 
$\Box A = -4\pi\mathcal{J}^b.$
(Hint: Set up a PDE for $u$. Look for the relevant existence theorem in Chap. 3.) 
11. The calculation (11.31) of $\langle F, F\rangle$ shows that $|B|^2 - |E|^2$ is Lorentz invariant. Calculate 
$F \wedge F$ and show that $E \cdot B$ is also Lorentz invariant. 
12. Think about the fact that the tensor $T$ given by (11.33) is trace-free, i.e., $\operatorname{Tr} T = 0$. 
What is the trace of the stress-energy tensor defined by (7.5) or, equivalently (7.11)? 
13. As mentioned in Exercise 5 in §19, Chap. 1, a sign change in the Lorentz metric, from 
one of signature $(-,+,+,+)$ to one of signature $(+,-,-,-)$ (which some people 
prefer), leads to a sign change in the formula for the 2-form $F$ (though no change in 
the tensor field $F$ of type (1, 1)). Show that it leads to a sign change in the formula 
(11.34) for the stress-energy tensor of the electromagnetic field. 
What sign changes arise in the formula (11.40) for the Lagrangian of an electromagnetic 
field coupled to charged matter? 


Fourier Analysis, Distributions, 
and Constant-Coefficient Linear PDE 


Introduction 


Fourier analysis is perhaps the most important single tool in the study of linear 
partial differential equations. It serves in several ways, the most basic, and historically 
the first, being to give specific formulas for solutions to various linear PDE 
with constant coefficients, particularly the three classics, the Laplace, wave, and 
heat equations: 

(0.1)  $\Delta u = f, \qquad \frac{\partial^2 u}{\partial t^2} - \Delta u = f, \qquad \frac{\partial u}{\partial t} - \Delta u = f,$

with $\Delta = \partial^2/\partial x_1^2 + \cdots + \partial^2/\partial x_n^2$. The Fourier transform accomplishes this by 
transforming the operation of $\partial/\partial x_j$ to the algebraic operation of multiplication 
by $i\xi_j$. Thus the equations (0.1) are transformed to algebraic equations and to 
ODE with parameters. 
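
The passage from differentiation to multiplication by $i\xi$ can be seen concretely in a periodic, discretized setting. The sketch below is an illustration under stated assumptions, not a construction from the text: it uses NumPy's FFT on $2\pi$-periodic samples, and, to avoid dividing by zero at $\xi = 0$, it solves $u - u'' = g$ rather than $\Delta u = f$. The function names are hypothetical.

```python
import numpy as np

def wavenumbers(n):
    """Integer frequencies xi, in the ordering used by numpy's FFT."""
    return np.fft.fftfreq(n, d=1.0 / n)

def spectral_derivative(samples):
    """Differentiate a 2*pi-periodic function: multiply coefficients by i*xi."""
    xi = wavenumbers(samples.size)
    return np.real(np.fft.ifft(1j * xi * np.fft.fft(samples)))

def solve_one_minus_laplacian(g_samples):
    """Solve u - u'' = g on the circle; on the Fourier side (1 + xi^2) u_hat = g_hat."""
    xi = wavenumbers(g_samples.size)
    return np.real(np.fft.ifft(np.fft.fft(g_samples) / (1.0 + xi**2)))

n = 64
x = 2.0 * np.pi * np.arange(n) / n
u_exact = np.exp(np.sin(x))
du_exact = np.cos(x) * u_exact
# For u = exp(sin x): u'' = (cos^2 x - sin x) u, so u - u'' = (1 - cos^2 x + sin x) u.
g = (1.0 - np.cos(x) ** 2 + np.sin(x)) * u_exact

derivative_error = np.max(np.abs(spectral_derivative(u_exact) - du_exact))
solve_error = np.max(np.abs(solve_one_minus_laplacian(g) - u_exact))
print(derivative_error, solve_error)
```

For a smooth periodic function such as this one, both errors should be near machine precision, reflecting the rapid decay of its Fourier coefficients.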

Before introducing the Fourier transform of functions on Euclidean space $\mathbb{R}^n$, 
we discuss the Fourier series associated to functions on the torus $\mathbb{T}^n$ in §1. Methods 
developed to establish the Fourier inversion formula for Fourier series, in the 
special case of the circle $S^1 = \mathbb{T}^1$, provide for free a development of the basic 
results on harmonic functions in the plane, and we give such results in §2, noting 
that these results specialize further to yield standard basic results in the theory of 
holomorphic functions of one complex variable, such as power-series expansions 
and Cauchy’s integral formula. 

In §3 we define the Fourier transform of functions on $\mathbb{R}^n$ and prove the Fourier 
inversion formula. The proof shares with the argument for Fourier series in §1 the 
property of simultaneously yielding explicit solutions to a PDE, this time the heat 
equation. 

It turns out that representations of solutions to such PDE as listed in (0.1) 
are most naturally done in terms of objects more general than functions, called 
distributions. We develop the theory of distributions in §4. Fourier analysis works 
very naturally with the class of distributions known as tempered distributions. 

Section 5, in some sense the heart of this chapter, derives explicit solutions to 
the classical linear PDE (0.1) via Fourier analysis. The use of Fourier analysis 
and distribution theory to represent solutions to these PDE gives rise to numerous 
interesting identities, involving both elementary functions and “special functions,” 
such as the gamma function and Bessel functions, and we present some of these 
identities here, only a smattering from a rich area of classical analysis. Further 
development of harmonic analysis in Chap. 8 will bring in additional studies of 
special functions. 

Fourier analysis and distribution theory are also useful tools for general inves- 
tigations of linear PDE, in cases where explicit formulas might not be obtainable. 
We illustrate a couple of cases of this in the present chapter, discussing the exis- 
tence and behavior of “parametrices” for elliptic PDE with constant coefficients, 
and applications to smoothness of solutions to such PDE, in §9 and proving local 
solvability of general linear PDE with constant coefficients in §10. Fourier analy- 
sis and distribution theory will acquire further power in the next chapter as tools 
for investigations of existence and qualitative properties of solutions to various 
classes of PDE, with the development of Sobolev spaces. 

Sections 11 and 12 deal with the discrete Fourier transform, particularly with 
Fourier analysis on finite cyclic groups. We study this both as an approximation to 
Fourier analysis on the torus and Euclidean space, sometimes useful for numerical 
work, and as a subject with its intrinsic interest, and with implications for num- 
ber theory. In §12 we give a brief description of “fast” algorithms for computing 
discrete Fourier transforms. 

We end this chapter with some appendices. The first discusses the Gaussian 
and relates it to the development of basic results about the Euler gamma func- 
tion. The second takes a distributional approach to the central limit theorem of 
probability theory, germane to the treatment of Brownian motion in Chapter 11. 
The third shows that weak* convergence of measures often leads to apparently 
stronger convergence results, of interest in various formulations of the central 
limit theorem. 


1. Fourier series 


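Let $f$ be an integrable function on the torus $\mathbb{T}^n$, naturally isomorphic to $\mathbb{R}^n/\mathbb{Z}^n$ 
and to the Cartesian product of $n$ copies $S^1 \times \cdots \times S^1$ of the circle. Its Fourier 
series is by definition a function on $\mathbb{Z}^n$ given by 

(1.1)  $\hat f(k) = (2\pi)^{-n}\int_{\mathbb{T}^n} f(\theta)\, e^{-ik\cdot\theta}\, d\theta,$

where $k = (k_1, \dots, k_n)$, $k\cdot\theta = k_1\theta_1 + \cdots + k_n\theta_n$. We use the notation 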
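
(1.2)  $\mathcal{F}f(k) = \hat f(k).$

Clearly, we have a continuous linear map 

(1.3)  $\mathcal{F} : L^1(\mathbb{T}^n) \longrightarrow \ell^\infty(\mathbb{Z}^n),$

where $\ell^\infty(\mathbb{Z}^n)$ denotes the space of bounded functions on $\mathbb{Z}^n$, with the sup norm. 
If $f \in C^\infty(\mathbb{T}^n)$, then we can integrate by parts to get 

(1.4)  $k^\alpha \hat f(k) = (2\pi)^{-n}\int_{\mathbb{T}^n} (D^\alpha f)(\theta)\, e^{-ik\cdot\theta}\, d\theta,$

where $k^\alpha = k_1^{\alpha_1}\cdots k_n^{\alpha_n}$, and 

(1.5)  $D^\alpha = D_1^{\alpha_1}\cdots D_n^{\alpha_n}, \qquad D_j = \frac{1}{i}\frac{\partial}{\partial\theta_j}.$

It follows easily that 

(1.6)  $\mathcal{F} : C^\infty(\mathbb{T}^n) \longrightarrow s(\mathbb{Z}^n),$

where $s(\mathbb{Z}^n)$ consists of functions $u$ on $\mathbb{Z}^n$ which are rapidly decreasing, in the 
sense that, for each $N$, 

(1.7)  $p_N(u) = \sup_{k\in\mathbb{Z}^n} \langle k\rangle^N |u(k)| < \infty.$

Here, we use the notation 

$\langle k\rangle = (1 + |k|^2)^{1/2},$

where $|k|^2 = k_1^2 + \cdots + k_n^2$. If we use the inner product 

(1.8)  $(f, g) = (f, g)_{L^2} = (2\pi)^{-n}\int_{\mathbb{T}^n} f(\theta)\overline{g(\theta)}\, d\theta,$

for $f, g \in C^\infty(\mathbb{T}^n)$, or more generally for $f, g \in L^2(\mathbb{T}^n)$, and if on $s(\mathbb{Z}^n)$, or 
more generally on $\ell^2(\mathbb{Z}^n)$, the space of square summable functions on $\mathbb{Z}^n$, we use 
the inner product 

(1.9)  $(u, v) = (u, v)_{\ell^2} = \sum_{k\in\mathbb{Z}^n} u(k)\overline{v(k)},$

we have the formula 

(1.10)  $(\mathcal{F}f, u)_{\ell^2} = (f, \mathcal{F}^*u)_{L^2},$

valid for $f \in C^\infty(\mathbb{T}^n)$, $u \in s(\mathbb{Z}^n)$, where 

(1.11)  $\mathcal{F}^* : s(\mathbb{Z}^n) \longrightarrow C^\infty(\mathbb{T}^n)$

is given by 

(1.12)  $(\mathcal{F}^*u)(\theta) = \sum_{k\in\mathbb{Z}^n} u(k)\, e^{ik\cdot\theta}.$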

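Another identity that we will find useful is 

(1.13)  $(2\pi)^{-n}\int_{\mathbb{T}^n} e^{ik\cdot\theta}\, e^{-i\ell\cdot\theta}\, d\theta = \delta_{k\ell},$

where $\delta_{k\ell} = 1$ if $k = \ell$ and $\delta_{k\ell} = 0$ otherwise. 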
Our main goal here is to establish the Fourier inversion formula 

(1.14)  $f(\theta) = \sum_{k\in\mathbb{Z}^n} \hat f(k)\, e^{ik\cdot\theta},$

the sum on the right in (1.14) converging in the appropriate function space, 
depending on the nature of $f$. Let us single out another space of functions on 
$\mathbb{T}^n$, the trigonometric polynomials: 

(1.15)  $TP = \Bigl\{\sum_{k\in\mathbb{Z}^n} a(k)\, e^{ik\cdot\theta} : a(k) = 0 \text{ except for finitely many } k\Bigr\}.$

Clearly, 

(1.16)  $\mathcal{F} : TP \longrightarrow c_{00}(\mathbb{Z}^n),$

where $c_{00}(\mathbb{Z}^n)$ consists of functions on $\mathbb{Z}^n$ which vanish except at a finite number 
of points; this follows from (1.13). The formula (1.12) gives 

(1.17)  $\mathcal{F}^* : c_{00}(\mathbb{Z}^n) \longrightarrow TP,$

and the formula (1.13) easily yields 

(1.18)  $\mathcal{F}\mathcal{F}^* = I$ on $c_{00}(\mathbb{Z}^n)$, 

and even 

(1.19)  $\mathcal{F}\mathcal{F}^* = I$ on $s(\mathbb{Z}^n)$. 

By comparison, the inversion formula (1.14) states 

(1.20)  $\mathcal{F}^*\mathcal{F} = I,$


on $C^\infty(\mathbb{T}^n)$, or some other space of functions on $\mathbb{T}^n$, as specified below. Before 
getting to this, let us note one other implication of (1.13), namely, if 

(1.21)  $f_j(\theta) = \sum_k \varphi_j(k)\, e^{ik\cdot\theta}$

are elements of $TP$, or more generally, if $\varphi_j \in s(\mathbb{Z}^n)$, then we have the Parseval 
identity 

(1.22)  $(f_1, f_2)_{L^2} = \sum_{k\in\mathbb{Z}^n} \varphi_1(k)\overline{\varphi_2(k)};$

in particular, the Plancherel identity 

(1.23)  $\|f_j\|_{L^2}^2 = \sum_{k\in\mathbb{Z}^n} |\varphi_j(k)|^2,$

for $f_j \in TP$, or more generally for any $f_j$ of the form (1.21) with $\varphi_j \in s(\mathbb{Z}^n)$. 
In particular, the map $\mathcal{F}^*$ given by (1.12), and satisfying (1.11) and (1.17), has a 
unique continuous extension to $\ell^2(\mathbb{Z}^n)$, and 

(1.24)  $\mathcal{F}^* : \ell^2(\mathbb{Z}^n) \longrightarrow L^2(\mathbb{T}^n)$

is an isometry of $\ell^2(\mathbb{Z}^n)$ onto its range. Part of the inversion formula will be that 
the map (1.24) is also surjective. 
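
The identities (1.13) and (1.23) are easy to test numerically on a trigonometric polynomial in one variable. The following sketch rests on one assumption worth stating: integrals over the circle are replaced by averages over $N$ equispaced points, which is exact for trigonometric polynomials of degree less than $N$. The coefficients are random and purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Coefficients phi(k) for |k| <= K define a trigonometric polynomial, as in (1.21).
K = 5
ks = np.arange(-K, K + 1)
phi = rng.normal(size=ks.size) + 1j * rng.normal(size=ks.size)

# Sample f = F* phi on an equispaced grid, following (1.12).
N = 64
theta = 2.0 * np.pi * np.arange(N) / N
f = (phi[None, :] * np.exp(1j * np.outer(theta, ks))).sum(axis=1)

# (2 pi)^{-1} times the integral of |f|^2, computed as a grid average.
norm_squared = np.mean(np.abs(f) ** 2)

# Fourier coefficients via (1.1), again as grid averages.
f_hat = np.array([np.mean(f * np.exp(-1j * k * theta)) for k in ks])

plancherel_gap = abs(norm_squared - np.sum(np.abs(phi) ** 2))
recovery_gap = np.max(np.abs(f_hat - phi))
print(plancherel_gap, recovery_gap)
```

The second gap checks that the coefficients of $\mathcal{F}^*\varphi$ are again $\varphi$, which is the content of (1.18) in this finite setting.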

Let us note that if $f_j \in TP$, satisfying (1.21), then (1.13) implies $\hat f_j(k) = 
\varphi_j(k)$, so we have directly in this case: 

(1.25)  $\mathcal{F}^*\mathcal{F} = I$ on $TP$. 

One approach to more general inversion formulas would be to establish that $TP$ 
is dense in various function spaces, on which $\mathcal{F}^*\mathcal{F}$ can be shown to act continuously. 
For more details on this approach, see the exercises at the end of §1 and 
§2 in the Functional Analysis appendix. Here, we will take a superficially different 
approach. We will make use of such basic results from real analysis as the 
denseness of $C(\mathbb{T}^n)$ in $L^p(\mathbb{T}^n)$, for $1 \le p < \infty$. 

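Our approach to (1.14) will be to establish the following Abel summability 
result. Consider 

(1.26)  $J_r f(\theta) = \sum_{k\in\mathbb{Z}^n} \hat f(k)\, r^{|k|}\, e^{ik\cdot\theta},$

where $|k| = |k_1| + \cdots + |k_n|$, $r \in [0, 1)$. We will show that 

(1.27)  $J_r f \to f$, as $r \nearrow 1$, 

in the appropriate spaces. The operator $J_r$ in (1.26) is defined for any $f \in L^1(\mathbb{T}^n)$, 
if $r < 1$, and we have the formula 

(1.28)  $J_r f(\theta) = (2\pi)^{-n}\int_{\mathbb{T}^n} f(\theta') \sum_{k\in\mathbb{Z}^n} r^{|k|}\, e^{ik\cdot(\theta-\theta')}\, d\theta'.$

The sum over $\mathbb{Z}^n$ inside the integral can be written as 

(1.29)  $\sum_{k\in\mathbb{Z}^n} r^{|k|}\, e^{ik\cdot(\theta-\theta')} = P_n(r, \theta-\theta') = p(r, \theta_1-\theta_1')\cdots p(r, \theta_n-\theta_n'),$

where 

(1.30)  $p(r, \theta) = \sum_{k=-\infty}^{\infty} r^{|k|}\, e^{ik\theta} = 1 + \sum_{k=1}^{\infty}\bigl(r^k e^{ik\theta} + r^k e^{-ik\theta}\bigr) = \frac{1 - r^2}{1 - 2r\cos\theta + r^2}.$

Then we have the explicit integral formula 

(1.31)  $J_r f(\theta) = (2\pi)^{-n}\int_{\mathbb{T}^n} f(\theta')\, P_n(r, \theta-\theta')\, d\theta' = (2\pi)^{-n}\int_{\mathbb{T}^n} f(\theta-\theta')\, P_n(r, \theta')\, d\theta'.$
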
Let us examine $p(r, \theta)$. It is clear that the numerator and denominator on the 
right side of (1.30) are positive, so $p(r, \theta) > 0$ for each $r \in [0, 1)$, $\theta \in S^1$. Of 
course, as $r \nearrow 1$, the numerator tends to 0; as $r \nearrow 1$, the denominator tends to a 
nonzero limit, except at $\theta = 0$. Since it is clear that 

(1.32)  $(2\pi)^{-1}\int_{S^1} p(r, \theta)\, d\theta = (2\pi)^{-1}\int_{-\pi}^{\pi} \sum_k r^{|k|}\, e^{ik\theta}\, d\theta = 1,$

we see that, for $r$ close to 1, $p(r, \theta)$ as a function of $\theta$ is highly peaked near $\theta = 0$ 
and small elsewhere, as in Fig. 1.1. 

FIGURE 1.1 Poisson Kernel (graph of $y = p(r, \theta)$ for $r = 0.8$ on $-\pi \le \theta \le \pi$, with peak value $(1+r)/(1-r)$ at $\theta = 0$) 
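
As a numerical sanity check on (1.30) and (1.32), the sketch below compares a truncated version of the series with the closed form, and averages the kernel over the circle. The truncation level, the grid size, and the function names are choices made for illustration only.

```python
import numpy as np

def poisson_kernel(r, theta):
    """Closed form on the right side of (1.30)."""
    return (1.0 - r**2) / (1.0 - 2.0 * r * np.cos(theta) + r**2)

def poisson_series(r, theta, kmax):
    """Truncation of the series in (1.30): 1 + 2 * sum_{k=1}^{kmax} r^k cos(k theta)."""
    k = np.arange(1, kmax + 1)
    return 1.0 + 2.0 * np.sum((r ** k)[:, None] * np.cos(np.outer(k, theta)), axis=0)

r = 0.8
theta = 2.0 * np.pi * np.arange(512) / 512

series_gap = np.max(np.abs(poisson_series(r, theta, kmax=200) - poisson_kernel(r, theta)))
mean_value = np.mean(poisson_kernel(r, theta))   # approximates (2 pi)^{-1} * integral in (1.32)
peak = poisson_kernel(r, 0.0)                    # equals (1 + r)/(1 - r)
print(series_gap, mean_value, peak)
```

For $r = 0.8$ the peak value is $(1 + r)/(1 - r) = 9$, consistent with the scale indicated in Fig. 1.1.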

We are now prepared to prove the following result giving Abel summability 
(1.27). 


Proposition 1.1. If $f \in C(\mathbb{T}^n)$, then 

(1.33)  $J_r f \to f$ uniformly on $\mathbb{T}^n$ as $r \nearrow 1$. 

Furthermore, for any $p \in [1, \infty)$, if $f \in L^p(\mathbb{T}^n)$, then 

(1.34)  $J_r f \to f$ in $L^p(\mathbb{T}^n)$ as $r \nearrow 1$. 


The proof of (1.33) is an immediate consequence of (1.31) and the peaked nature 
of $p(r, \theta)$ near $\theta = 0$ discussed above, together with the observation that, if $f$ is 
continuous at $\theta$, then it does not vary very much near $\theta$. The convergence in (1.34) 
is in the $L^p$-norm, defined by 

(1.35)  $\|g\|_{L^p} = \Bigl[(2\pi)^{-n}\int_{\mathbb{T}^n} |g(\theta)|^p\, d\theta\Bigr]^{1/p}.$

We have the well-known triangle inequality in such a norm: 

(1.36)  $\|g_1 + g_2\|_{L^p} \le \|g_1\|_{L^p} + \|g_2\|_{L^p},$

and this implies, via (1.31) and (1.32), 

(1.37)  $\|J_r f\|_{L^p} = \Bigl\|(2\pi)^{-n}\int_{\mathbb{T}^n} P_n(r, \theta')\, \tau_{\theta'} f\, d\theta'\Bigr\|_{L^p} \le (2\pi)^{-n}\int_{\mathbb{T}^n} P_n(r, \theta')\, \|\tau_{\theta'} f\|_{L^p}\, d\theta' = \|f\|_{L^p},$

where 

(1.38)  $\tau_{\theta'} f(\theta) = f(\theta - \theta'),$

which implies $\|\tau_{\theta'} f\|_{L^p} = \|f\|_{L^p}$. In other words, 

(1.39)  $\|J_r\|_{\mathcal{L}(L^p)} \le 1, \qquad 1 \le p < \infty,$

where we are using the operator norm on $L^p$: 

(1.40)  $\|T\|_{\mathcal{L}(L^p)} = \sup\, \{\|Tf\|_{L^p} : \|f\|_{L^p} \le 1\}.$

Using this, we can deduce (1.34) from (1.33), and the denseness of $C(\mathbb{T}^n)$ in each 
space $L^p(\mathbb{T}^n)$, for $1 \le p < \infty$. Indeed, given $f \in L^p(\mathbb{T}^n)$, and given $\varepsilon > 0$, find 
$g \in C(\mathbb{T}^n)$ such that $\|f - g\|_{L^p} < \varepsilon$. Note that, generally, $\|g\|_{L^p} \le \|g\|_{\sup}$. Now 
we have 

(1.41)  $\|J_r f - f\|_{L^p} \le \|J_r(f - g)\|_{L^p} + \|J_r g - g\|_{L^p} + \|g - f\|_{L^p} < \varepsilon + \|J_r g - g\|_{L^\infty} + \varepsilon,$

making use of (1.39). By (1.33), the middle term is $< \varepsilon$ if $r$ is close enough to 1, 
so this proves (1.34). 
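
The uniform convergence (1.33) can be observed numerically for a continuous but non-smooth function on the circle. The sketch below is an illustration under stated assumptions: $f(\theta) = |\theta|$ on $[-\pi, \pi]$, the Fourier coefficients are approximated by a discrete Fourier transform on a fine grid, and the sup norm is taken over grid points only.

```python
import numpy as np

def abel_mean(samples, r):
    """Approximate J_r f on the grid: damp the k-th Fourier coefficient by r^{|k|}."""
    n = samples.size
    k = np.fft.fftfreq(n, d=1.0 / n)   # integer frequencies
    return np.real(np.fft.ifft(np.fft.fft(samples) * r ** np.abs(k)))

n = 4096
theta = 2.0 * np.pi * np.arange(n) / n
f = np.minimum(theta, 2.0 * np.pi - theta)   # |theta| on [-pi, pi], extended periodically

radii = [0.5, 0.9, 0.99]
sup_errors = [np.max(np.abs(abel_mean(f, r) - f)) for r in radii]
print(sup_errors)
```

The sup-norm errors should decrease as $r$ increases toward 1, though slowly, since the corners of $|\theta|$ at $\theta = 0$ and $\theta = \pi$ limit the rate.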


Corollary 1.2. If $f \in C^\infty(\mathbb{T}^n)$, then the Fourier inversion formula (1.14) holds. 


Proof. In such a case, as noted, we have $\hat f \in s(\mathbb{Z}^n)$, so certainly the right side 
of (1.14) is absolutely convergent to some $f^\# \in C(\mathbb{T}^n)$. In such a case, one a 
fortiori has 

(1.42)  $\lim_{r\nearrow 1} \sum_{k\in\mathbb{Z}^n} \hat f(k)\, r^{|k|}\, e^{ik\cdot\theta} = f^\#(\theta).$

But now Proposition 1.1 implies (1.42) is equal to $f(\theta)$ (i.e., $f^\# = f$), so the 
inversion formula is proved for $f \in C^\infty(\mathbb{T}^n)$. 


As a result, we see that 

(1.43)  $\mathcal{F}^* : s(\mathbb{Z}^n) \longrightarrow C^\infty(\mathbb{T}^n)$

is surjective, as well as injective, with two-sided inverse $\mathcal{F} : C^\infty(\mathbb{T}^n) \to s(\mathbb{Z}^n)$. 
This of course implies that the map (1.24) has dense range in $L^2(\mathbb{T}^n)$; hence 

(1.44)  $\mathcal{F}^* : \ell^2(\mathbb{Z}^n) \longrightarrow L^2(\mathbb{T}^n)$ is unitary. 

Another way of stating this is 

(1.45)  $\{e^{ik\cdot\theta} : k \in \mathbb{Z}^n\}$ is an orthonormal basis of $L^2(\mathbb{T}^n)$, 

with inner product given by (1.8). Also, the inversion formula 

(1.46)  $\mathcal{F}^*\mathcal{F} = I$ on $C^\infty(\mathbb{T}^n)$ 

implies 

(1.47)  $\|\mathcal{F}f\|_{\ell^2} = \|f\|_{L^2},$

so therefore $\mathcal{F}$ extends by continuity from $C^\infty(\mathbb{T}^n)$ to a map 

(1.48)  $\mathcal{F} : L^2(\mathbb{T}^n) \longrightarrow \ell^2(\mathbb{Z}^n)$, unitary, 

inverting (1.44). The denseness $C^\infty(\mathbb{T}^n) \subset L^2(\mathbb{T}^n) \subset L^1(\mathbb{T}^n)$ implies that this 
$\mathcal{F}$ coincides with the restriction to $L^2(\mathbb{T}^n)$ of the map (1.3). Note that the fact 
that (1.44) and (1.48) are inverses of each other extends the inversion result of 
Corollary 1.2. 

We devote a little space to conditions implying that the Fourier series (1.14) 
is absolutely convergent, weaker than the hypothesis that $f \in C^\infty(\mathbb{T}^n)$. Note 
that since $|e^{ik\cdot\theta}| = 1$, the absolute convergence of (1.14) implies uniform convergence. 
By (1.4), we see that 

(1.49)  $f \in C^\ell(\mathbb{T}^n) \Longrightarrow |\hat f(k)| \le C\langle k\rangle^{-\ell},$

which in turn clearly gives absolute convergence provided 

(1.50)  $\ell \ge n + 1.$


Using Plancherel’s identity and Cauchy’s inequality, we can do somewhat better: 


Proposition 1.3. If $f \in C^\ell(\mathbb{T}^n)$, then the Fourier series for $f$ is absolutely convergent 
provided 

(1.51)  $\ell > \frac{n}{2}.$


Proof. We have 

(1.52)  $\sum_k |\hat f(k)| = \sum_k \langle k\rangle^{-\ell}\langle k\rangle^{\ell}|\hat f(k)| \le C\Bigl[\sum_k \langle k\rangle^{2\ell}|\hat f(k)|^2\Bigr]^{1/2},$

as long as (1.51) holds, since then $\sum_k \langle k\rangle^{-2\ell} < \infty$. The square of the right side is dominated by 

(1.53)  $C'\sum_k \sum_{|\gamma|\le\ell} |k^\gamma \hat f(k)|^2 = C'\sum_{|\gamma|\le\ell} \|D^\gamma f\|_{L^2}^2 \le C''\|f\|_{C^\ell}^2,$

so the proposition is proved. 


Sharper results on absolute convergence of Fourier series will be given in 
Chap. 4. See also some of the exercises below for more on convergence when 
$n = 1$. 


Exercises 


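1. Given $f, g \in L^1(\mathbb{T}^n)$, show that 

$\hat f(k)\hat g(k) = \hat u(k),$

with 

$u(\theta) = (2\pi)^{-n}\int_{\mathbb{T}^n} f(\varphi)\, g(\theta - \varphi)\, d\varphi.$

2. Given $f, g \in C(\mathbb{T}^n)$, show that 

$(fg)\hat{\ }(k) = \sum_m \hat f(k - m)\hat g(m).$

3. Using the proof of Proposition 1.3, show that every $f \in \operatorname{Lip}(S^1)$ has an absolutely 
convergent Fourier series. 

4. Show that for any $f \in L^1(\mathbb{T}^n)$, $\hat f(k) \to 0$ as $|k| \to \infty$. 
(Hint: Given $\varepsilon > 0$, pick $f_\varepsilon \in C^\infty(\mathbb{T}^n)$, $\|f - f_\varepsilon\|_{L^1} < \varepsilon$. Compare $\hat f_\varepsilon(k)$ and $\hat f(k)$.) 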
This result is known as the Riemann—Lebesgue lemma. 

5. For $f \in L^1(S^1)$, set 

(1.54)  $S_N f(\theta) = \sum_{k=-N}^{N} \hat f(k)\, e^{ik\theta}.$

Show that $S_N f(\theta) = (1/2\pi)\int_{-\pi}^{\pi} f(\theta - \varphi) D_N(\varphi)\, d\varphi$, where 

(1.55)  $D_N(\theta) = \sum_{k=-N}^{N} e^{ik\theta} = \frac{\sin(N + \frac{1}{2})\theta}{\sin\frac{1}{2}\theta}.$

(Hint: To evaluate the sum, recall how to sum a finite geometrical series.) 
$D_N(\theta)$ is called the Dirichlet kernel. See Fig. 1.2. 

FIGURE 1.2 Dirichlet Kernel 

6. Let $f \in L^1(S^1)$ have the following property of “vanishing” at $\theta = 0$: 

$\frac{f(\theta)}{\sin\frac{1}{2}\theta} = g(\theta) \in L^1(-\pi, \pi).$

Show that $S_N f(0) \to 0$ as $N \to \infty$. 
(Hint: Adapt the Riemann-Lebesgue lemma to show that 

$g \in L^1(-\pi, \pi) \Longrightarrow \int_{-\pi}^{\pi} g(\theta)\sin(N + \tfrac{1}{2})\theta\, d\theta \to 0$ as $N \to \infty$.) 

7. Deduce that if $f \in L^1(S^1)$ is Lipschitz continuous at $\theta_0$, then $S_N f(\theta_0) \to f(\theta_0)$ as 
$N \to \infty$. Furthermore, if $f$ is Lipschitz on an open interval $J \subset S^1$, then $S_N f \to f$ 
uniformly on compact subsets of $J$. 

8. Let $f \in L^\infty(S^1)$ be piecewise Lipschitz, with a finite number of simple jumps. Show 
that $S_N f(\theta) \to f(\theta)$ at points of continuity. If $f$ has a jump at $\theta_j$, with limiting values 
$f_\pm(\theta_j)$, show that 

(1.56)  $S_N f(\theta_j) \to \frac{1}{2}\bigl[f_+(\theta_j) + f_-(\theta_j)\bigr],$

as $N \to \infty$. 
(Hint: By Exercise 7, it remains only to establish (1.56). Show that this can be reduced 
to the case $\theta_j = \pi$, $f(\theta) = \theta$, for $-\pi < \theta < \pi$. Verify that this function has Fourier 
series 

$2\sum_{k=1}^{\infty} \frac{(-1)^{k+1}}{k}\sin k\theta.$

Alternative: Reduce to the case $\theta_j = 0$ and note that $S_N f(0)$ depends only on the 
even part of $f$, $(1/2)[f(\theta) + f(-\theta)]$.) 
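9. Work out the Fourier series of the function $f \in \operatorname{Lip}(S^1)$ given by 

$f(\theta) = |\theta|, \qquad -\pi \le \theta \le \pi.$

Examining this at $\theta = 0$, establish that 

(1.57)  $\sum_{k=1}^{\infty} \frac{1}{k^2} = \frac{\pi^2}{6}.$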


10. One can obtain Fourier coefficients of functions * and |6|* on [—7, 7] in terms of the 
Fourier coefficients of 


; -1 
Do gh) Go" =- a [Cyr JZ) 
and use this to work out the Fourier series for these functions. Apply this to Exercise 9, 
and to the calculation at the end of Exercise 8. 
11. Assume that $g \in L^1(S^1)$ has uniformly convergent Fourier series ($S_N g \to g$) on 
compact subsets of an open interval $J \subset S^1$. Show that whenever $f \in L^1(S^1)$ and 
$f = g$ on $J$, then $f$ also has uniformly convergent Fourier series on compact subsets 
of $J$. 
(Hint: Apply Exercise 7 to $f - g$.) 
This result is called the localization principle for Fourier series. 
12. Suppose $f$ is Hölder continuous on $S^1$, that is, $f \in C^r(S^1)$, for some $r \in (0, 1)$, 
which means 
$|f(\theta + \varphi) - f(\theta)| \le C|\varphi|^r.$
Show that $f$ has uniformly convergent Fourier series on $S^1$. 
(Hint: We have 
$\Bigl|\frac{f(\theta + \varphi) - f(\theta)}{\sin\frac{1}{2}\varphi}\Bigr| \le C\bigl|\sin\tfrac{1}{2}\varphi\bigr|^{-(1-r)}.$
Apply Exercise 6.) 
13. If $\omega : [0, \infty) \to [0, \infty)$ is continuous and increasing and $\omega(0) = 0$, we say a function 
$f$ on $S^1$ is continuous with modulus of continuity $\omega$ provided 

$|f(\theta + \varphi) - f(\theta)| \le C\omega(|\varphi|).$

Formulate the most general condition you can to establish uniform convergence of the 
Fourier series of a function with such a modulus of continuity. Note that Exercise 12 
deals with the case $\omega(s) = s^r$, $r \in (0, 1)$. 

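14. Consider the Cesàro sum of the Fourier series of $f$: 

$C_N f(\theta) = \sum_{k=-N}^{N}\Bigl(1 - \frac{|k|}{N}\Bigr)\hat f(k)\, e^{ik\theta} = \frac{1}{N}\sum_{\ell=0}^{N-1} S_\ell f(\theta).$

Show that $C_N f(\theta) = (1/2\pi)\int_{-\pi}^{\pi} f(\theta - \varphi) F_N(\varphi)\, d\varphi$, where 

(1.58)  $F_N(\theta) = \sum_{k=-N}^{N}\Bigl(1 - \frac{|k|}{N}\Bigr) e^{ik\theta} = \frac{1}{N}\sum_{\ell=0}^{N-1} \frac{\sin(\ell + \frac{1}{2})\theta}{\sin\frac{1}{2}\theta} = \frac{1}{N}\Bigl(\frac{\sin\frac{N}{2}\theta}{\sin\frac{1}{2}\theta}\Bigr)^2.$

The function $F_N(\theta)$ is called the Fejér kernel (see Fig. 1.3). Modify the proof of 
Proposition 1.1 to show that 

$C_N f \to f$ in $B$, for $f \in B$, 

where $B$ is one of the Banach spaces $C(S^1)$ or $L^p(S^1)$, $1 \le p < \infty$. 
(Hint: To evaluate the second sum in (1.58), use $\sin(\ell + \frac{1}{2})\theta = \operatorname{Im}\, e^{i\theta/2}e^{i\ell\theta}$ and sum 
a finite geometrical series. Also use the identity $2\sin^2 z = 1 - \cos 2z$.) 

FIGURE 1.3 Fejér Kernel 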
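
Before attempting the proofs, one may find it reassuring to check the closed forms (1.55) and (1.58) numerically. The sketch below does only that (it is not a solution to the exercises); the function names and the sample points, chosen away from $\theta = 0$ where the closed forms are $0/0$, are illustrative choices.

```python
import numpy as np

def dirichlet_sum(N, theta):
    """Sum of exp(i k theta) over -N <= k <= N."""
    k = np.arange(-N, N + 1)
    return np.real(np.exp(1j * np.outer(theta, k)).sum(axis=1))

def dirichlet_closed(N, theta):
    """Closed form in (1.55)."""
    return np.sin((N + 0.5) * theta) / np.sin(0.5 * theta)

def fejer_sum(N, theta):
    """Sum of (1 - |k|/N) exp(i k theta) over -N <= k <= N."""
    k = np.arange(-N, N + 1)
    weights = 1.0 - np.abs(k) / N
    return np.real((weights * np.exp(1j * np.outer(theta, k))).sum(axis=1))

def fejer_closed(N, theta):
    """Last expression in (1.58)."""
    return (np.sin(0.5 * N * theta) / np.sin(0.5 * theta)) ** 2 / N

N = 7
theta = np.linspace(0.1, 3.0, 30)

dirichlet_gap = np.max(np.abs(dirichlet_sum(N, theta) - dirichlet_closed(N, theta)))
fejer_gap = np.max(np.abs(fejer_sum(N, theta) - fejer_closed(N, theta)))

# The Fejer kernel is the average of the Dirichlet kernels D_0, ..., D_{N-1}.
average = np.mean([dirichlet_closed(l, theta) for l in range(N)], axis=0)
average_gap = np.max(np.abs(average - fejer_closed(N, theta)))
print(dirichlet_gap, fejer_gap, average_gap)
```

The closed form makes visible the key difference between the two kernels: $F_N \ge 0$, whereas $D_N$ changes sign, which is why the argument of Proposition 1.1 adapts to $C_N$ but not directly to $S_N$.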


2. Harmonic functions and holomorphic functions 
in the plane 


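The method of proof of the Abel summability (1.26)-(1.27) of Fourier series, 
specialized to $\mathbb{T}^1 = S^1$, has important implications for the theory of harmonic 
functions on a domain $\Omega \subset \mathbb{R}^2$, which we will discuss here. In the case of $S^1$, let 
us rewrite (1.26), 

(2.1)  $J_r f(\theta) = \sum_{k=-\infty}^{\infty} \hat f(k)\, r^{|k|}\, e^{ik\theta},$

as 

(2.2)  $(\mathrm{PI}\, f)(r, \theta) = \sum_{k=-\infty}^{\infty} \hat f(k)\, r^{|k|}\, e^{ik\theta}.$

The function $u(r, \theta) = \mathrm{PI}\, f(r, \theta)$ is called the Poisson integral of $f$. If we use 
polar coordinates in the complex plane $\mathbb{C} = \mathbb{R}^2$, 

(2.3)  $z = re^{i\theta},$

then (2.2) becomes 

(2.4)  $(\mathrm{PI}\, f)(z) = \sum_{k=0}^{\infty} \hat f(k)\, z^k + \sum_{k=1}^{\infty} \hat f(-k)\, \bar z^k = (\mathrm{PI}_+ f)(z) + (\mathrm{PI}_- f)(z),$

defined on the unit disk $|z| < 1$. Note also, from (1.30), that 

(2.5)  $\mathrm{PI}\, f(z) = \frac{1 - |z|^2}{2\pi}\int_{S^1} \frac{f(w)}{|w - z|^2}\, ds(w),$

the integral being with respect to arclength on $S^1$. Recall that if $f \in L^1(S^1)$, the 
function $\hat f(k)$ is bounded, so both power series in (2.4) have radius of convergence 
at least 1. Clearly, on the unit disk, $v(z) = (\mathrm{PI}_+ f)(z)$ is holomorphic and $w(z) = 
(\mathrm{PI}_- f)(z)$ is antiholomorphic. In other words, $v$ and $w$ belong to $C^\infty(D)$, where 
$D = \{z \in \mathbb{C} : |z| < 1\}$, and 

(2.6)  $\frac{\partial v}{\partial\bar z} = 0, \qquad \frac{\partial w}{\partial z} = 0$ on $D$, 

where 

(2.7)  $\frac{\partial}{\partial\bar z} = \frac{1}{2}\Bigl(\frac{\partial}{\partial x} + i\frac{\partial}{\partial y}\Bigr), \qquad \frac{\partial}{\partial z} = \frac{1}{2}\Bigl(\frac{\partial}{\partial x} - i\frac{\partial}{\partial y}\Bigr).$

Note that 

$\frac{\partial}{\partial z}\frac{\partial}{\partial\bar z} = \frac{\partial}{\partial\bar z}\frac{\partial}{\partial z} = \frac{1}{4}\Delta,$

where $\Delta$ is the Laplace operator on $\mathbb{R}^2$, a special case of the Laplace operator 
introduced in Chap. 2. Since $v, w \in C^\infty(D)$, we have $\Delta v = 0$ and $\Delta w = 0$, and 
hence 

(2.8)  $\Delta(\mathrm{PI}\, f) = 0.$

In light of the results of §1, we have the following. 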


Proposition 2.1. If $f \in C(S^1)$, then 

(2.9)  $u = \mathrm{PI}\, f(z) \in C^2(D) \cap C(\overline{D})$

is harmonic, with boundary value $f$, that is, $u$ solves the Dirichlet problem 

(2.10)  $\Delta u = 0$ in $D$, $\quad u|_{\partial D} = f.$
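
To see the Dirichlet problem (2.10) being solved by the series (2.2), here is a minimal numerical sketch. It assumes boundary data $f(\theta) = e^{\cos\theta}\cos(\sin\theta)$, the boundary values of the harmonic function $\operatorname{Re} e^z = e^x\cos y$, and approximates $\hat f(k)$ by a discrete Fourier transform of equispaced samples; the function names and the test points are illustrative.

```python
import numpy as np

def poisson_integral(boundary_samples, r, theta):
    """Evaluate the series (2.2) at r*exp(i*theta), 0 <= r < 1, from equispaced samples of f."""
    n = boundary_samples.size
    k = np.fft.fftfreq(n, d=1.0 / n)             # integer frequencies
    coeffs = np.fft.fft(boundary_samples) / n    # approximates f_hat(k)
    return np.real(np.sum(coeffs * r ** np.abs(k) * np.exp(1j * k * theta)))

def exact_solution(r, theta):
    """Re exp(z) at z = r*exp(i*theta), a harmonic function on the plane."""
    x, y = r * np.cos(theta), r * np.sin(theta)
    return np.exp(x) * np.cos(y)

n = 64
phi = 2.0 * np.pi * np.arange(n) / n
f = np.exp(np.cos(phi)) * np.cos(np.sin(phi))    # boundary values on the unit circle

points = [(0.0, 0.0), (0.5, 1.0), (0.9, 2.5), (0.99, -0.7)]
max_error = max(abs(poisson_integral(f, r, t) - exact_solution(r, t)) for r, t in points)
print(max_error)
```

Agreement at interior points is a numerical reflection of the uniqueness of the solution to (2.10), which is taken up later in this section.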


One should expect that if $f$ has extra smoothness on $S^1$, so does $\mathrm{PI}\, f$ on $\overline{D}$. The 
following result is crude compared to results established in Chaps. 4 and 5, but it 
will be of some interest. 


Proposition 2.2. For $\ell = 1, 2, 3, \dots$, we have 

(2.11)  $\mathrm{PI} : C^{\ell+1}(S^1) \longrightarrow C^\ell(\overline{D}).$


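Proof. We begin with the case $\ell = 1$. Since we know from (2.4) that $\mathrm{PI}\, f \in 
C^\infty(D)$, we need merely check smoothness near $\partial D = S^1$. Clearly, 

(2.12)  $\frac{\partial}{\partial\theta}\mathrm{PI}\, f = \mathrm{PI}\, \frac{\partial f}{\partial\theta},$

so if $f \in C^1(S^1)$, then $\partial f/\partial\theta \in C(S^1)$ and we have $(\partial/\partial\theta)\mathrm{PI}\, f$ continuous on 
$\overline{D}$. Also, by (2.2), 

(2.13)  $r\frac{\partial}{\partial r}\mathrm{PI}\, f = \mathrm{PI}(Nf),$

where $Nf$ is characterized by the Fourier series representation 

(2.14)  $Nf(\theta) = \sum_{k=-\infty}^{\infty} \hat f(k)\, |k|\, e^{ik\theta}.$

Thus 

(2.15)  $Nf = \frac{1}{i}\frac{\partial}{\partial\theta}Hf = \frac{1}{i}H\frac{\partial f}{\partial\theta},$

where $H$ has the Fourier series representation 

(2.16)  $Hg(\theta) = \sum_{k=-\infty}^{\infty} (\operatorname{sgn} k)\, \hat g(k)\, e^{ik\theta}.$

We claim that, for $\ell \ge 0$, 

(2.17)  $H : C^{\ell+1}(S^1) \longrightarrow C^\ell(S^1),$

and hence $N : C^{\ell+2}(S^1) \to C^\ell(S^1)$. Given this, for $f \in C^2(S^1)$, the quantity 
(2.13) is seen to belong to $C(\overline{D})$, and this finishes the $\ell = 1$ case of (2.11). 

In turn, since $H$ commutes with $\partial/\partial\theta$, it suffices to establish the $\ell = 0$ case of 
(2.17). Now, by Proposition 1.3, the Fourier series for $g \in C^1(S^1)$ is absolutely 
convergent, giving (2.17). 

To prove the general case of (2.11), a short calculation yields 

(2.18)  $\Bigl(r\frac{\partial}{\partial r}\Bigr)^j\Bigl(\frac{\partial}{\partial\theta}\Bigr)^k \mathrm{PI}\, f = \mathrm{PI}\Bigl(N^j\Bigl(\frac{\partial}{\partial\theta}\Bigr)^k f\Bigr).$

Note that $N^j = (-i)^j(\partial/\partial\theta)^j H^{\sigma(j)}$, where $\sigma(j)$ is zero if $j$ is even and one if $j$ 
is odd, so, for $\ell \ge 0$, 

$N^j : C^{\ell+j+1}(S^1) \longrightarrow C^\ell(S^1),$

the left side being improved to $C^{\ell+j}$ if $j$ is even. Therefore, if $f \in C^{\ell+1}(S^1)$ and 
$j + k = \ell$, the right side of (2.18) is $\mathrm{PI}\, f_{jk}$, with $f_{jk} \in C(S^1)$, which proves 
(2.11) in general. 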


The implication $f \in C^\ell(S^1) \Rightarrow \mathrm{PI}\, f \in C^\ell(\overline{D})$ does not quite work, as we 
will see later, essentially because $H$ does not map $C^\ell(S^1)$ to itself. It is true that 
$f \in C^{\ell,\alpha}(S^1) \Rightarrow \mathrm{PI}\, f \in C^{\ell,\alpha}(\overline{D})$, for $\alpha \in (0, 1)$. This is a special case of Hölder 
estimates that will be established in §7 of Chap. 13. Similarly there are “sharp” 
results on regularity of $\mathrm{PI}\, f$ in Sobolev spaces, discussed in Chap. 4, and in much 
greater generality, in Chap. 5. 

It is important to know that PI f provides the unique solution to the Dirichlet 
problem (2.10). We will establish several general uniqueness results, starting with 
the following. 


Proposition 2.3. Let $\Omega \subset \mathbb{R}^n$ be a bounded region with smooth boundary, say, 
$\Omega = D$. Suppose $u, v \in C^2(\overline{\Omega})$, with $u = v = f$ on $\partial\Omega$, and $\Delta u = \Delta v = 0$ in $\Omega$. 
Then $u = v$ on all of $\Omega$. 


Proof. Set $w = u - v \in C^2(\overline{\Omega})$; $w = 0$ on $\partial\Omega$. We can apply the Green identity 
(3.15) of Chap. 2, to write 

(2.19)  $(dw, dw) = -(\Delta w, w) + \int_{\partial\Omega} w\frac{\partial w}{\partial\nu}\, dS.$

By hypothesis the right side of (2.19) is 0. Thus $w$ is constant on each component 
of $\Omega$, and the boundary condition forces $w = 0$. 

In view of Proposition 2.2, we could apply this to $u = \mathrm{PI}\, f$ if $f \in C^3(S^1)$, but 
this is not a satisfactory result, and we will do much better below. 

A result related to our uniqueness question is the mean-value property, a spe- 
cial case of which is the following. 


Proposition 2.4. If $f \in C(S^1)$, $u = \mathrm{PI}\, f$, then 

(2.20)  $u(0) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(\theta)\, d\theta.$


Proof. It follows from the series (2.2) that $u(0) = \hat f(0)$, which gives (2.20). 


A more general result is the following. 


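Proposition 2.5. If $B_R \subset \mathbb{R}^n$ is the open ball of radius $R$, centered at the origin, 
with $\partial B_R = S_R$, of area $A(R)$, then for $u \in C^2(B_R) \cap C(\overline{B}_R)$, $\Delta u = 0$, we 
have 

(2.21)  $u(0) = \frac{1}{A(R)}\int_{S_R} u(x)\, dS.$


Proof. We apply Green’s identity 

(2.22)  $\int_\Omega [u\Delta v - v\Delta u]\, dx = \int_{\partial\Omega}\Bigl[u\frac{\partial v}{\partial\nu} - v\frac{\partial u}{\partial\nu}\Bigr]\, dS,$

to $\Omega = B_r$, $0 < r < R$, $v(x) = |x|^2$, with $\Delta v = 2n$, to get, when $\Delta u = 0$, 

(2.23)  $n\int_{B_r} u(x)\, dx = r\int_{S_r} u(x)\, dS,$

noting that substituting $v = 1$ in (2.22) gives $\int_{S_r} (\partial u/\partial\nu)\, dS = 0$. If we let 
$\varphi(r) = \int_{B_r} u(x)\, dx$, this implies $\varphi'(r) = (n/r)\varphi(r)$, and hence $\varphi(r) = Kr^n$, 
i.e., $V(r)^{-1}\int_{B_r} u(x)\, dx$ is constant, where $V(r)$ is the volume of $B_r$. Passing to the limit $r \to 0$ gives (2.21). 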


Second Proof. Define $v \in C^2(B_R) \cap C(\overline{B}_R)$ by 

$v(x) = \int_{SO(n)} u(gx)\, dg,$

where $dg$ is Haar measure on the rotation group $SO(n)$, defined in §6 of Appendix 
B. The Laplace operator is invariant under rotations, so $\Delta v = 0$ on $B_R$. The 
function $v$ is radial; $v(x) = \tilde v(|x|)$. The formula 

$\Delta = \frac{\partial^2}{\partial r^2} + \frac{n-1}{r}\frac{\partial}{\partial r} + \frac{1}{r^2}\Delta_S$

(cf. (4.17) of Chap. 2), for $\Delta$ in polar coordinates, where $\Delta_S$ is the Laplace operator 
on the unit sphere $S^{n-1}$, gives 

$\Bigl(\frac{d^2}{dr^2} + \frac{n-1}{r}\frac{d}{dr}\Bigr)\tilde v(r) = 0.$

This is an Euler equation, whose solutions are 

$A + Br^{2-n}, \quad n \ge 3,$

$A + B\log r, \quad n = 2.$

Since $v$ does not blow up at 0, we have $B = 0$, so $v$ is constant. Clearly $v(x)$ 
equals the right side of (2.21) for $|x| = R$, and $v(0) = u(0)$, so we again have 
(2.21). 


Corollary 2.6. For any $\Omega \subset \mathbb{R}^n$ open, any $u \in C^2(\Omega)$ harmonic, any ball $B_\rho$, 
centered at $p$ and contained in $\Omega$, we have 

(2.24)  $u(p) = \operatorname{Avg}_{\partial B_\rho} u(x).$
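
The mean-value property (2.24) is easy to test in the plane. The sketch below is illustrative only: it averages over equispaced points on a circle (an assumption that works very well for smooth integrands), using the harmonic function $e^x\cos y$ and, for contrast, the non-harmonic function $x^2 + y^2$. The center and radius are arbitrary choices.

```python
import numpy as np

def circle_average(func, center, radius, n=256):
    """Average of func over the circle of given center and radius, via equispaced points."""
    t = 2.0 * np.pi * np.arange(n) / n
    x = center[0] + radius * np.cos(t)
    y = center[1] + radius * np.sin(t)
    return np.mean(func(x, y))

def harmonic(x, y):
    """Re exp(x + i y), which satisfies the Laplace equation."""
    return np.exp(x) * np.cos(y)

def not_harmonic(x, y):
    """|x|^2 in the plane, whose Laplacian is 4."""
    return x**2 + y**2

p = (0.3, -0.2)
rho = 0.7

harmonic_gap = abs(circle_average(harmonic, p, rho) - harmonic(*p))
non_harmonic_gap = circle_average(not_harmonic, p, rho) - not_harmonic(*p)
print(harmonic_gap, non_harmonic_gap)
```

For $x^2 + y^2$ the circle average exceeds the central value by exactly $\rho^2$, so the failure of (2.24) measures the Laplacian, in line with the role of $v(x) = |x|^2$ in the first proof of Proposition 2.5.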


We can now prove the following important maximum principle for harmonic func- 
tions. Much more general versions of this will be given in Chap. 5. 


Proposition 2.7. Let $\Omega \subset \mathbb{R}^n$ be connected and open, and let $u \in C^2(\Omega)$ be 
harmonic and real-valued. Then $u$ has no interior maximum, or minimum, unless 
$u$ is constant. In particular, if $\Omega$ is bounded and $u \in C(\overline{\Omega})$, then 

(2.25)  $\sup_{p\in\Omega} u(p) = \sup_{q\in\partial\Omega} u(q).$

Also, even for $u$ complex-valued, 

(2.26)  $\sup_{p\in\Omega} |u(p)| = \sup_{q\in\partial\Omega} |u(q)|.$


Proof. That a non-constant, real harmonic function has no interior extremum is 
an obvious consequence of (2.24), and the other consequences, (2.25) and (2.26), 
follow immediately. 


Corollary 2.8. The uniqueness result of Proposition 2.3 holds for any bounded 
open $\Omega \subset \mathbb{R}^n$, with no smoothness on $\partial\Omega$, and for any harmonic 

(2.27)  $u, v \in C^2(\Omega) \cap C(\overline{\Omega}).$


Proof. Apply the maximum principle to $u - v$. 


Here, we have used the mean-value property to prove the maximum principle. The 
more general maximum principle established in Chap. 5 will not use the mean-value 
property; indeed, together with a symmetry argument, it can be made a 
basis for a proof of the mean-value property. We will leave these considerations 
until Chap. 5. 

With our uniqueness result in hand, we can easily establish the following inte- 
rior regularity result for harmonic functions. 


Proposition 2.9. Let $\Omega \subset \mathbb{R}^2$ be open, and let $u \in C^2(\Omega)$ be harmonic. Then in 
fact, $u \in C^\infty(\Omega)$; $u$ is even real analytic on $\Omega$. 


Proof. By translations and dilations, we can reduce to the case $\Omega = D$, $u \in 
C^2(\overline{D})$. The uniqueness result of Corollary 2.8 implies 

(2.28)  $u = \mathrm{PI}\, f$, where $f = u|_{S^1}$. 

Then the conclusion that $u$ is real analytic on $D$ follows directly from the power-series 
expansion (2.4). 


Parenthetically, we remark that, by Corollary 2.8, the identity (2.28) holds for any 
$u \in C^2(D) \cap C(\overline{D})$ harmonic in $D$. 

Using the results we have developed, via Fourier series, about harmonic func- 
tions, we can quickly draw some basic conclusions about holomorphic functions. 
If $\Omega \subset \mathbb{C}$ is open, $f : \Omega \to \mathbb{C}$ is by definition holomorphic if and only if 
$f \in C^1(\Omega)$ and $\partial f/\partial\bar z = 0$, where $\partial/\partial\bar z$ is given by (2.7). Clearly, if $f \in C^2(\Omega)$ 
is holomorphic, then it is also harmonic, and so are its real and imaginary parts. 
Suppose $u \in C^2(D) \cap C(\overline{D})$ is holomorphic in $D$. Then the series representation 
(2.4) is valid, since (2.28) holds. This series is a sum of two terms: 

(2.29)  $u(z) = \sum_{k=0}^{\infty} \hat f(k)\, z^k + \sum_{k=1}^{\infty} \hat f(-k)\, \bar z^k = u_1(z) + u_2(z),$

where $\partial u_1/\partial\bar z = 0$ and $\partial u_2/\partial z = 0$. But if we are given that $\partial u/\partial\bar z = 0$, then 
also $\partial u_2/\partial\bar z = 0$, so $\partial u_2/\partial x = \partial u_2/\partial y = 0$. Thus $u_2$ is constant, and since 
$u_2(0) = 0$, this forces $u_2 = 0$. In other words, the holomorphic function $u(z)$ has 
the power series 

(2.30)  $u(z) = \sum_{k=0}^{\infty} a_k z^k, \qquad z \in D,$

where $a_k = \hat f(k)$, $k \ge 0$. Note that differentiation of (2.30) gives 

(2.31)  $a_k = \frac{1}{k!}\Bigl(\frac{\partial}{\partial z}\Bigr)^k u(0).$
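
The two descriptions of the coefficients, $a_k = \hat f(k)$ from boundary values and (2.31) from derivatives at the origin, can be compared numerically. The sketch below assumes the example $u(z) = e^z$, for which (2.31) gives $a_k = 1/k!$, and approximates $\hat f(k)$ by a discrete Fourier transform of samples of $u$ on the unit circle.

```python
import math
import numpy as np

n = 64
theta = 2.0 * np.pi * np.arange(n) / n
f = np.exp(np.exp(1j * theta))      # boundary values u(e^{i theta}) for u(z) = exp(z)

coeffs = np.fft.fft(f) / n          # coeffs[k] approximates f_hat(k); coeffs[-k] approximates f_hat(-k)

taylor = np.array([1.0 / math.factorial(k) for k in range(10)])
nonnegative_gap = np.max(np.abs(coeffs[:10] - taylor))   # f_hat(k) vs 1/k! for 0 <= k < 10
negative_size = np.max(np.abs(coeffs[-10:]))             # f_hat(k) for -10 <= k <= -1
print(nonnegative_gap, negative_size)
```

The vanishing of the coefficients with negative index corresponds to $u_2 = 0$ in (2.29).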


By the usual method of translating and dilating coordinates, we deduce the 
following. 


Proposition 2.10. If $\Omega \subset \mathbb{C}$ is open, $u \in C^2(\Omega)$ holomorphic, $p \in \Omega$, and $D_p$ a
disk centered at $p$, $\overline{D_p} \subset \Omega$, then on $D_p$, $u(z)$ is given by a convergent power-series expansion


(2.32)  $u(z) = \sum_{k=0}^{\infty} b_k (z - p)^k.$


We can relax the $C^2$-hypothesis to $u \in C^1(\Omega)$. As much stronger and more general
results are given in Chap. 5, we omit the details here.
For further use, we record the following result, whose proof is trivial. 


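Lemma 2.11. If $u \in C^\infty(\Omega)$ is holomorphic, and


$P = \sum a_{jk} \frac{\partial^{j+k}}{\partial x^j\, \partial y^k}$

is any constant-coefficient differential operator, then $Pu$ is holomorphic in $\Omega$.
Proof. $(\partial/\partial \bar z) Pu = P(\partial u/\partial \bar z) = 0$.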


We can use the power-series representation (2.30)—(2.32) to prove the fundamen- 
tal result on uniqueness of analytic continuation, which we give below. Here is 
the first result, of a very general nature. 


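Proposition 2.12. Let $\Omega \subset \mathbb{R}^n$ be open and connected, and let $u$ be a real-analytic
function on $\Omega$. If $p \in \Omega$ and all derivatives $D^\alpha u(p) = 0$, then $u = 0$
on all of $\Omega$.


Proof. Let $K = \{x \in \Omega : D^\alpha u(x) = 0 \text{ for all } \alpha \ge 0\}$. Since $u \in C^\infty(\Omega)$, $K$ is
closed in $\Omega$. However, for each $p \in K$, since $u$ is given in a neighborhood of $p$ by
a power series


$u(q) = \sum_{\alpha \ge 0} \frac{u^{(\alpha)}(p)}{\alpha!} (q - p)^\alpha,$


we also see that $K$ is open in $\Omega$. Since $K$ is nonempty and $\Omega$ is connected, $K = \Omega$. This proves the proposition.
Our basic corollary for holomorphic functions is the following.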


Corollary 2.13. Let $\Omega \subset \mathbb{C}$ be open and connected, $u$ holomorphic on $\Omega$. Let $\gamma$
be a line segment contained in $\Omega$. If $u|_\gamma = 0$, then $u = 0$ on $\Omega$.

Proof. Translating and rotating, we can assume $\gamma$ is a segment in the real axis,
with $0 \in \gamma$. Near 0, $u(z)$ has a power-series expansion of the form (2.30), with
$a_k$ given by (2.31). Using Lemma 2.11, we see that


(2.33)  $\left( \frac{\partial}{\partial z} \right)^j u(0) = \left( \frac{\partial}{\partial x} \right)^j u(0),$


which vanishes for all $j$. Thus $u = 0$ on a nonempty open set in $\Omega$, and the rest
follows by Proposition 2.12.


Actually, a much stronger result is true. If $\Omega \subset \mathbb{C}$ is connected, $p_j \in \Omega$ are
distinct, $p_j \to p \in \Omega$, $u$ is holomorphic in $\Omega$, and $u(p_j) = 0$ for each $j$, then $u$
must vanish identically. In other words, $u$ can have only isolated zeros if it does
not vanish identically. Indeed, say $u(p) = 0$. If $u$ is not identically zero, some
coefficient in the series (2.32) is nonzero; let $b_m$ be the first such coefficient:


(2.34)  $u(z) = (z - p)^m \sum_{k=0}^{\infty} b_{m+k} (z - p)^k = (z - p)^m v(z),$


where $v(z)$ is holomorphic on $D_p$ and $v(p) = b_m \neq 0$. Thus, by continuity,
$v(\zeta) \neq 0$ for $|\zeta - p| < \varepsilon$ if $\varepsilon$ is sufficiently small, which implies $u(\zeta) \neq 0$ if
$\zeta \neq p$ but $|\zeta - p| < \varepsilon$.

A typical use of Corollary 2.13 is in computations of integrals. We will see an 
example of this in the next section. 

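We end this section by recalling the classical Cauchy integral theorem and
integral formula. Throughout, $\Omega$ will be a bounded open domain in $\mathbb{C}$ with smooth
boundary. Stokes' formula, proved in Chap. 1, §13, states


(2.35)  $\iint_\Omega d\alpha = \int_{\partial\Omega} \alpha$


for a 1-form $\alpha$ with coefficients in $C^1(\overline{\Omega})$. If $\alpha = p\, dx + q\, dy$, this gives the
classical Green's formula


(2.36)  $\int_{\partial\Omega} (p\, dx + q\, dy) = \iint_\Omega \left( \frac{\partial q}{\partial x} - \frac{\partial p}{\partial y} \right) dx\, dy.$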


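If $u(x, y) \in C^1(\overline{\Omega})$ is a complex-valued function, we consequently have

(2.37)  $\int_{\partial\Omega} u\, dz = \int_{\partial\Omega} u\, (dx + i\, dy) = \iint_\Omega \left( i \frac{\partial u}{\partial x} - \frac{\partial u}{\partial y} \right) dx\, dy = 2i \iint_\Omega \frac{\partial u}{\partial \bar z}\, dx\, dy.$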


In the special case when $u$ is holomorphic in $\Omega$, we have the Cauchy integral
theorem:


Theorem 2.14. If $\Omega \subset \mathbb{C}$ is bounded with smooth boundary and $u \in C^1(\overline{\Omega})$ is
holomorphic, then


(2.38)  $\int_{\partial\Omega} u\, dz = 0.$


Using various limiting arguments, one can relax the hypotheses on smoothness of
$\partial\Omega$ and of $u$ near $\partial\Omega$; we won't go into this here. Next we prove Cauchy's integral
formula.


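Proposition 2.15. With $\Omega$ as above, $u \in C^2(\overline{\Omega})$ holomorphic in $\Omega$, we have


(2.39)  $u(\zeta) = \frac{1}{2\pi i} \int_{\partial\Omega} \frac{u(z)}{z - \zeta}\, dz, \quad \text{for } \zeta \in \Omega.$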


Proof. Write


(2.40)  $u(z)(z - \zeta)^{-1} = (z - \zeta)^{-1} [u(z) - u(\zeta)] + u(\zeta)(z - \zeta)^{-1} = v(z) + u(\zeta)(z - \zeta)^{-1}.$


By the series expansion for $u(z)$ about $\zeta$, we see that $v(z)$ is holomorphic
near $\zeta$; clearly, it is holomorphic on the rest of $\Omega$, and it belongs to $C^1(\overline{\Omega})$, so
$\int_{\partial\Omega} v(z)\, dz = 0$. Thus, to prove (2.39), it suffices to show that


(2.41)  $\int_{\partial\Omega} (z - \zeta)^{-1}\, dz = 2\pi i, \quad \text{for } \zeta \in \Omega.$


Indeed, if $\varepsilon$ is small enough that $B(\zeta, \varepsilon) = \{z \in \mathbb{C} : |z - \zeta| \le \varepsilon\}$ is contained in
$\Omega$, then Cauchy's theorem implies that the left side of (2.41) is equal to

(2.42)  $\int_{\partial B(\zeta, \varepsilon)} (z - \zeta)^{-1}\, dz,$


since $(z - \zeta)^{-1}$ is holomorphic in $z$ for $z \neq \zeta$. Making the change of variable $z = \zeta + \varepsilon e^{i\theta}$, we
see that (2.42) is equal to


$\int_0^{2\pi} (\varepsilon e^{i\theta})^{-1}\, i \varepsilon e^{i\theta}\, d\theta = 2\pi i,$


so the proof is complete.
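As a numerical sanity check of (2.39) and (2.41), the following sketch approximates the contour integrals on the unit circle with the trapezoid rule. The test function $u = e^z$, the point $\zeta$, and the helper name `cauchy_integral` are illustrative choices, not part of the text.

```python
import cmath
import math

def cauchy_integral(u, zeta, radius=1.0, n=2000):
    """Approximate (1 / (2*pi*i)) * integral over |z| = radius of u(z) / (z - zeta) dz.

    The circle is parametrized by z = radius * e^{i*theta}; the trapezoid rule in
    theta is very accurate here because the integrand is smooth and periodic.
    """
    dtheta = 2.0 * math.pi / n
    total = 0j
    for k in range(n):
        z = radius * cmath.exp(1j * k * dtheta)
        dz = 1j * z * dtheta          # dz = i z dtheta on the circle
        total += u(z) / (z - zeta) * dz
    return total / (2j * math.pi)

zeta = 0.3 + 0.2j                      # any point inside the unit disk
cauchy_error = abs(cauchy_integral(cmath.exp, zeta) - cmath.exp(zeta))
winding = cauchy_integral(lambda z: 1.0, zeta)   # (2.41) divided by 2*pi*i
print(cauchy_error, winding)
```

The first number printed should be at the level of rounding error, and the second should be close to 1, in line with (2.41).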


As stated before, the $C^2$-hypothesis can be relaxed to $C^1$.

A function $u(z)$ of the form $u(z) = v(z)/(z - a)^k$, $k \in \mathbb{Z}^+$, where $v$ is
holomorphic on a neighborhood $O$ of $a$, is said to have a pole of order $k$ at $z = a$
if $v(a) \neq 0$. In such a case, a variant of the preceding calculations yields


$\frac{1}{2\pi i} \int_\gamma u(z)\, dz = \frac{v^{(k-1)}(a)}{(k-1)!},$


the coefficient of $(z - a)^{k-1}$ in the power series of $v(z)$ about $z = a$, if $\gamma$ is a
smooth, simple, closed curve about $a$ such that $v$ is holomorphic on a neighborhood
of the closed region bounded by $\gamma$. This quantity is called the residue of
$u(z)$ at $z = a$.

One can often evaluate integrals by evaluating residues. We give a simple
illustration here; others are given in (2.48), (3.32), (A.14), and (A.15). Here we
evaluate


(2.43)  $\int_{-\infty}^{\infty} \frac{dx}{1 + x^2} = \lim_{R \to \infty} \int_{\gamma_R} \frac{dz}{1 + z^2},$


where $\gamma_R$ is the closed curve, going from $-R$ to $R$ along the real axis, then from
$R$ to $-R$ counterclockwise on the circle of radius $R$ centered at 0, that is, $\gamma_R = \partial O_R$,
where $O_R = \{z : \operatorname{Im} z > 0, |z| < R\}$. There is just one pole of $(1 + z^2)^{-1}$
in $O_R$, located at $z = i$. Since $(1 + z^2)^{-1} = (z + i)^{-1}(z - i)^{-1}$, we see that the
residue of $(1 + z^2)^{-1}$ at $z = i$ is $1/2i$, so


$\int_{-\infty}^{\infty} \frac{dx}{1 + x^2} = 2\pi i \cdot \frac{1}{2i} = \pi.$

Exercises 


1. Suppose $u$ satisfies the following Neumann boundary problem in the disk $D$:


(2.44)  $\Delta u = 0$ in $D$, $\quad \frac{\partial u}{\partial r} = g$ on $S^1$.

If $u = \operatorname{PI}(f)$, show that $f$ and $g$ must be related by $\hat g(k) = |k| \hat f(k)$, for all $k \in \mathbb{Z}$,
that is,


(2.45)  $g = Nf,$


with $N$ defined by (2.13).

2. Define the function $k_N$ by


$k_N(\theta) = \sum_{k \neq 0} |k|^{-1} e^{ik\theta}.$


Show that $k_N \in L^2(S^1) \subset L^1(S^1)$. Also show that, provided $g \in L^2(S^1)$ and


(2.46)  $\int_{S^1} g(\theta)\, d\theta = 0,$


a solution to (2.45) is given by


(2.47)  $f(\theta) = (2\pi)^{-1} \int_{S^1} k_N(\theta - \varphi)\, g(\varphi)\, d\varphi = Tg(\theta).$


3. If $T$ is defined by (2.47), show that $T : L^p(S^1) \to L^p(S^1)$ for $p \in [1, \infty)$ and
$T : C^\ell(S^1) \to C^\ell(S^1)$ for $\ell = 0, 1, 2, \dots$.

4. Given $g \in C^1(S^1)$, show that (2.44) has a solution $u \in C^1(\overline{D})$ if and only if (2.46)
holds. If $g \in C^\ell(S^1)$, show that (2.44) has a solution $u \in C^\ell(\overline{D})$.

Note: Regularity results of a more precise nature are given in Chap. 4, §4, in Exercise
1. See also Chap. 5, §7, for more general results.

5. Let $\Omega \subset \mathbb{R}^2$ be a smooth, bounded, connected region. Show that if $w \in C^2(\overline{\Omega})$, $\Delta w = 0$
on $\Omega$, and $\partial w/\partial \nu = 0$ on $\partial\Omega$, then $w$ is constant. (Hint: Use (2.19).)

Note: One can weaken the $C^2$-hypothesis to $w \in C^1(\overline{\Omega})$; compare Proposition 2.2 of
Chap. 5. For another type of relaxation, see Chap. 4, §4, Exercise 3.

6. Show that a $C^1$-function $f : \Omega \to \mathbb{C}$ is holomorphic if and only if, at each $z \in \Omega$,
$Df(z)$, a priori a real linear map on $\mathbb{R}^2$, is in fact complex linear on $\mathbb{C}$.

Note: This exercise has already been given in Chap. 1, §1.

7. Let $f$ be a holomorphic function on $\Omega \subset \mathbb{C}$, with $f : \Omega \to O$, and let $u$ be harmonic
on $O$. Show that $v = u \circ f$ is harmonic on $\Omega$. (Hint: For a short proof, write $u$ locally
as a sum of a holomorphic and an anti-holomorphic function.)

8. Let $g(z) = \sum_{k=1}^{\infty} a_k z^k$, and form the harmonic function $u = 2 \operatorname{Re} g = g + \bar g$. Show
that, under appropriate hypotheses on $(a_k)$, $g|_{S^1} = P_+(u|_{S^1})$, where $P_+$ is given by


$P_+ f(\theta) = \sum_{k=0}^{\infty} \hat f(k) e^{ik\theta}.$


9. Find a holomorphic function on $D$ that is unbounded but whose real part has a continuous
extension to $\overline{D}$.

Reconsider this problem after reading §6 of Chap. 5.

10. Hence show that $P_+$ does not map $C(S^1)$ to itself, nor does it map any $C^\ell(S^1)$ to
itself, for any integer $\ell \ge 0$.

11. Hence find $f \in C^1(S^1)$ such that $\operatorname{PI} f \notin C^1(\overline{D})$.
12. Use the method of residues to calculate


(2.48)  $\int_{-\infty}^{\infty} \frac{e^{i\xi x}}{1 + x^2}\, dx, \quad \xi > 0.$


(Hint: Write this as $\lim_{R \to \infty} \int_{\gamma_R} e^{i\xi z}/(1 + z^2)\, dz$, with $\gamma_R$ as in (2.43), given $\xi > 0$.
Then find the residue of $e^{i\xi z}/(1 + z^2)$ at $z = i$.)

13. Use the Poisson integral formula (2.5) to prove the following. Let $u \in C^2(D) \cap C(\overline{D})$
be harmonic in the disk $D = \{z \in \mathbb{C} : |z| < 1\}$. Assume $u \ge 0$ on $D$ and $u(0) = 1$.
Then


(2.49)  $|z| = a \in [0, 1) \Longrightarrow u(z) \ge \frac{1 - a}{1 + a}.$


This result is known as a Harnack inequality. Hint. Use the inequality


$\frac{1 - |z|^2}{|w - z|^2} \ge \frac{1 - a}{1 + a}, \quad |w| = 1, \ |z| = a \in [0, 1).$


Note. By translating and scaling, if $u$ is harmonic and $\ge 0$ on $D_R(p) = \{z \in \mathbb{C} : |z - p| < R\}$, then


$|z - p| = a \in [0, R) \Longrightarrow u(z) \ge \frac{R - a}{R + a}\, u(p).$


14. Using Exercise 13, show that if $u$ is harmonic in the entire plane $\mathbb{C}$ and $u \ge 0$ on $\mathbb{C}$,
then $u$ is constant. More generally, if there exists a constant $K$ such that $u \ge K$ on
$\mathbb{C}$, then $u$ is constant. This is a version of Liouville's theorem. See Proposition 4.6 for
another version.

15. Using Exercise 13, show that there exists $A \in (0, \infty)$ with the following property. Let
$u$ be harmonic on $D_R(0)$. Assume


$u(0) = 0, \quad u(z) \le M \text{ on } D_R(0).$


Then

$u(z) \ge -AM \text{ on } D_{R/2}(0).$


(Hint. Set $v(z) = M - u(z)$, so $v(z) \ge 0$ on $D_R(0)$, $v(0) = M$. Say $p \in \overline{D_{R/2}(0)}$,
$u(p) = \inf_{\overline{D_{R/2}(0)}} u$. Deduce that


$v(z) \ge \frac{1}{3}(M - u(p)) \text{ on } D_{R/4}(p),$

and from there that

$\operatorname{Avg}_{D_R(0)} v \ge \frac{1}{16} \cdot \frac{1}{3}(M - u(p)),$


while this average is equal to $v(0) = M$.)
16. Assume $u$ is harmonic on $\mathbb{C}$ and


$u(z) \le C_0 + C_1 |z|^k, \quad \forall z \in \mathbb{C},$

with $C_0, C_1 \in (0, \infty)$. Take $A$ from Exercise 15. Show that there exists $C_2$ such that


u(z) > —Co— ACi|z|", VzeEC. 
Note. In conjunction with Proposition 4.6, one gets that $u(z)$ must be a polynomial of
degree $\le k$ in $x$ and $y$.
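17. Let $\Omega$ be the strip $\Omega = \{z \in \mathbb{C} : 0 < \operatorname{Re} z < 1\}$, with closure $\overline{\Omega}$. Let $f$ be continuous
on $\overline{\Omega}$ and holomorphic on $\Omega$. Show that if $f$ is bounded,

$|f(z)| \le A \text{ on } \partial\Omega \Longrightarrow |f(z)| \le A \text{ on } \Omega.$


(Hint. First prove the implication under the additional hypothesis that $|f(z)| \to 0$ as
$|z| \to \infty$. Then consider $f_\varepsilon(z) = e^{\varepsilon z^2} f(z)$.)

18. In the setting of Exercise 17, show that if $\theta \in (0, 1)$,

$|f(iy)| \le A, \ |f(1 + iy)| \le B, \ \forall y \in \mathbb{R} \Longrightarrow |f(\theta + iy)| \le A^{1-\theta} B^{\theta}, \ \forall y \in \mathbb{R}.$


This result is known as the Hadamard three-lines lemma. (Hint. Consider $g(z) = A^{z-1} B^{-z} f(z)$.)
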
3. The Fourier transform 


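The Fourier transform is defined by

(3.1)  $\mathcal{F} f(\xi) = \hat f(\xi) = (2\pi)^{-n/2} \int f(x)\, e^{-i x \cdot \xi}\, dx$

when $f \in L^1(\mathbb{R}^n)$. It is clear that


(3.2)  $\mathcal{F} : L^1(\mathbb{R}^n) \to L^\infty(\mathbb{R}^n).$


This is analogous to (1.3). The analogue for $C^\infty(\mathbb{T}^n)$, and simultaneously for
$s(\mathbb{Z}^n)$, of §1, in this case is the Schwartz space of rapidly decreasing functions:


(3.3)  $\mathcal{S}(\mathbb{R}^n) = \{u \in C^\infty(\mathbb{R}^n) : x^\beta D^\alpha u \in L^\infty(\mathbb{R}^n) \text{ for all } \alpha, \beta \ge 0\},$


where $x^\beta = x_1^{\beta_1} \cdots x_n^{\beta_n}$, $D^\alpha = D_1^{\alpha_1} \cdots D_n^{\alpha_n}$, with $D_j = -i\, \partial/\partial x_j$. Note that
also

$u \in \mathcal{S}(\mathbb{R}^n) \Longrightarrow x^\beta D^\alpha u \in L^1(\mathbb{R}^n).$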


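It is then easy to verify that


(3.4)  $\mathcal{F} : \mathcal{S}(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n)$

and

(3.5)  $\xi^\alpha D_\xi^\beta\, \mathcal{F} f(\xi) = (-1)^{|\beta|}\, \mathcal{F}(D^\alpha x^\beta f)(\xi).$


We define $\mathcal{F}^*$ by

(3.6)  $\mathcal{F}^* f(\xi) = \tilde f(\xi) = (2\pi)^{-n/2} \int f(x)\, e^{i x \cdot \xi}\, dx,$


which differs from (3.1) only in the sign of the exponent. It is clear that $\mathcal{F}^*$
satisfies the mapping properties (3.2), (3.4), and


(3.7)  $(\mathcal{F} u, v) = (u, \mathcal{F}^* v),$


for $u, v \in \mathcal{S}(\mathbb{R}^n)$, where $(u, v)$ denotes the usual $L^2$-inner product, $(u, v) =
\int u(x)\, \overline{v(x)}\, dx$.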

As in the theory of Fourier series, the first major result is the Fourier inversion 
formula. The following is our first version. 


Proposition 3.1. We have the inversion formula
(3.8)  $\mathcal{F}^* \mathcal{F} = \mathcal{F} \mathcal{F}^* = I$ on $\mathcal{S}(\mathbb{R}^n)$.


As in the proof of the inversion formula for Fourier series, via Proposition 1.1, 
in the present proof we will sneak up on the inversion formula by throwing in 
a convergence factor that will allow interchange of orders of integration (in the 
proof of Proposition 1.1, the orders of an integral and an infinite series were 
interchanged). Also, as we will see in §5, this method will have serendipitous 
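applications to PDE. So, let us write, for $f \in \mathcal{S}(\mathbb{R}^n)$,


(3.9)  $\mathcal{F}^* \mathcal{F} f(x) = (2\pi)^{-n} \int \left[ \int f(y)\, e^{-i y \cdot \xi}\, dy \right] e^{i x \cdot \xi}\, d\xi = (2\pi)^{-n} \lim_{\varepsilon \searrow 0} \iint f(y)\, e^{-\varepsilon |\xi|^2} e^{i (x - y) \cdot \xi}\, dy\, d\xi.$


We can interchange the order of integration on the right for any $\varepsilon > 0$, to obtain


(3.10)  $\mathcal{F}^* \mathcal{F} f(x) = \lim_{\varepsilon \searrow 0} \int f(y)\, p(\varepsilon, x - y)\, dy,$

where

(3.11)  $p(\varepsilon, x) = (2\pi)^{-n} \int e^{-\varepsilon |\xi|^2 + i x \cdot \xi}\, d\xi.$

Note that

(3.12)  $p(\varepsilon, x) = \varepsilon^{-n/2}\, q(\varepsilon^{-1/2} x),$


where $q(x) = p(1, x)$. In a moment we will show that

(3.13)  $p(\varepsilon, x) = (4\pi\varepsilon)^{-n/2}\, e^{-|x|^2/4\varepsilon}.$

The derivation of this identity will also show that

(3.14)  $\int_{\mathbb{R}^n} q(x)\, dx = 1.$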


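From this, it follows as in the proof of Proposition 1.1 that


(3.15)  $\lim_{\varepsilon \searrow 0} \int f(y)\, p(\varepsilon, x - y)\, dy = f(x),$


for any $f \in \mathcal{S}(\mathbb{R}^n)$, even for $f$ bounded and continuous, so we have proved
$\mathcal{F}^* \mathcal{F} = I$ on $\mathcal{S}(\mathbb{R}^n)$; the proof that $\mathcal{F} \mathcal{F}^* = I$ on $\mathcal{S}(\mathbb{R}^n)$ is identical.
It remains to verify (3.13). We observe that $p(\varepsilon, x)$, defined by (3.11), is an
entire holomorphic function of $x \in \mathbb{C}^n$, for any $\varepsilon > 0$. It is convenient to verify
that


(3.16)  $p(\varepsilon, ix) = (4\pi\varepsilon)^{-n/2}\, e^{|x|^2/4\varepsilon}, \quad x \in \mathbb{R}^n,$


from which (3.13) follows by analytic continuation. Now


(3.17)  $p(\varepsilon, ix) = (2\pi)^{-n} \int e^{-x \cdot \xi - \varepsilon |\xi|^2}\, d\xi = (2\pi)^{-n} e^{|x|^2/4\varepsilon} \int e^{-|x/2\sqrt{\varepsilon} + \sqrt{\varepsilon}\, \xi|^2}\, d\xi = (2\pi)^{-n} \varepsilon^{-n/2} e^{|x|^2/4\varepsilon} \int_{\mathbb{R}^n} e^{-|\xi|^2}\, d\xi.$

To prove (3.16), it remains to show that

(3.18)  $\int_{\mathbb{R}^n} e^{-|\xi|^2}\, d\xi = \pi^{n/2}.$

Indeed, if

(3.19)  $A = \int_{-\infty}^{\infty} e^{-\xi^2}\, d\xi,$


then the left side of (3.18) is equal to $A^n$. But for $n = 2$ we can use polar
coordinates:


(3.20)  $A^2 = \int_{\mathbb{R}^2} e^{-|\xi|^2}\, d\xi = \int_0^{2\pi} \int_0^{\infty} e^{-r^2}\, r\, dr\, d\theta = \pi.$

This completes the proof of the identity (3.16) and hence of (3.13).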
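The identity (3.13) can also be checked numerically in the case $n = 1$. The following sketch compares a midpoint-rule approximation of the integral (3.11) with the closed form; the truncation length, the sample values of $\varepsilon$ and $x$, and the function names are arbitrary illustrative choices.

```python
import cmath
import math

def p_numeric(eps, x, n=20000):
    """Midpoint-rule approximation of (3.11) for n = 1:
    (2*pi)^{-1} * integral of exp(-eps*xi^2 + i*x*xi) d xi."""
    L = 10.0 / math.sqrt(eps)          # the integrand is below e^{-100} outside [-L, L]
    h = 2.0 * L / n
    total = 0j
    for k in range(n):
        xi = -L + (k + 0.5) * h
        total += cmath.exp(-eps * xi * xi + 1j * x * xi)
    return total * h / (2.0 * math.pi)

def p_closed(eps, x):
    """Right side of (3.13) for n = 1."""
    return (4.0 * math.pi * eps) ** -0.5 * math.exp(-x * x / (4.0 * eps))

for eps, x in [(0.5, 1.3), (2.0, -0.7)]:
    print(eps, x, abs(p_numeric(eps, x) - p_closed(eps, x)))
```

The printed differences should be at the level of rounding error, since the midpoint rule converges very rapidly for Gaussian integrands.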


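In light of (3.7) and the Fourier inversion formula (3.8), we see that, for $u, v \in
\mathcal{S}(\mathbb{R}^n)$,


(3.21)  $(\mathcal{F} u, \mathcal{F} v) = (u, v) = (\mathcal{F}^* u, \mathcal{F}^* v).$


Thus $\mathcal{F}$ and $\mathcal{F}^*$ extend uniquely from $\mathcal{S}(\mathbb{R}^n)$ to isometries on $L^2(\mathbb{R}^n)$ and are
inverses to each other. Thus we have the Plancherel theorem:


Proposition 3.2. The Fourier transform
(3.22)  $\mathcal{F} : L^2(\mathbb{R}^n) \to L^2(\mathbb{R}^n)$
is unitary, with inverse $\mathcal{F}^*$.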
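A small numerical illustration of the isometry property (3.21) in one dimension: the sketch below computes $\hat f$ by quadrature for the sample function $f(x) = x e^{-x^2}$ (an arbitrary choice, as are the grid parameters and function names) and compares the two $L^2$ norms.

```python
import cmath
import math

def f(x):
    return x * math.exp(-x * x)

def fourier_transform(func, xi, L=8.0, n=400):
    """(2*pi)^{-1/2} * integral over [-L, L] of func(x) e^{-i x xi} dx (midpoint rule)."""
    h = 2.0 * L / n
    total = 0j
    for k in range(n):
        x = -L + (k + 0.5) * h
        total += func(x) * cmath.exp(-1j * x * xi)
    return total * h / math.sqrt(2.0 * math.pi)

def l2_norm_squared(func, L, n):
    """Midpoint-rule approximation of the integral of |func|^2 over [-L, L]."""
    h = 2.0 * L / n
    return sum(abs(func(-L + (k + 0.5) * h)) ** 2 for k in range(n)) * h

norm_f = l2_norm_squared(f, 8.0, 400)
norm_fhat = l2_norm_squared(lambda xi: fourier_transform(f, xi), 12.0, 400)
print(norm_f, norm_fhat)
```

Both numbers should agree closely with each other and with the exact value $\int x^2 e^{-2x^2}\, dx = \tfrac14 \sqrt{\pi/2}$.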


The inversion formulas of Propositions 3.1 and 3.2 do not provide for the inversion 
of F in (3.2). We will obtain this as a byproduct of the Fourier inversion formula 
for tempered distributions, in the next section. 

We make a remark about the computation of the Fourier integral (3.11), done
above via analytic continuation. The following derivation does not make any
direct use of complex analysis. It suffices to handle the case $\varepsilon = 1/2$, that is,
to show


(3.23)  $\hat G(\xi) = e^{-|\xi|^2/2}$ if $G(x) = e^{-|x|^2/2}$, on $\mathbb{R}^n$.


We have interchanged the roles of $x$ and $\xi$ compared to those in (3.11) and (3.13).
It suffices to get (3.23) in the case $n = 1$, by the obvious multiplicativity. Now the
Gaussian function $G(x) = e^{-x^2/2}$ satisfies the differential equation

(3.24)  $\left( \frac{d}{dx} + x \right) G(x) = 0.$


By the intertwining property (3.5), it follows that $(d/d\xi + \xi)\, \hat G(\xi) = 0$, and
uniqueness of solutions to this ODE yields $\hat G(\xi) = C e^{-\xi^2/2}$. The constant $C$
is evaluated via the identity (3.20); $C = 1$; and we are done.

As for the necessity of computing the Fourier integral (3.11) to prove the
Fourier inversion formula, let us note the following. For any $g \in \mathcal{S}(\mathbb{R}^n)$ with
$g(0) = 1$ ($g(\xi) = e^{-|\xi|^2}$ being an example), we have (replacing $\varepsilon$ by $\delta^2$), just as
in (3.9),


(3.25)  $\mathcal{F}^* \mathcal{F} f(x) = (2\pi)^{-n} \lim_{\delta \to 0} \iint f(y)\, g(\delta \xi)\, e^{i (x - y) \cdot \xi}\, dy\, d\xi = \lim_{\delta \to 0} \int f(y)\, h_\delta(x - y)\, dy,$


where

(3.26)  $h_\delta(x) = (2\pi)^{-n} \int g(\delta \xi)\, e^{i x \cdot \xi}\, d\xi = (2\pi)^{-n/2}\, \delta^{-n}\, \tilde g(\delta^{-1} x).$


By the peaked nature of $h_\delta$ as $\delta \to 0$, we see that the limit in (3.25) is equal to

(3.27)  $C f(x),$


where


(3.28)  $C = \int h_1(x)\, dx = (2\pi)^{-n/2} \int \tilde g(x)\, dx.$


The argument (3.25)-(3.27) shows that $C$ is independent of the choice of $g \in
\mathcal{S}(\mathbb{R}^n)$, and we need only find a single example $g$ such that $\tilde g(x)$ can be evaluated
explicitly and then the integral on the right in (3.28) can be evaluated explicitly.
In most natural examples one picks $g$ to be even, so $\tilde g = \hat g$.

We remark that one does not need to have $g \in \mathcal{S}(\mathbb{R}^n)$ in the argument above; it
suffices to have $g \in L^1(\mathbb{R}^n)$, bounded and continuous, and such that $\tilde g \in L^1(\mathbb{R}^n)$.
An example, in the case $n = 1$, is


(3.29)  $g(\xi) = e^{-|\xi|}.$


In this case, elementary calculations give


(3.30)  $\tilde g(x) = \left( \frac{2}{\pi} \right)^{1/2} \frac{1}{x^2 + 1};$


compare (5.21). In this case, (3.28) can be evaluated in terms of the arctangent.
Another example, in the case $n = 1$, is


(3.31)  $g(\xi) = 1 - |\xi|$ if $|\xi| \le 1$, $\quad g(\xi) = 0$ if $|\xi| > 1$.

In this case,

(3.32)  $\tilde g(x) = (2\pi)^{-1/2} \left( \frac{\sin \frac{1}{2} x}{\frac{1}{2} x} \right)^2,$


and (3.28) can be evaluated by the method of residues. The calculation of (3.32)
can be achieved by evaluating


$\int_0^1 (1 - \xi) \cos x\xi\, d\xi$


via an integration by parts, though there is a more painless way, mentioned below.
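For the example $g(\xi) = e^{-|\xi|}$, the two steps (3.30) and (3.28) can be checked numerically: the transform against the closed form, and the constant $C$ via the arctangent. This is a sketch for $n = 1$ only; the function names, truncation length, and grid size are illustrative assumptions.

```python
import math

def g_tilde_numeric(x, L=40.0, n=40000):
    """(2*pi)^{-1/2} * integral of e^{-|xi|} e^{i x xi} d xi, using evenness in xi
    to reduce to 2 * integral over [0, L] of e^{-xi} cos(x xi) d xi (midpoint rule)."""
    h = L / n
    total = sum(math.exp(-(k + 0.5) * h) * math.cos(x * (k + 0.5) * h) for k in range(n))
    return 2.0 * total * h / math.sqrt(2.0 * math.pi)

def g_tilde_closed(x):
    """Formula (3.30)."""
    return math.sqrt(2.0 / math.pi) / (x * x + 1.0)

# C from (3.28): (2*pi)^{-1/2} * integral of g_tilde, using the antiderivative arctan.
C = (2.0 * math.pi) ** -0.5 * math.sqrt(2.0 / math.pi) * (
    math.atan(math.inf) - math.atan(-math.inf))

print(abs(g_tilde_numeric(1.5) - g_tilde_closed(1.5)), C)
```

The value of `C` should come out equal to 1 up to rounding, which is exactly what the inversion formula requires.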
We now make some comments on the relation between the Fourier transform
and convolutions. The convolution $u * v$ of two functions on $\mathbb{R}^n$ is defined by


(3.33)  $u * v(x) = \int u(y)\, v(x - y)\, dy.$


Note that $u * v = v * u$. If $u, v \in \mathcal{S}(\mathbb{R}^n)$, then $u * v \in L^1(\mathbb{R}^n)$. In fact,

$\|u * v\|_{L^p(\mathbb{R}^n)} \le \|u\|_{L^1} \|v\|_{L^p},$

so the convolution has a unique, continuous extension to a bilinear map

(3.34)  $L^1(\mathbb{R}^n) \times L^p(\mathbb{R}^n) \to L^p(\mathbb{R}^n),$

for $1 \le p < \infty$; one can directly perceive that this also works for $p = \infty$.


Note that the right side of (3.10), for any $\varepsilon > 0$, is an example of a convolution.
Computing the Fourier transform of (3.33) leads immediately to the formula


(3.35)  $\mathcal{F}(u * v)(\xi) = (2\pi)^{n/2}\, \hat u(\xi)\, \hat v(\xi),$

for $u, v \in L^1(\mathbb{R}^n)$. Using this, we can establish the following.

Proposition 3.3. We have

(3.36)  $u, v \in \mathcal{S}(\mathbb{R}^n) \Longrightarrow u * v \in \mathcal{S}(\mathbb{R}^n).$

Proof. It is elementary that

$\hat u, \hat v \in \mathcal{S}(\mathbb{R}^n) \Longrightarrow \hat u\, \hat v \in \mathcal{S}(\mathbb{R}^n).$

Also, by (3.34), $u * v \in L^2(\mathbb{R}^n)$. Hence


$u * v = (2\pi)^{n/2}\, \mathcal{F}^*(\hat u\, \hat v) \in \mathcal{S}(\mathbb{R}^n).$

We also note that if


(3.37)  $P = \sum_{|\alpha| \le k} a_\alpha D^\alpha$


is a constant-coefficient differential operator, we have


(3.38)  $P(u * v) = (Pu) * v = u * (Pv)$

if $u, v \in \mathcal{S}(\mathbb{R}^n)$. This also generalizes; if $u \in \mathcal{S}(\mathbb{R}^n)$, $v \in L^p(\mathbb{R}^n)$, the first
identity continues to hold; as we will see in the next section, so does the second
identity, once we are able to interpret what it means.

We mention the following simple application of (3.35), to a short calculation
of (3.32). With $g$ given by (3.31), we have $g = g_1 * g_1$, where


(3.39)  $g_1(\xi) = 1$ for $\xi \in \left[ -\tfrac{1}{2}, \tfrac{1}{2} \right]$, $\quad 0$ otherwise.

Thus

(3.40)  $\tilde g_1(x) = (2\pi)^{-1/2} \int_{-1/2}^{1/2} e^{i x \xi}\, d\xi = (2\pi)^{-1/2}\, \frac{\sin \frac{1}{2} x}{\frac{1}{2} x},$


and then (3.32) follows immediately from (3.35).
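The short calculation above can be confirmed numerically. The sketch below computes the transform of the tent function (3.31) by direct quadrature and compares it with $(2\pi)^{1/2}\, \tilde g_1(x)^2$, the value predicted by (3.35) and (3.40) for $n = 1$. Function names and sample points are illustrative assumptions; $x = 0$ is avoided because the closed form is written with a division by $x$.

```python
import math

def tent_transform_numeric(x, n=20000):
    """(2*pi)^{-1/2} * integral over [-1, 1] of (1 - |xi|) e^{i x xi} d xi,
    reduced by evenness to 2 * integral over [0, 1] of (1 - xi) cos(x xi) d xi."""
    h = 1.0 / n
    total = sum((1.0 - (k + 0.5) * h) * math.cos(x * (k + 0.5) * h) for k in range(n))
    return 2.0 * total * h / math.sqrt(2.0 * math.pi)

def box_transform(x):
    """Formula (3.40): transform of the indicator function of [-1/2, 1/2]."""
    return math.sin(x / 2.0) / (x / 2.0) / math.sqrt(2.0 * math.pi)

def tent_transform_via_convolution(x):
    """(2*pi)^{1/2} * box_transform(x)^2, as (3.35) predicts for g = g_1 * g_1."""
    return math.sqrt(2.0 * math.pi) * box_transform(x) ** 2

for x in (0.5, 1.7, 6.0):
    print(x, tent_transform_numeric(x), tent_transform_via_convolution(x))
```

The two columns should agree to many digits, and both equal the right side of (3.32).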


Exercises 


1. Show that $\mathcal{F} : L^1(\mathbb{R}^n) \to C_0(\mathbb{R}^n)$, where $C_0(\mathbb{R}^n)$ denotes the space of functions $v$,
continuous on $\mathbb{R}^n$, such that $v(\xi) \to 0$ as $|\xi| \to \infty$.
(Hint: Use the denseness of $\mathcal{S}(\mathbb{R}^n)$ in $L^1(\mathbb{R}^n)$.)
This result is the Riemann-Lebesgue lemma for the Fourier transform.

2. Show that the Fourier transforms (3.1) and (3.22) coincide on $L^1(\mathbb{R}^n) \cap L^2(\mathbb{R}^n)$.

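3. For $f \in L^1(\mathbb{R})$, set $S_R f(x) = (2\pi)^{-1/2} \int_{-R}^{R} \hat f(\xi)\, e^{i x \xi}\, d\xi$. Show that


$S_R f(x) = D_R * f(x) = \int_{-\infty}^{\infty} D_R(x - y)\, f(y)\, dy,$


where

$D_R(x) = (2\pi)^{-1} \int_{-R}^{R} e^{i x \xi}\, d\xi = \frac{\sin Rx}{\pi x}.$

Compare Exercise 5 of §1.

4. Show that $f \in L^2(\mathbb{R}) \Rightarrow S_R f \to f$ in $L^2$-norm as $R \to \infty$.

5. Show that there exist $f \in L^1(\mathbb{R})$ such that $S_R f \notin L^1(\mathbb{R})$ for any $R \in (0, \infty)$.
(Hint: Note that $D_R \notin L^1(\mathbb{R})$.)

6. For $f \in L^1(\mathbb{R})$, set


$C_R f(x) = (2\pi)^{-1/2} \int_{-R}^{R} \left( 1 - \frac{|\xi|}{R} \right) \hat f(\xi)\, e^{i x \xi}\, d\xi.$


Show that $C_R f(x) = E_R * f(x)$, where


$E_R(x) = (2\pi)^{-1} \int_{-R}^{R} \left( 1 - \frac{|\xi|}{R} \right) e^{i x \xi}\, d\xi = \frac{2}{\pi R} \left[ \frac{\sin \frac{1}{2} R x}{x} \right]^2.$


Note that $E_R \in L^1(\mathbb{R})$. Show that, for $1 \le p < \infty$,


$f \in L^p(\mathbb{R}) \Longrightarrow C_R f \to f$ in $L^p$-norm, as $R \to \infty$.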
We say the Fourier transform of $f$ is Cesàro-summable if $C_R f \to f$ as $R \to \infty$.


In Exercises 7-13, suppose $f \in \mathcal{S}(\mathbb{R})$ has the following properties: $f \ge 0$,
$\int_{-\infty}^{\infty} f(x)\, dx = 1$, and $\int_{-\infty}^{\infty} x f(x)\, dx = 0$. Set $F(\xi) = (2\pi)^{1/2} \hat f(\xi)$. The point of
the exercises is to obtain a version of the central limit theorem. A much stronger result,
making use of the Fourier transform on tempered distributions, is given in Appendix
B at the end of this chapter.

7. Show that $F(0) = 1$, $F'(0) = 0$, $F''(0) = -2a < 0$. Also, $\xi \neq 0 \Rightarrow |F(\xi)| < 1$.


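8. Set $F_n(\xi) = F(\xi/\sqrt{n})^n$. Relate $(2\pi)^{-1/2} \tilde F_n(x)$ to the convolution of $n$ copies of $f$.

9. Show that there exist $A > 0$ and $G \in C^\infty([-A, A])$ such that $F(\xi) = e^{-a\xi^2} G(\xi)$ for
$|\xi| \le A$, and $G(0) = 1$, $G'(0) = G''(0) = 0$. Hence


$F_n(\xi) = e^{-a\xi^2}\, G(n^{-1/2} \xi)^n, \quad \text{for } |\xi| \le A\sqrt{n}.$


10. Show that $|G(\xi/\sqrt{n})^n - 1| \le C n^{-\alpha}$ if $|\xi| \le n^{(1/2 - \alpha)/3}$, for $n$ large.
Fix $\alpha \in (0, \tfrac{1}{2})$, and set $\gamma = (1/2 - \alpha)/3 \in (0, \tfrac{1}{6})$.

11. Show that, for $|\xi| \ge n^\gamma$, $|F(\xi/\sqrt{n})| \le 1 - \tfrac{1}{2} a\, n^{-(1 - 2\gamma)}$, for $n$ large, so $|F_n(\xi)| \le
e^{-a n^{2\gamma}/4} = \delta_n$. Deduce that

$\int_{|\xi| \ge n^\gamma} |F_n(\xi)|\, d\xi \le C \delta_n^{(n-1)/n} \sqrt{n} \to 0, \quad \text{as } n \to \infty.$

12. From Exercises 9-11, deduce that $F_n \to e^{-a\xi^2}$ in $L^1(\mathbb{R})$ as $n \to \infty$.

13. Deduce now that $(2\pi)^{-1/2} \tilde F_n \to (4\pi a)^{-1/2} e^{-x^2/4a}$ in both $C_0(\mathbb{R})$ and $L^1(\mathbb{R})$, as
$n \to \infty$. Relate this to the central limit theorem of probability theory. Weaken the
hypotheses on $f$ as much as you can.

(Hint: In passing from the $C_0$-result to the $L^1$-result, positivity of $\tilde F_n$ will be useful.)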

14. With $p_\varepsilon(x) = (4\pi\varepsilon)^{-1/2} e^{-x^2/4\varepsilon}$, as in (3.13) for $n = 1$, show that, for any $u(x)$,
continuous and compactly supported on $\mathbb{R}$, $p_\varepsilon * u \to u$ uniformly as $\varepsilon \to 0$. Show
that for each $\varepsilon > 0$, $p_\varepsilon * u(x)$ is the restriction to $\mathbb{R}$ of an entire holomorphic function
of $x \in \mathbb{C}$.

15. Using Exercise 14, prove the Weierstrass approximation theorem:

Any $f \in C([a, b])$ is a uniform limit of polynomials.
(Hint: Extend $f$ to $u$ as above, approximate $u$ by $p_\varepsilon * u$, and expand this in a power
series.)

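16. Suppose $f \in \mathcal{S}(\mathbb{R}^n)$ is supported in $B_R = \{x \in \mathbb{R}^n : |x| \le R\}$. Show that $\hat f(\xi)$ is
holomorphic in $\xi \in \mathbb{C}^n$ and satisfies

(3.41)  $|\hat f(\xi + i\eta)| \le C_N \langle \xi \rangle^{-N} e^{R|\eta|}, \quad \xi, \eta \in \mathbb{R}^n.$

17. Conversely, suppose $g(\xi) = \hat f(\xi) \in \mathcal{S}(\mathbb{R}^n)$ has a holomorphic extension to $\mathbb{C}^n$ satisfying
(3.41). Show that $f$ is supported in $|x| \le R$.
(Hint: With $\omega = x/|x|$, $r \ge 0$, write


(3.42)  $f(x) = (2\pi)^{-n/2} \int_{\mathbb{R}^n} \hat f(\xi + i r \omega)\, e^{i x \cdot \xi - r|x|}\, d\xi,$

with

$|\hat f(\xi + i r \omega)\, e^{i x \cdot \xi - r|x|}| \le C_N \langle \xi \rangle^{-N} e^{r(R - |x|)},$

and let $r \to +\infty$.)
This is a basic case of the Paley-Wiener theorem.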
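18. Given $f \in L^2(\mathbb{R}^n)$, show that $f$ is supported in $B_R$ if and only if $\hat f(\xi)$ is holomorphic
in $\mathbb{C}^n$, satisfying

$|\hat f(\xi + i\eta)| \le C e^{R|\eta|}, \quad \xi, \eta \in \mathbb{R}^n.$


Reconsider this problem after reading §4.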
19. Show that 


(Hint: See (A.13)-(A.15).) 


4. Distributions and tempered distributions 


L. Schwartz’s theory of distributions has proved to be not only a wonderful tool in 
partial differential equations, but also a device that lends clarity to many aspects of 
Fourier analysis. We sketch the basic concepts of distribution theory here, making 
use of such basic concepts as Fréchet spaces and weak topologies, which are 
treated in Appendix A, Functional Analysis. 

We begin with the concept of a tempered distribution. This is a continuous
linear functional


(4.1)  $w : \mathcal{S}(\mathbb{R}^n) \to \mathbb{C},$


where $\mathcal{S}(\mathbb{R}^n)$ is the Schwartz space defined in §3. The space $\mathcal{S}(\mathbb{R}^n)$ has a topology,
determined by the seminorms


(4.2)  $p_k(u) = \sum_{|\alpha| + |\beta| \le k}\ \sup_{x \in \mathbb{R}^n} |x^\beta D^\alpha u(x)|.$


The distance function

(4.3)  $d(u, v) = \sum_{k=0}^{\infty} 2^{-k}\, \frac{p_k(u - v)}{1 + p_k(u - v)}$


makes $\mathcal{S}(\mathbb{R}^n)$ a complete metric space; with such a topology it is a Fréchet space.
For a linear map $w$ as in (4.1) to be continuous, it is necessary and sufficient that,
for some $k$, $C$,


(4.4)  $|w(u)| \le C\, p_k(u), \quad \text{for all } u \in \mathcal{S}(\mathbb{R}^n).$

The action of $w$ is often written as follows:

(4.5)  $w(u) = \langle u, w \rangle.$

The set of all continuous linear functionals on $\mathcal{S}(\mathbb{R}^n)$ is denoted

(4.6)  $\mathcal{S}'(\mathbb{R}^n)$

and is called the space of tempered distributions.

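The space $\mathcal{S}'(\mathbb{R}^n)$ has a topology, called the weak* topology, or sometimes
simply the weak topology, in terms of which a directed family $w_\gamma$ converges to $w$
weakly in $\mathcal{S}'(\mathbb{R}^n)$ if and only if, for each $u \in \mathcal{S}(\mathbb{R}^n)$, $\langle u, w_\gamma \rangle \to \langle u, w \rangle$. One can
also consider the strong topology on $\mathcal{S}'(\mathbb{R}^n)$, the topology of uniform convergence
on bounded subsets of $\mathcal{S}(\mathbb{R}^n)$, but we will not consider this explicitly. For more on
the topology of $\mathcal{S}$ and $\mathcal{S}'$, see [H, Sch, Yo]. We now consider examples of tempered
distributions.

There is a natural injection


(4.7)  $L^p(\mathbb{R}^n) \hookrightarrow \mathcal{S}'(\mathbb{R}^n),$


for any $p \in [1, \infty]$, given by


(4.8)  $\langle u, f \rangle = \int u(x) f(x)\, dx, \quad u \in \mathcal{S}(\mathbb{R}^n), \ f \in L^p(\mathbb{R}^n).$


Similarly any finite measure on $\mathbb{R}^n$ gives an element of $\mathcal{S}'(\mathbb{R}^n)$. The basic example
is the Dirac "delta function" $\delta$, defined by


(4.9)  $\langle u, \delta \rangle = u(0).$

Also, each differential operator $D_j = -i\, \partial/\partial x_j$ acts on $\mathcal{S}'(\mathbb{R}^n)$, by the definition

(4.10)  $\langle u, D_j w \rangle = -\langle D_j u, w \rangle, \quad u \in \mathcal{S}, \ w \in \mathcal{S}'.$

Iterating, we see that each $D^\alpha = D_1^{\alpha_1} \cdots D_n^{\alpha_n}$ acts on $\mathcal{S}'$:

(4.11)  $D^\alpha : \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n),$

and we have

(4.12)  $\langle u, D^\alpha w \rangle = (-1)^{|\alpha|} \langle D^\alpha u, w \rangle$

for $u \in \mathcal{S}$, $w \in \mathcal{S}'$. Similarly,


$\langle u, f w \rangle = \langle f u, w \rangle$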
defines $fw$ for $w \in \mathcal{S}'$, provided that $f$ and each of its derivatives is polynomially
bounded.
To illustrate, consider on $\mathbb{R}$ the Heaviside function


(4.13)  $H(x) = 1$ if $x \ge 0$, $\quad H(x) = 0$ if $x < 0$.


Then $H \in L^\infty(\mathbb{R}) \subset \mathcal{S}'(\mathbb{R})$, and the definition (4.10) gives


(4.14)  $\frac{dH}{dx} = \delta,$


as a consequence of the fundamental theorem of calculus. The derivative of $\delta$ is
characterized by


(4.15)  $\langle u, \delta' \rangle = -u'(0)$


in this case.
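The identity (4.14) says that pairing $dH/dx$ with a test function $u$, computed by moving the derivative onto $u$, returns $u(0)$. The sketch below checks this numerically for one sample test function; the choice $u(x) = (1 + x) e^{-x^2}$, the truncation at $L = 10$, and the function names are illustrative assumptions.

```python
import math

def u(x):
    return (1.0 + x) * math.exp(-x * x)

def du(x):
    """Derivative of u, computed by hand: u'(x) = (1 - 2x - 2x^2) e^{-x^2}."""
    return (1.0 - 2.0 * x - 2.0 * x * x) * math.exp(-x * x)

def pairing_with_dH(L=10.0, n=20000):
    """<u, dH/dx> = -<u', H> = -(integral over [0, L] of u'(x) dx), by the midpoint rule.

    The integral over [0, infinity) is truncated at L, where u is negligible.
    """
    h = L / n
    return -sum(du((k + 0.5) * h) for k in range(n)) * h

print(pairing_with_dH(), u(0.0))
```

Both printed values should be close to 1, which is $\langle u, \delta \rangle = u(0)$ for this test function.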

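The Fourier transform $\mathcal{F} : \mathcal{S}(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n)$, studied in §3, extends to $\mathcal{S}'$ in the
following fashion. First, by (3.5) and the estimate $\|\mathcal{F} u\|_{L^\infty} \le (2\pi)^{-n/2} \|u\|_{L^1}$,
we have

$p_k(\mathcal{F} u) \le C_k\, p_{k+n+1}(u),$

so $\mathcal{F}$ is a continuous linear map on $\mathcal{S}(\mathbb{R}^n)$. We can hence define the extension to
$\mathcal{S}'(\mathbb{R}^n)$ by


(4.16)  $\langle u, \mathcal{F} w \rangle = \langle \mathcal{F} u, w \rangle;$


we can also set


(4.17)  $\langle u, \mathcal{F}^* w \rangle = \langle \mathcal{F}^* u, w \rangle$

to get

(4.18)  $\mathcal{F}, \mathcal{F}^* : \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n).$


The maps (4.18) are continuous when $\mathcal{S}'(\mathbb{R}^n)$ is given the weak* topology, as
follows easily from the definitions.
The Fourier inversion formula of Proposition 3.1 yields:

Proposition 4.1. We have

(4.19)  $\mathcal{F}^* \mathcal{F} = \mathcal{F} \mathcal{F}^* = I$ on $\mathcal{S}'(\mathbb{R}^n)$.

Proof. Using (4.16) and (4.17), if $u \in \mathcal{S}$, $w \in \mathcal{S}'$,


$\langle u, \mathcal{F}^* \mathcal{F} w \rangle = \langle \mathcal{F}^* u, \mathcal{F} w \rangle = \langle \mathcal{F} \mathcal{F}^* u, w \rangle = \langle u, w \rangle,$

and a similar analysis works for $\mathcal{F} \mathcal{F}^* w$.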


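As an example of a Fourier transform of a tempered distribution, the definition
gives directly


(4.20)  $\mathcal{F} \delta = (2\pi)^{-n/2};$


the Fourier transform of the delta function is a constant function. One has the
same result for $\mathcal{F}^* \delta$. By the Fourier inversion formula,


(4.21)  $\mathcal{F} 1 = (2\pi)^{n/2} \delta.$

Next, let us consider on the line $\mathbb{R}$, for any $\varepsilon > 0$,


(4.22)  $H_\varepsilon(x) = e^{-\varepsilon x}$ for $x \ge 0$, $\quad H_\varepsilon(x) = 0$ for $x < 0$.


We have, by elementary calculation,


(4.23)  $\hat H_\varepsilon(\xi) = (2\pi)^{-1/2} \int_0^{\infty} e^{-\varepsilon x - i x \xi}\, dx = (2\pi)^{-1/2} (\varepsilon + i\xi)^{-1},$


for each $\varepsilon > 0$. Now it is clear that

(4.24)  $H_\varepsilon \to H$ as $\varepsilon \searrow 0$, in $\mathcal{S}'(\mathbb{R})$,

in the weak* topology. It follows that


(4.25)  $\hat H(\xi) = (2\pi)^{-1/2} \lim_{\varepsilon \searrow 0} (\varepsilon + i\xi)^{-1}$ in $\mathcal{S}'(\mathbb{R})$.


In particular, the limit on the right exists in $\mathcal{S}'(\mathbb{R})$. Changing the sign of $x$ in
(4.22)-(4.24) and noting that $H(-x) = 1 - H(x)$, we also have


(4.26)  $(2\pi)^{1/2} \delta - \hat H = (2\pi)^{-1/2} \lim_{\varepsilon \searrow 0} (\varepsilon - i\xi)^{-1}$ in $\mathcal{S}'(\mathbb{R})$.

Let us set

(4.27)  $(\xi \pm i0)^{-1} = \lim_{\varepsilon \searrow 0} (\xi \pm i\varepsilon)^{-1}.$


Then (4.25) and (4.26) give

(4.28)  $\hat H = -i (2\pi)^{-1/2} (\xi - i0)^{-1}$


and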

and 
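(4.29)  $(\xi + i0)^{-1} - (\xi - i0)^{-1} = -2\pi i\, \delta.$


The last identity is often called the Plemelj jump relation. Also, subtracting (4.25)
from (4.26) gives


(4.30)  $(\xi + i0)^{-1} + (\xi - i0)^{-1} = (2\pi)^{1/2}\, i\ \widehat{\operatorname{sgn}}(\xi),$

where

(4.31)  $\operatorname{sgn}(x) = 1$ if $x \ge 0$, $\quad \operatorname{sgn}(x) = -1$ if $x < 0$.

It is also an easy exercise to show that

(4.32)  $(\xi + i0)^{-1} + (\xi - i0)^{-1} = 2\, \mathrm{PV}\left( \frac{1}{\xi} \right),$

where the "principal value" distribution

(4.33)  $\mathrm{PV}\left( \frac{1}{x} \right) \in \mathcal{S}'(\mathbb{R})$

is defined by

(4.34)  $\left\langle u, \mathrm{PV}\left( \frac{1}{x} \right) \right\rangle = \lim_{h \searrow 0} \int_{\mathbb{R} \setminus (-h, h)} \frac{u(x)}{x}\, dx = \lim_{h \searrow 0} \int_h^{\infty} \frac{u(x) - u(-x)}{x}\, dx = \int_0^{\infty} \frac{u(x) - u(-x)}{x}\, dx.$

Note that if we replace the left side of (4.29) by

$\frac{1}{\xi + i\varepsilon} - \frac{1}{\xi - i\varepsilon} = -\frac{2i\varepsilon}{\xi^2 + \varepsilon^2} = -\frac{2i}{\varepsilon} \cdot \frac{1}{(\xi/\varepsilon)^2 + 1},$


the conclusion (4.29) is a special case of the following obvious result.

Proposition 4.2. If $f \in L^1(\mathbb{R}^n)$, $\int f(x)\, dx = C_0$, then, as $\varepsilon \to 0$,

(4.35)  $\varepsilon^{-n} f(\varepsilon^{-1} x) \to C_0\, \delta$ in $\mathcal{S}'(\mathbb{R}^n)$.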

That $\delta$ is the limit of a sequence of elements of $\mathcal{S}(\mathbb{R}^n)$ is a special case of the fact
that $\mathcal{S}(\mathbb{R}^n)$ is dense in $\mathcal{S}'(\mathbb{R}^n)$, which will be established shortly.

Given $w \in \mathcal{S}'(\mathbb{R}^n)$, if $\Omega \subset \mathbb{R}^n$ is open, one says $w$ vanishes on $\Omega$ provided
$\langle u, w \rangle = 0$ for all $u \in C_0^\infty(\Omega)$. By a partition of unity argument, it follows that if
$w$ vanishes on $\Omega_j$, then it vanishes on their union; $w$ is said to be supported on a
closed set $K \subset \mathbb{R}^n$ if $w$ vanishes on $\mathbb{R}^n \setminus K$. The smallest closed set $K$ on which
$w$ is supported exists; it is denoted supp $w$. Note that if $w$ vanishes on all of $\mathbb{R}^n$,
then $w = 0$, since $C_0^\infty(\mathbb{R}^n)$ is dense in $\mathcal{S}(\mathbb{R}^n)$. (See the first part of the proof of
Proposition 4.4 below.) If $w \in \mathcal{S}'(\mathbb{R}^n)$ is supported on a compact set $K \subset \mathbb{R}^n$,
we say $w$ has compact support. The space of compactly supported distributions
on $\mathbb{R}^n$ is denoted by


(4.36)  $\mathcal{E}'(\mathbb{R}^n).$


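If $w \in \mathcal{S}'(\mathbb{R}^n)$ is supported on a compact set $K \subset \mathbb{R}^n$, then $w$ can be extended to
a continuous linear functional


(4.37)  $w : C^\infty(\mathbb{R}^n) \to \mathbb{C}$

by setting

(4.38)  $\langle u, w \rangle = \langle \chi u, w \rangle, \quad u \in C^\infty(\mathbb{R}^n),$


for any $\chi \in C_0^\infty(\mathbb{R}^n)$ such that $\chi = 1$ on a neighborhood of $K$. The space
$C^\infty(\mathbb{R}^n)$ is also a Fréchet space, with topology defined by the seminorms


(4.39)  $p_{R,k}(u) = \sup_{|x| \le R}\ \sum_{|\alpha| \le k} |D^\alpha u(x)|.$


To say a linear map (4.37) is continuous is to say there exist $R$, $k$, and $C$ such that

(4.40)  $|w(u)| \le C\, p_{R,k}(u), \quad \text{for all } u \in C^\infty(\mathbb{R}^n).$


Such a linear functional restricts to $\mathcal{S}(\mathbb{R}^n) \subset C^\infty(\mathbb{R}^n)$, so it defines an element
of $\mathcal{S}'(\mathbb{R}^n)$, and from (4.40) it follows that such an element must be supported in
the compact set $\{x \in \mathbb{R}^n : |x| \le R\}$. Thus the space (4.36) is precisely the dual
space of $C^\infty(\mathbb{R}^n)$.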

Fourier transforms of compactly supported distributions have some special 
properties. 


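Proposition 4.3. If $w \in \mathcal{E}'(\mathbb{R}^n)$, then $\hat w \in C^\infty(\mathbb{R}^n)$ and, with $e_\xi(x) = e^{-i x \cdot \xi}$,

(4.41)  $\hat w(\xi) = (2\pi)^{-n/2} \langle e_\xi, w \rangle,$


for all $\xi \in \mathbb{R}^n$. Furthermore, $\hat w$ extends to an entire holomorphic function of
$\xi \in \mathbb{C}^n$.


Proof. For any $u \in \mathcal{S}(\mathbb{R}^n)$, $\langle u, \hat w \rangle = \langle \hat u, w \rangle$. Now we can write

(4.42)  $\hat u(x) = (2\pi)^{-n/2} \int u(\xi)\, e_\xi(x)\, d\xi,$


the integral converging in the Fréchet space topology of $C^\infty(\mathbb{R}^n)$, and the continuity
of $w$ acting on $C^\infty(\mathbb{R}^n)$ implies


$\langle \hat u, w \rangle = (2\pi)^{-n/2} \int u(\xi)\, \langle e_\xi, w \rangle\, d\xi,$


which gives (4.41). The right side of (4.41) is clearly holomorphic in $\xi \in \mathbb{C}^n$.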
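We next obtain the promised denseness of $\mathcal{S}(\mathbb{R}^n)$ in $\mathcal{S}'(\mathbb{R}^n)$.

Proposition 4.4. $C_0^\infty(\mathbb{R}^n)$ is dense in $\mathcal{S}'(\mathbb{R}^n)$, with its weak* topology.


Proof. Pick $\varphi \in C_0^\infty(\mathbb{R}^n)$, $\varphi(0) = 1$. It is easy to see that, given $w \in \mathcal{S}'(\mathbb{R}^n)$,
$\varphi w$ is well defined, by $\langle u, \varphi w \rangle = \langle \varphi u, w \rangle$. Also, if $\varphi_j(x) = \varphi(x/j)$, then for
$u \in \mathcal{S}(\mathbb{R}^n)$,


(4.43)  $\varphi_j u \to u$ in $\mathcal{S}(\mathbb{R}^n)$,


which gives sequential denseness of $C_0^\infty(\mathbb{R}^n)$ in $\mathcal{S}(\mathbb{R}^n)$. Hence $\varphi_j w \to w$
in $\mathcal{S}'(\mathbb{R}^n)$ as $j \to \infty$. Since $\mathcal{F}$ and $\mathcal{F}^*$ are continuous on $\mathcal{S}'(\mathbb{R}^n)$, we have
$\mathcal{F}^*(\varphi_j \hat w) \to w$ as $j \to \infty$. Now for each $j$, $w_{jk} = \varphi_k \mathcal{F}^*(\varphi_j \hat w) \to \mathcal{F}^*(\varphi_j \hat w)$ as
$k \to \infty$. But by Proposition 4.3, $\mathcal{F}^*(\varphi_j \hat w)$ is smooth, so $w_{jk} \in C_0^\infty(\mathbb{R}^n)$, and the
result follows.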


One useful result is the following classification of distributions supported at a 
single point. 


Proposition 4.5. If $w \in \mathcal{S}'(\mathbb{R}^n)$ is supported by $\{0\}$, then there exist $k$ and complex
numbers $a_\alpha$ such that


(4.44)  $w = \sum_{|\alpha| \le k} a_\alpha D^\alpha \delta.$


Proof. We can suppose $w$ satisfies the estimate (4.4). Thus $w$ extends to $B_k$, the
closure of $\mathcal{S}(\mathbb{R}^n)$ in the space of $C^k$-functions on $\mathbb{R}^n$ for which the norm $p_k$ is
finite. By hypothesis, $w$ annihilates the linear space $E_0$ of elements of $C_0^\infty(\mathbb{R}^n)$
vanishing on a neighborhood of 0; thus $w$ annihilates the closure of $E_0$ in $B_k$; call
this closure $E_k$. It is not hard to prove that

(4.45)  $E_k = \{u \in B_k : D^\alpha u(0) = 0 \text{ for } |\alpha| \le k\}.$

See Exercise 7 below for some hints. Now, for general $u \in B_k$, write

(4.46)  $u(x) = \chi(x) \sum_{|\alpha| \le k} \frac{u^{(\alpha)}(0)}{\alpha!}\, x^\alpha + u^b(x),$


where $\chi \in C_0^\infty(\mathbb{R}^n)$, $\chi(x) = 1$ for $|x| < 1$, and $u^b \in E_k$. Applying $w$ to both
sides, we have an expression of the form (4.44), with $a_\alpha = (-1)^{|\alpha|} \langle \chi x^\alpha, w \rangle / \alpha!$.

As an application of Proposition 4.5, we establish the following result, which 
is an extension of the classical Liouville theorem for harmonic functions. 


Proposition 4.6. Suppose $u \in \mathcal{S}'(\mathbb{R}^n)$ satisfies

(4.47)  $\Delta u = 0$ in $\mathbb{R}^n$.

Then $u$ is a polynomial in $(x_1, \dots, x_n)$.

Proof. As in §3, the identity

(4.48)  $\Delta u = f \in \mathcal{S}'(\mathbb{R}^n)$

is equivalent to

(4.49)  $-|\xi|^2 \hat u = \hat f$ in $\mathcal{S}'(\mathbb{R}^n)$.

In particular, (4.47) for $u \in \mathcal{S}'(\mathbb{R}^n)$ implies

(4.50)  $|\xi|^2 \hat u = 0$ in $\mathcal{S}'(\mathbb{R}^n)$.

This of course implies

(4.51)  $\operatorname{supp} \hat u \subset \{0\}.$

By Proposition 4.5, $\hat u$ must have the form (4.44), and the result follows.
It is clear that any nonconstant polynomial blows up, so we have:


Corollary 4.7. If $u$ is harmonic on $\mathbb{R}^n$ and bounded, then $u$ is constant.


This is the classical version of the Liouville theorem. We remind the reader
of one of its uses. If $p(z)$ is a polynomial on $\mathbb{C}$, and if it has no zeros, then
$q(z) = 1/p(z)$ is holomorphic (hence harmonic) on all of $\mathbb{C}$; clearly, $|q(z)| \to 0$
as $|z| \to \infty$ if $\deg p \ge 1$. Corollary 4.7 yields the obvious contradiction that $q(z)$
would have to be constant. This proves the fundamental theorem of algebra: Any
nonconstant polynomial $p(z)$ must have a complex root.

See §3 of Chapter 5 for stronger Liouville theorems. 

Moving on, let us consider the function

(4.52)    $\Psi(z) = \frac{1}{z}.$

This is holomorphic on $\mathbb{C} \setminus \{0\}$. It is integrable near 0 and bounded outside a
neighborhood of 0, and hence it defines an element of $\mathcal{S}'(\mathbb{R}^2)$. Since $\Psi$ is
holomorphic away from the origin, $(\partial/\partial \bar z)\Psi \in \mathcal{S}'(\mathbb{R}^2)$ is supported at $\{0\}$.
In fact, we claim:


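Proposition 4.8. We have

(4.53)    $\frac{\partial}{\partial \bar z}\, \frac{1}{z} = \pi \delta.$


Proof. Let $u \in C_0^\infty(\mathbb{R}^2)$. We have

(4.54)    $\Big\langle u, \frac{\partial}{\partial \bar z} \frac{1}{z} \Big\rangle = -\iint_{\mathbb{R}^2} \frac{\partial u}{\partial \bar z}\, z^{-1}\, dx\, dy = -\lim_{\varepsilon \to 0} \iint_{\mathbb{R}^2 \setminus B_\varepsilon} \frac{\partial (z^{-1} u)}{\partial \bar z}\, dx\, dy,$

where $B_\varepsilon = \{(x,y) \in \mathbb{R}^2 : x^2 + y^2 < \varepsilon^2\}$. By Green's formula, in the form
(2.37), the right side is equal to

(4.55)    $\frac{1}{2i} \int_{\partial B_\varepsilon} \frac{u}{z}\, dz,$

with $\partial B_\varepsilon$ oriented counterclockwise, which is clearly equal in the limit
$\varepsilon \to 0$ to $\pi u(0)$. This proves the proposition.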
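
The limit of the boundary integral (4.55) can be checked numerically. The following is a minimal sketch, not part of the text's argument: the function `u` below is a hypothetical smooth, rapidly decaying stand-in for a test function (it is not compactly supported), and the circle is discretized with a plain Riemann sum.

```python
import cmath
import math

def boundary_integral(u, eps, m=2000):
    """Approximate (1/(2i)) * integral of u(x, y)/z dz over the circle |z| = eps,
    traversed counterclockwise, by a Riemann sum in the angle theta."""
    total = 0j
    dtheta = 2 * math.pi / m
    for j in range(m):
        z = eps * cmath.exp(1j * j * dtheta)
        dz = 1j * z * dtheta              # dz = i z d(theta) on the circle
        total += u(z.real, z.imag) / z * dz
    return total / 2j

def u(x, y):
    # Hypothetical stand-in for a test function; u(0, 0) = 2.
    return (2 + x + 3 * y) * math.exp(-(x * x + y * y))

value = boundary_integral(u, 1e-3)
print(value, math.pi * u(0.0, 0.0))
```

For small `eps` the computed value should be close to $\pi u(0)$, which is the content of (4.53).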


We say $(\pi z)^{-1}$ is a fundamental solution of $\partial/\partial \bar z$. We will say more about
the use of fundamental solutions later. Let us look at the task of producing a
fundamental solution for the Laplace operator $\Delta$ on $\mathbb{R}^n$. In view of the rotational
invariance of $\Delta$, we are led to look for a function of $r = |x|$, for $x \ne 0$. The form

(4.56)    $\Delta = \frac{\partial^2}{\partial r^2} + \frac{n-1}{r} \frac{\partial}{\partial r} + \frac{1}{r^2} \Delta_S$

for $\Delta$ in polar coordinates, where $\Delta_S$ is the Laplace operator on the unit sphere
$S^{n-1}$, shows that, for $n \ge 3$,

(4.57)    $|x|^{2-n}$

is harmonic for $x \ne 0$. As it is locally integrable near 0, it defines an element of
$\mathcal{S}'(\mathbb{R}^n)$. We have the following result.


Proposition 4.9. If $n \ge 3$,

(4.58)    $\Delta(|x|^{2-n}) = C_n \delta$ on $\mathbb{R}^n$,

with $C_n = -(n-2) \cdot \operatorname{Area}(S^{n-1})$. Also,

(4.59)    $\Delta(\log |x|) = C_2 \delta$ on $\mathbb{R}^2$,

with $C_2 = 2\pi$.


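Proof. This will use Green's formula, in the form

(4.60)    $\int_\Omega (\Delta u \cdot v - u \cdot \Delta v)\, dx = \int_{\partial \Omega} \Big[ v \frac{\partial u}{\partial \nu} - u \frac{\partial v}{\partial \nu} \Big]\, dS.$

Let $u \in C_0^\infty(\mathbb{R}^n)$ be arbitrary. Let $v = |x|^{2-n}$, and let $\Omega = \Omega_\varepsilon = \mathbb{R}^n \setminus B_\varepsilon$, where
$B_\varepsilon = \{x \in \mathbb{R}^n : |x| < \varepsilon\}$. We have

(4.61)    $\langle \Delta u, |x|^{2-n} \rangle = \lim_{\varepsilon \to 0} \int_{\Omega_\varepsilon} \Delta u \cdot |x|^{2-n}\, dx = \lim_{\varepsilon \to 0} \int_{\Omega_\varepsilon} \big[ \Delta u \cdot |x|^{2-n} - u \cdot \Delta |x|^{2-n} \big]\, dx,$

since $\Delta |x|^{2-n} = 0$ for $x \ne 0$. Applying (4.60), and noting that the outward normal
to $\Omega_\varepsilon$ along $\partial B_\varepsilon$ is $-\partial/\partial r$, we have this equal to

(4.62)    $-\lim_{\varepsilon \to 0} \int_{\partial B_\varepsilon} \Big[ \varepsilon^{2-n} \frac{\partial u}{\partial r} - (2-n)\, \varepsilon^{1-n} u \Big]\, dS.$

Since the area of $\partial B_\varepsilon$ is $\varepsilon^{n-1} \operatorname{Area} S^{n-1}$, this limit is seen to be

(4.63)    $-(n-2)\, u(0) \cdot \operatorname{Area} S^{n-1},$

which proves (4.58). The proof of (4.59) is similar.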
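
The two ingredients of this proof, harmonicity of $|x|^{2-n}$ away from the origin and the value of the flux through a small sphere, can be sanity-checked numerically. The sketch below is only an illustration under stated assumptions: it uses the radial part of (4.56) with central differences, and it takes the standard formula $\operatorname{Area}(S^{n-1}) = 2\pi^{n/2}/\Gamma(n/2)$ as given. The function names are hypothetical.

```python
import math

def radial_laplacian(v, r, n, h=1e-3):
    """Radial part of (4.56): v'' + (n - 1)/r * v', by central differences."""
    d1 = (v(r + h) - v(r - h)) / (2 * h)
    d2 = (v(r + h) - 2 * v(r) + v(r - h)) / (h * h)
    return d2 + (n - 1) / r * d1

def sphere_area(n):
    """Area of the unit sphere S^{n-1} in R^n (assumed formula 2 pi^{n/2} / Gamma(n/2))."""
    return 2 * math.pi ** (n / 2) / math.gamma(n / 2)

def flux_constant(n, eps=1e-2):
    """Radial derivative of r^{2-n} at r = eps, times the area of the sphere of radius eps."""
    dv = (2 - n) * eps ** (1 - n)
    return dv * eps ** (n - 1) * sphere_area(n)

# r^{2-n} should be annihilated by the radial Laplacian for n >= 3.
residuals = [abs(radial_laplacian(lambda r: r ** (2 - n), 1.3, n)) for n in (3, 4, 5)]
c3 = flux_constant(3)   # compare with C_3 = -(3 - 2) * Area(S^2) = -4 pi
print(residuals, c3)
```

The flux is independent of `eps`, which is why the limit in (4.62) produces the constant $C_n = -(n-2)\operatorname{Area}(S^{n-1})$.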


Calculations yielding expressions for the area of $S^{n-1}$ will be given in the
appendix to this chapter.

Note that the equation

(4.64)    $\Delta \Phi = \delta$ on $\mathbb{R}^n$,

with $\Phi \in \mathcal{S}'(\mathbb{R}^n)$, is equivalent to

(4.65)    $-|\xi|^2 \hat \Phi = (2\pi)^{-n/2}.$

If $n \ge 3$, $|\xi|^{-2} \in L^1_{\mathrm{loc}}(\mathbb{R}^n)$ and one solution to (4.65) is

(4.66)    $\hat \Phi(\xi) = -(2\pi)^{-n/2} |\xi|^{-2} \in \mathcal{S}'(\mathbb{R}^n),$


in such a case. We can relate this directly to (4.58) as follows. The orthogonal
group $O(n)$ acts on $\mathcal{S}(\mathbb{R}^n)$, by

(4.67)    $\pi(g) u(x) = u(g^{-1} x), \quad x \in \mathbb{R}^n, \ g \in O(n),$

and this extends to an action on $\mathcal{S}'(\mathbb{R}^n)$, via $\langle u, \pi(g) v \rangle = \langle \pi(g^{-1}) u, v \rangle$. This
action commutes with $\mathcal{F}$. Thus the Fourier transform of an element like $|x|^{2-n}$
which is invariant under the $O(n)$-action will also be $O(n)$-invariant. There is also
the dilation action on $\mathcal{S}(\mathbb{R}^n)$,

(4.68)    $D(s) u(x) = u(sx), \quad s > 0, \ x \in \mathbb{R}^n,$

which extends to $\mathcal{S}'(\mathbb{R}^n)$, via $\langle u, D(t) v \rangle = t^{-n} \langle D(1/t) u, v \rangle$. We have

(4.69)    $D(s) \mathcal{F} = s^{-n} \mathcal{F} D(s^{-1}).$

The element $|x|^{2-n} \in \mathcal{S}'(\mathbb{R}^n)$ is homogeneous of degree $2 - n$, that is,

(4.70)    $D(s)(|x|^{2-n}) = s^{2-n} |x|^{2-n},$

so $\mathcal{F}(|x|^{2-n})$ will be homogeneous of degree $-2$. This establishes that $\Phi(x) =
C_n |x|^{2-n}$ satisfies (4.66), up to a constant factor. Note that $\delta \in \mathcal{S}'(\mathbb{R}^n)$ is
homogeneous of degree $-n$. Since the Laplace operator $\Delta$ decreases the order of
homogeneity of a distribution by two units, these considerations of orthogonal
invariance and homogeneity directly suggest a constant times $|x|^{2-n}$ as a suitable
candidate for a fundamental solution for the Laplace operator on $\mathbb{R}^n$.
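We mention some extensions of the convolution

(4.71)    $u * v(x) = \int u(y)\, v(x - y)\, dy,$

which, by Proposition 3.3, gives a bilinear map

(4.72)    $\mathcal{S}(\mathbb{R}^n) \times \mathcal{S}(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n).$

Note that if $u, v, w \in \mathcal{S}(\mathbb{R}^n)$,

(4.73)    $\langle u * v, w \rangle = \langle u, v^\# * w \rangle,$

where $v^\#(x) = v(-x)$, so the convolution extends in a straightforward way to

(4.74)    $\mathcal{S}(\mathbb{R}^n) \times \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n),$

with $\mathcal{S}(\mathbb{R}^n) \times \mathcal{E}'(\mathbb{R}^n) \to \mathcal{S}(\mathbb{R}^n)$, and hence

(4.75)    $\mathcal{E}'(\mathbb{R}^n) \times \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n).$

In either case, the identity

(4.76)    $\mathcal{F}(u * v) = (2\pi)^{n/2}\, \hat u(\xi)\, \hat v(\xi)$

continues to hold. For more on this, see Exercises 11-13 below. If $P$ is any
constant-coefficient differential operator, then

(4.77)    $P(u * v) = (Pu) * v = u * Pv$

in cases (4.74) and (4.75). For example, if $\Phi \in \mathcal{S}'(\mathbb{R}^n)$ and

(4.78)    $P \Phi = \delta$

(we say $\Phi$ is a fundamental solution of $P$), then a solution to

(4.79)    $Pu = f,$

for any given $f \in \mathcal{E}'(\mathbb{R}^n)$, is provided by

(4.80)    $u = f * \Phi.$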


An object more general than a tempered distribution is a distribution. In
general, a distribution on $\mathbb{R}^n$ is a continuous linear map

(4.81)    $w : C_0^\infty(\mathbb{R}^n) \to \mathbb{C}.$

Here, continuity can be characterized as follows. For each $\varphi \in C_0^\infty(\mathbb{R}^n)$, the
identity $\langle u, \varphi w \rangle = \langle \varphi u, w \rangle$ makes $\varphi w$ a linear functional on $C^\infty(\mathbb{R}^n)$. We require that
each such linear functional be continuous, in the sense specified in (4.40). For
further discussion, including a direct discussion of the natural “inductive limit”
topology on $C_0^\infty(\mathbb{R}^n)$, see [RS] and [Sch]. The space of all distributions on $\mathbb{R}^n$ is
denoted by

(4.82)    $\mathcal{D}'(\mathbb{R}^n).$

More generally, if $M$ is any smooth, paracompact manifold, the space of
continuous linear functionals on $C^\infty(M)$ is denoted by $\mathcal{E}'(M)$ and the space of
continuous linear functionals on $C_0^\infty(M)$ is denoted by $\mathcal{D}'(M)$. Of course, if
$M$ is compact, $\mathcal{E}'(M) = \mathcal{D}'(M)$.

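The case $M = \mathbb{T}^n$ is of interest, with respect to Fourier series. Given $w \in
\mathcal{D}'(\mathbb{T}^n)$, we can define

(4.83)    $\mathcal{F} w(k) = \hat w(k) = (2\pi)^{-n} \langle e_k, w \rangle,$

where

(4.84)    $e_k(\theta) = e^{-ik \cdot \theta} \in C^\infty(\mathbb{T}^n).$

Since $w$ must satisfy an estimate of the form

(4.85)    $|\langle u, w \rangle| \le C \|u\|_{C^\ell(\mathbb{T}^n)},$

it is clear that

(4.86)    $\mathcal{F} : \mathcal{D}'(\mathbb{T}^n) \to s'(\mathbb{Z}^n),$

where

(4.87)    $s'(\mathbb{Z}^n) = \{a : \mathbb{Z}^n \to \mathbb{C} : |a(k)| \le C \langle k \rangle^N \text{ for some } C, N\}$

consists of polynomially bounded functions on $\mathbb{Z}^n$. Note that $s'(\mathbb{Z}^n)$ is the dual
space to $s(\mathbb{Z}^n)$, defined in §1, and the map $\mathcal{F}$ in (4.86) is the adjoint of the map
$\mathcal{F}^* : s(\mathbb{Z}^n) \to C^\infty(\mathbb{T}^n)$ given by (1.11) and (1.12). Here we use the Hermitian
inner product $(u, w) = \langle u, \bar w \rangle = \overline{\langle \bar u, w \rangle}$. The map $\mathcal{F} : C^\infty(\mathbb{T}^n) \to s(\mathbb{Z}^n)$ given
by (1.1) and (1.6) also has an adjoint

(4.88)    $\mathcal{F}^* : s'(\mathbb{Z}^n) \to \mathcal{D}'(\mathbb{T}^n),$

extending the map (1.11)-(1.12), which, we recall, is

(4.89)    $(\mathcal{F}^* a)(\theta) = \sum_{k \in \mathbb{Z}^n} a(k)\, e^{ik \cdot \theta}.$

The Fourier inversion formulas

(4.90)    $\mathcal{F}^* \mathcal{F} = I$ on $C^\infty(\mathbb{T}^n)$, $\quad \mathcal{F} \mathcal{F}^* = I$ on $s(\mathbb{Z}^n)$,

extend by duality (or by continuity, and denseness of $C^\infty(\mathbb{T}^n)$ in $\mathcal{D}'(\mathbb{T}^n)$ and of
$s(\mathbb{Z}^n)$ in $s'(\mathbb{Z}^n)$) to

(4.91)    $\mathcal{F}^* \mathcal{F} = I$ on $\mathcal{D}'(\mathbb{T}^n)$, $\quad \mathcal{F} \mathcal{F}^* = I$ on $s'(\mathbb{Z}^n)$;

consequently, the map (4.86) is an isomorphism.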


Exercises 


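1. Define $M_f u$ by $\langle v, M_f u \rangle = \langle f v, u \rangle$, for $v \in \mathcal{S}(\mathbb{R}^n)$, $u \in \mathcal{S}'(\mathbb{R}^n)$. $M_f u$ is
also denoted by $f u$. Show that $M_f : \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n)$ continuously, provided
$f \in C^\infty(\mathbb{R}^n)$ and each derivative is polynomially bounded, that is, $|D^\alpha f(x)| \le
C_\alpha \langle x \rangle^{N(\alpha)}$.

2. Show that the identity $\xi^\alpha D_\xi^\beta \mathcal{F} f(\xi) = (-1)^{|\beta|} \mathcal{F}(D^\alpha x^\beta f)(\xi)$ from §3 continues to
hold for $f \in \mathcal{S}'(\mathbb{R}^n)$.

3. Calculate $\mathcal{F}$ of $x^\alpha$ and of $D^\alpha \delta$.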

4. Verify the identity (4.32) involving PV (1/€). 

5. Give a proof of Proposition 4.4, that $\mathcal{S}(\mathbb{R}^n)$ is dense in $\mathcal{S}'(\mathbb{R}^n)$, using convolutions in
place of the Fourier transform.

6. Show the denseness of $\mathcal{S}(\mathbb{R}^n)$ in $\mathcal{S}'(\mathbb{R}^n)$ follows from the Hahn-Banach theorem. See
Appendix A for a discussion of the Hahn-Banach theorem. On the other hand, sharpen
the proof of Proposition 4.4, to obtain sequential denseness.

7. Prove the identity (4.45), used in the proof of Proposition 4.5.

(Hint: Fix $\eta \in C^\infty(\mathbb{R}^n)$ so that $\eta(x) = 0$ for $|x| \le 1/2$, 1 for $|x| \ge 1$. Show that
$\eta(Rx) u \to u$ in $B_k$, as $R \to \infty$, for any $u \in \mathcal{E}_k$. This, plus a couple of further
approximations, yields (4.45).)

8. Let $f(x, y) = 1/z \in \mathcal{S}'(\mathbb{R}^2)$. Using (4.53), compute $\mathcal{F} f \in \mathcal{S}'(\mathbb{R}^2)$. Using
Proposition 4.9, compute the Fourier transform of $\log |x|$ on $\mathbb{R}^2$, and of $|x|^{2-n}$ on
$\mathbb{R}^n$, $n \ge 3$. Reconsider this problem after reading §8.

9. Let $u \in \mathcal{E}'(\mathbb{R}^n)$, and suppose $\langle p, u \rangle = 0$ for every polynomial $p$ on $\mathbb{R}^n$. Show that
$u = 0$. (Hint: Show that $D^\alpha \hat u(0) = 0$ for all $\alpha$; but $\hat u$ is analytic.)

Show that this result implies the Weierstrass approximation theorem, discussed in 
Exercise 15 of §3. 
10. Let $f \in C^\infty(\mathbb{R}^n)$ be real-valued, $\Sigma = \{x : f(x) = 0\}$. Define $\delta(f(x)) \in \mathcal{D}'(\mathbb{R}^n)$
to be

(4.92)    $\delta(f(x)) = \lim_{\varepsilon \searrow 0} \delta_\varepsilon(f(x)),$

where $\delta_\varepsilon(t) = 1/\varepsilon$ for $|t| < \varepsilon/2$, 0 otherwise, provided this limit exists, with respect
to the weak* topology on $\mathcal{D}'(\mathbb{R}^n)$. Show that if $\nabla f \ne 0$ on $\Sigma$, the limit does exist and
that, for $u \in C_0^\infty(\mathbb{R}^n)$,

$\langle u, \delta(f(x)) \rangle = \int_\Sigma u(x)\, |\nabla f(x)|^{-1}\, dS(x),$

where $dS$ is the $(n-1)$-dimensional measure on $\Sigma$. Consider cases where the limit
in (4.92) exists though $\nabla f$ vanishes on a variety in $\Sigma$.

11. Using an argument like that in the proof of Proposition 4.5, show that if $u \in \mathcal{S}'(\mathbb{R}^n)$
has support in a closed ball $B$, then, for some $C$, $k$,

$|\langle f, u \rangle| \le C \sup_{x \in B,\ |\alpha| \le k} |D^\alpha f(x)|.$

(Hint: Establish the following analogue of (4.45). If $\mathcal{E}_0$ is the linear space of elements
of $C_0^\infty(\mathbb{R}^n)$ vanishing on a neighborhood of $B$, and $\mathcal{E}_k$ is the closure of $\mathcal{E}_0$ in $B_k$,
then

$\mathcal{E}_k = \{u \in B_k : u = 0 \text{ on } B\}.$

Then show that if $u : B_k \to \mathbb{C}$ is continuous and $\operatorname{supp} u \subset B$,

$\langle f, u \rangle = \langle E \rho(f), u \rangle,$

where $\rho(f) = f|_B$ and $E : C^k(B) \to B_k$ is an extension operator. For help in
constructing $E$, look ahead to §4 of Chap. 4.)
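12. If $u \in \mathcal{E}'(\mathbb{R}^n)$, show that $\hat u \in C^\infty(\mathbb{R}^n)$ satisfies an estimate

(4.93)    $|\hat u(\xi)| \le C \langle \xi \rangle^m, \quad \xi \in \mathbb{R}^n,$

for some $m \in \mathbb{R}$. More generally, show that if a distribution $u$ has support in $B_R =
\{x \in \mathbb{R}^n : |x| \le R\}$, then

(4.94)    $|\hat u(\xi + i\eta)| \le C \langle \xi + i\eta \rangle^m e^{R|\eta|}, \quad \xi, \eta \in \mathbb{R}^n.$

(Hint. Use (4.39)-(4.41). For (4.94), use the result of Exercise 11.)

13. Given the formula (4.76) for $\mathcal{F}(u * v)$ when $u \in \mathcal{S}(\mathbb{R}^n)$, $v \in \mathcal{S}'(\mathbb{R}^n)$, show that if
$u \in \mathcal{S}(\mathbb{R}^n)$ and $v \in \mathcal{E}'(\mathbb{R}^n)$, then $\mathcal{F}(u * v) \in \mathcal{S}(\mathbb{R}^n)$, hence $u * v \in \mathcal{S}(\mathbb{R}^n)$, as
asserted above (4.75). (Hint: Use (4.93).)

14. Show that the convolution product extends to

$\mathcal{E}'(\mathbb{R}^n) \times \mathcal{D}'(\mathbb{R}^n) \to \mathcal{D}'(\mathbb{R}^n), \quad \mathcal{E}'(\mathbb{R}^n) \times \mathcal{E}'(\mathbb{R}^n) \to \mathcal{E}'(\mathbb{R}^n).$

15. Given $u \in \mathcal{E}'(\mathbb{R}^n)$, show that there exist $k \in \mathbb{Z}^+$, $f \in L^2(\mathbb{R}^n)$ such that $u =
(1 - \Delta)^k f$.
(Hint: Obtain $\langle \xi \rangle^{-2k} \hat u \in L^2(\mathbb{R}^n)$.)
Show that there exist compactly supported $f_\alpha \in L^2(\mathbb{R}^n)$ such that $u = \sum_{|\alpha| \le k} D^\alpha f_\alpha$.

16. Assume that $u \in \mathcal{S}'(\mathbb{R}^n)$ and $\hat u$ is holomorphic in $\mathbb{C}^n$ and satisfies (4.94). Show that
$u$ is supported in the ball $B_R$. This is the distributional version of the Paley-Wiener
theorem.
(Hint: Pick $\varphi \in C_0^\infty(\mathbb{R}^n)$, supported in $B_1$, $\int \varphi\, dx = 1$, let $\varphi_\varepsilon(x) = \varepsilon^{-n} \varphi(x/\varepsilon)$,
and consider $u_\varepsilon = \varphi_\varepsilon * u \in \mathcal{S}(\mathbb{R}^n)$. Apply Exercises 17 and 18 of §3.)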


5. The classical evolution equations 
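In this section we analyze solutions to the classical heat equation on $\mathbb{R}^+ \times \mathbb{R}^n$, to
the Laplace equation on $\mathbb{R}^{n+1}_+$, and to the wave equation on $\mathbb{R} \times \mathbb{R}^n$. We begin
with the heat equation for $u = u(t, x)$,

(5.1)    $\frac{\partial u}{\partial t} - \Delta u = 0,$

where $\Delta$ is the Laplace operator on $\mathbb{R}^n$,

(5.2)    $\Delta u = \frac{\partial^2 u}{\partial x_1^2} + \cdots + \frac{\partial^2 u}{\partial x_n^2}.$

We pose an initial condition

(5.3)    $u(0, x) = f(x).$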
We suppose that $f \in \mathcal{S}'(\mathbb{R}^n)$, and we look for a solution $u \in C^\infty(\overline{\mathbb{R}}^+, \mathcal{S}'(\mathbb{R}^n))$,
via Fourier analysis. Taking the Fourier transform of $u$ with respect to $x$, we obtain
the ODE with parameters

(5.4)    $\frac{\partial \hat u}{\partial t} = -|\xi|^2\, \hat u(t, \xi),$

with initial condition

(5.5)    $\hat u(0, \xi) = \hat f(\xi).$

The unique solution to (5.4)-(5.5) is

(5.6)    $\hat u(t, \xi) = e^{-t|\xi|^2} \hat f(\xi).$

Set

(5.7)    $\hat G(t, \xi) = e^{-t|\xi|^2}.$

Note that, for each $t > 0$, $\hat G(t, \cdot) \in \mathcal{S}(\mathbb{R}^n)$. By (4.76), we have

(5.8)    $u(t, x) = (2\pi)^{-n/2}\, G(t, \cdot) * f(x).$

The computation of the Fourier transform of such a Gaussian function was made
in §3. From (3.13), we deduce that

(5.9)    $u(t, x) = p(t, \cdot) * f(x),$

where

(5.10)    $p(t, x) = (4\pi t)^{-n/2} e^{-|x|^2/4t}$

for $t > 0$. The function $p(t, x)$ is called the fundamental solution to the heat
equation. It satisfies

(5.11)    $(\partial/\partial t - \Delta)\, p = 0$ for $t > 0$, $\quad \lim_{t \searrow 0} p(t, x) = \delta(x)$ in $\mathcal{S}'(\mathbb{R}^n)$.
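
The properties in (5.11) lend themselves to a quick numerical sanity check in one space dimension. The sketch below is an illustration only: it evaluates the kernel (5.10), integrates it with a midpoint rule on a truncated interval (the cutoff `L` and the grid sizes are arbitrary choices), and measures the finite-difference residual of the heat equation at one sample point.

```python
import math

def p(t, x, n=1):
    """Heat kernel (5.10), evaluated at a point with |x| = abs(x)."""
    return (4 * math.pi * t) ** (-n / 2) * math.exp(-x * x / (4 * t))

def total_mass(t, L=40.0, m=40000):
    """Midpoint rule for the integral of p(t, .) over [-L, L], with n = 1."""
    h = 2 * L / m
    return sum(p(t, -L + (j + 0.5) * h) for j in range(m)) * h

def heat_residual(t, x, h=1e-3):
    """Central-difference value of p_t - p_xx for n = 1."""
    pt = (p(t + h, x) - p(t - h, x)) / (2 * h)
    pxx = (p(t, x + h) - 2 * p(t, x) + p(t, x - h)) / (h * h)
    return pt - pxx

masses = [total_mass(t) for t in (0.01, 0.1, 1.0)]
res = heat_residual(0.5, 0.7)
print(masses, res)
```

The total mass stays equal to 1 while the kernel concentrates at the origin as $t \searrow 0$, which is the mechanism behind the convergence to $\delta$ in (5.11).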


We record what Fourier analysis has yielded for the heat equation. 


Proposition 5.1. The heat equation (5.1)-(5.3), with $f \in \mathcal{S}'(\mathbb{R}^n)$, has a unique
solution $u \in C^\infty(\overline{\mathbb{R}}^+, \mathcal{S}'(\mathbb{R}^n))$, given by (5.9)-(5.10). The solution is $C^\infty$ on
$(0, \infty) \times \mathbb{R}^n$. If $f \in \mathcal{S}(\mathbb{R}^n)$, then $u \in C^\infty(\overline{\mathbb{R}}^+, \mathcal{S}(\mathbb{R}^n))$.

Note carefully that uniqueness of the solution is asserted only within the class
$C^\infty(\overline{\mathbb{R}}^+, \mathcal{S}'(\mathbb{R}^n))$; this entails bounds on the solution considered, near infinity.
If one removes such growth restrictions, uniqueness fails. There exist nontrivial
solutions to

(5.12)    $\Big( \frac{\partial}{\partial t} - \Delta \Big) v = 0$ for $t > 0$, $\quad v(0, x) = 0,$

outlined in Exercise 2 at the end of this section. For fixed $t > 0$, such solutions
blow up too fast to belong to $\mathcal{S}'(\mathbb{R}^n)$.

In view of the boundedness and continuity properties of (5.7), we have the 
following: 


Corollary 5.2. Suppose $f \in L^2(\mathbb{R}^n)$. Then the solution $u$ to (5.1)-(5.3) of
Proposition 5.1 belongs to $C(\overline{\mathbb{R}}^+, L^2(\mathbb{R}^n))$.


We cannot say that $u \in C^\infty(\overline{\mathbb{R}}^+, L^2(\mathbb{R}^n))$, or even that $\partial u/\partial t$ belongs to
$C(\overline{\mathbb{R}}^+, L^2(\mathbb{R}^n))$, in such a case, without further restrictions on $f$. The
appropriate behavior of $\partial^j u/\partial t^j$ in such a case is best described in terms of Sobolev
spaces, which will be discussed in Chap. 4.

Next we look at the following boundary problem for functions harmonic in an
upper half space:

(5.13)    $\Big( \frac{\partial^2}{\partial y^2} + \Delta \Big) u(y, x) = 0$, for $y > 0$, $x \in \mathbb{R}^n$,

(5.14)    $u(0, x) = f(x).$

Here, $\Delta$ is given by (5.2). In view of such simple examples as

(5.15)    $u(y, x) = y,$

which satisfy (5.13) and (5.14) with $f = 0$, we will need to make appropriate
restrictions on $u$ in order to obtain uniqueness. As before, we suppose that $f \in
\mathcal{S}'(\mathbb{R}^n)$ and look for $u \in C^\infty(\overline{\mathbb{R}}^+, \mathcal{S}'(\mathbb{R}^n))$. Fourier transforming with respect to
$x$ gives the second-order ODE, with parameters,

(5.16)    $\frac{d^2}{dy^2} \hat u(y, \xi) - |\xi|^2 \hat u(y, \xi) = 0,$

(5.17)    $\hat u(0, \xi) = \hat f(\xi).$

The general solution to (5.16)-(5.17), for any fixed $\xi \ne 0$, is

(5.18)    $\hat u(y, \xi) = c_0(\xi) e^{y|\xi|} + c_1(\xi) e^{-y|\xi|},$

with $c_0(\xi) + c_1(\xi) = \hat f(\xi)$. Let us restrict attention to $f$ such that $\hat f(\xi)$ is
continuous, and look for $\hat u(y, \xi)$ continuous in $(y, \xi)$; as (5.15) illustrates, things can be
more complicated if $\hat u(y, \cdot)$ is a singular distribution near $\xi = 0$. In view of the
blow-up of $e^{y|\xi|}$ as $y \nearrow \infty$, it is natural to require that $c_0(\xi) = 0$, so

(5.19)    $\hat u(y, \xi) = e^{-y|\xi|} \hat f(\xi).$

In partial analogy with Proposition 5.1, we have obtained the following result.
In partial analogy with Proposition 5.1, we have obtained the following result. 


Proposition 5.3. Let $f \in \mathcal{S}'(\mathbb{R}^n)$, and suppose $\hat f$ is continuous. Then there
is a unique solution $u(y, x)$ of (5.13) and (5.14), belonging to the space
$C^\infty(\overline{\mathbb{R}}^+, \mathcal{S}'(\mathbb{R}^n))$ and satisfying the condition that $\hat u(y, \xi)$ is continuous on
$\overline{\mathbb{R}}^+ \times \mathbb{R}^n$, and furthermore that $u(y, \cdot)$ is a bounded function of $y$ taking values
in $\mathcal{S}'(\mathbb{R}^n)$. It is given by (5.19).


Note that $\hat f(\xi)$ is continuous provided $f$ is a finite measure. It is also continuous
if $f \in \mathcal{E}'(\mathbb{R}^n)$.

We want to find the “fundamental solution” $P(y, x)$ for (5.13)-(5.14),
corresponding to $f = \delta$. In other words,

(5.20)    $\hat P(y, \xi) = (2\pi)^{-n/2} e^{-y|\xi|}.$

This computation is elementary in the case $n = 1$. We have

(5.21)    $P(y, x) = (2\pi)^{-1} \int_{-\infty}^{\infty} e^{-y|\xi| + ix\xi}\, d\xi$
          $= (2\pi)^{-1} \Big[ \int_0^\infty e^{-y\xi + ix\xi}\, d\xi + \int_0^\infty e^{-y\xi - ix\xi}\, d\xi \Big]$
          $= (2\pi)^{-1} \big[ (y - ix)^{-1} + (y + ix)^{-1} \big]$
          $= \frac{1}{\pi} \frac{y}{y^2 + x^2}.$

For $n > 1$, a direct calculation of such a Fourier transform is not so elementary.
One way to perform the computation is to use the following subordination identity:

(5.22)    $e^{-yA} = \frac{y}{2\pi^{1/2}} \int_0^\infty e^{-y^2/4t}\, e^{-tA^2}\, t^{-3/2}\, dt, \quad A > 0, \ y > 0.$
T 0) 


We will give a proof of (5.22) shortly. First we will show how it leads to a
computation of the Fourier transform of (5.20). We let $A = |\xi|$, and we use our prior
computation of the Fourier transform of $e^{-t|\xi|^2}$. Thus, for any $n \ge 1$,

(5.23)    $P(y, x) = (2\pi)^{-n} \int_{\mathbb{R}^n} e^{-y|\xi|} e^{ix \cdot \xi}\, d\xi$
          $= (2\pi)^{-n} \frac{y}{2\pi^{1/2}} \int_0^\infty e^{-y^2/4t} \Big( \int_{\mathbb{R}^n} e^{-t|\xi|^2} e^{ix \cdot \xi}\, d\xi \Big) t^{-3/2}\, dt,$

and substituting in the calculation (5.7)-(5.10), we have

(5.24)    $P(y, x) = (4\pi)^{-(n+1)/2}\, y \int_0^\infty e^{-y^2/4t}\, e^{-|x|^2/4t}\, t^{-(n+3)/2}\, dt = c_n \frac{y}{(y^2 + |x|^2)^{(n+1)/2}},$

where the last integral is evaluated using the substitution $s = 1/t$. The constant
$c_n$ is given by

(5.25)    $c_n = \pi^{-(n+1)/2}\, \Gamma\Big( \frac{n+1}{2} \Big),$

where $\Gamma(z) = \int_0^\infty e^{-s} s^{z-1}\, ds$ is Euler's gamma function, which is discussed in
the appendix to this chapter. Note that $c_1 = 1/\pi$, so the calculation (5.24) agrees
with (5.21), in the case $n = 1$.

The observation that the calculations (5.21) and (5.24) coincide for $n = 1$ can
be used to provide a simple proof of the subordination identity (5.22). Indeed,
with $|\xi|$ substituted for $A$ in (5.22), we know that the operation of Fourier
multiplication by the left side coincides with the operation of Fourier multiplication by
the right side, so the two functions of $|\xi|$ ($\xi \in \mathbb{R}$) must coincide.
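
As a numerical cross-check of (5.22), one can evaluate the right side by quadrature and compare with $e^{-yA}$. The sketch below is one possible way to do this, not the text's method: it substitutes $t = e^s$ so that the integrand decays rapidly in both directions, and it uses the trapezoid rule on a truncated interval whose endpoints are arbitrary choices.

```python
import math

def subordination_rhs(y, A, s_min=-40.0, s_max=10.0, m=20000):
    """Right side of (5.22), integrated in the variable s = log t by the trapezoid rule."""
    h = (s_max - s_min) / m
    total = 0.0
    for j in range(m + 1):
        t = math.exp(s_min + j * h)
        # t^{-3/2} dt becomes t^{-1/2} ds after the substitution t = e^s
        f = math.exp(-y * y / (4 * t) - t * A * A) * t ** -0.5
        total += f if 0 < j < m else 0.5 * f
    return y / (2 * math.sqrt(math.pi)) * total * h

pairs = [(1.0, 1.0), (0.5, 2.0), (2.0, 0.3)]
errors = [abs(subordination_rhs(y, A) - math.exp(-y * A)) for y, A in pairs]
print(errors)
```

The differences should be at the level of rounding error for each sample pair $(y, A)$.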

There are other proofs of (5.22). In Appendix A we note the equivalence of
such an identity and a classical identity involving Euler's gamma function. While
the proof of (5.22) given above is complete, it leaves one with an unsatisfied
feeling, since the right side of the formula (5.22) seems to have been pulled out of a
hat. We want to introduce a setting where such a formula arises naturally, a
setting involving the use of operator notations, as follows. For a decent function $f$
defined on $[0, \infty)$, define $f(\sqrt{-\Delta})$ on $\mathcal{S}(\mathbb{R}^n)$, or on $L^2(\mathbb{R}^n)$, or even on $\mathcal{S}'(\mathbb{R}^n)$,
when it makes sense, by

(5.26)    $\big( f(\sqrt{-\Delta}) u \big)^\wedge(\xi) = f(|\xi|)\, \hat u(\xi).$

Thus, the content of (5.10) is

(5.27)    $e^{t\Delta} \delta(x) = (4\pi t)^{-n/2} e^{-|x|^2/4t}$

for $t > 0$. The formula (5.24) is a formula for $e^{-y\sqrt{-\Delta}} \delta(x)$, $x \in \mathbb{R}^n$, and the
formula (5.21) is the special case of this for $n = 1$.
We will approach the subordination identity via the formula

(5.28)    $(\lambda^2 - \Delta)^{-1} = \int_0^\infty e^{-(\lambda^2 - \Delta)t}\, dt,$

for the resolvent $(\lambda^2 - \Delta)^{-1}$ of the Laplace operator $\Delta$. This identity follows via
the Fourier transform, as in (5.26), from

(5.29)    $(\lambda^2 + |\xi|^2)^{-1} = \int_0^\infty e^{-(\lambda^2 + |\xi|^2)t}\, dt.$

In order to derive the subordination identity, we will apply both sides of (5.28)
to $\delta$; we will do this in the special case of $\Delta = d^2/dx^2$ acting on $\mathcal{S}'(\mathbb{R}^1)$. For the
special case $\xi \in \mathbb{R}^1$, we have the Fourier integral formula

(5.30)    $\frac{1}{2\pi} \int_{-\infty}^{\infty} (\lambda^2 + \xi^2)^{-1} e^{ix\xi}\, d\xi = \frac{1}{2\lambda} e^{-\lambda|x|} \quad (\lambda > 0),$

a fact that can be established either by residue calculus or by applying the Fourier
inversion formula to the computation (5.21). Thus applying both sides of (5.28)
to $\delta \in \mathcal{S}'(\mathbb{R})$ and using $e^{t\Delta} \delta = (4\pi t)^{-1/2} e^{-x^2/4t}$ in this case, we have

(5.31)    $\frac{1}{\lambda} e^{-\lambda|x|} = \frac{1}{\pi^{1/2}} \int_0^\infty e^{-x^2/4t}\, e^{-\lambda^2 t}\, t^{-1/2}\, dt.$

Making the change of variables $y = |x|$, $A = \lambda$, and taking the $y$-derivative of
the resulting identity give (5.22). Also note that taking the $A$-derivative of (5.22)
gives (5.31). The identity (5.31) is also called the subordination identity. One can
see that it arises very naturally in this context, from (5.28). We will return to the
calculation of $(\lambda^2 - \Delta)^{-1} \delta(x)$ in case $n > 1$, later in this section.

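We next consider the wave equation on $\mathbb{R} \times \mathbb{R}^n$:

(5.32)    $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0,$

with initial data

(5.33)    $u(0, x) = f(x), \quad u_t(0, x) = g(x).$

As before, we suppose that $f$ and $g$ belong to $\mathcal{S}'(\mathbb{R}^n)$ and look for a solution $u$ in
$C^\infty(\mathbb{R}, \mathcal{S}'(\mathbb{R}^n))$. Taking the Fourier transform with respect to $x$ again yields an
ODE with parameters:

(5.34)    $\frac{d^2 \hat u}{dt^2} + |\xi|^2 \hat u = 0,$

(5.35)    $\hat u(0, \xi) = \hat f(\xi), \quad \hat u_t(0, \xi) = \hat g(\xi).$

The general solution to (5.34) (for $\xi \ne 0$) is of the form

$\hat u(t, \xi) = c_0(\xi) \sin t|\xi| + c_1(\xi) \cos t|\xi|,$

which it is convenient to write as

$\hat u(t, \xi) = c_0(\xi)\, |\xi|^{-1} \sin t|\xi| + c_1(\xi) \cos t|\xi|,$

since the right side is well defined for any $c_0, c_1 \in \mathcal{S}'(\mathbb{R}^n)$. The initial conditions
(5.35) imply $c_1 = \hat f$, $c_0 = \hat g$, so the solution to (5.34)-(5.35) is

(5.36)    $\hat u(t, \xi) = \hat g(\xi)\, |\xi|^{-1} \sin t|\xi| + \hat f(\xi) \cos t|\xi|.$

This is clearly the unique solution in $C^\infty(\mathbb{R}, \mathcal{S}(\mathbb{R}^n))$, if $f, g \in \mathcal{S}(\mathbb{R}^n)$. The
uniqueness in $C^\infty(\mathbb{R}, \mathcal{S}'(\mathbb{R}^n))$ for general $f, g \in \mathcal{S}'(\mathbb{R}^n)$ will be proved shortly.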

If $f = 0$, $g = \delta$, the solution given by (5.36) is called the fundamental
solution, or the “Riemann function.” Of course, it is actually a distribution. It is
characterized by

(5.37)    $\hat R(t, \xi) = (2\pi)^{-n/2}\, |\xi|^{-1} \sin t|\xi|.$

We want a direct formula for $R(t, x)$. We will be able to deduce this formula from
the formula (5.24) for the Fourier transform of $e^{-y|\xi|}$, via analytic continuation.
To bring in the factor $|\xi|^{-1}$, integrate (5.24) with respect to $y$. Thus, if

(5.38)    $\hat F(y, \xi) = (2\pi)^{-n/2}\, |\xi|^{-1} e^{-y|\xi|},$

which belongs to $\mathcal{S}'(\mathbb{R}^n)$ for each $y > 0$ if $n \ge 2$, we have

(5.39)    $F(y, x) = c_n' (y^2 + |x|^2)^{-(n-1)/2}$

with

(5.40)    $c_n' = \frac{c_n}{n-1} = \frac{1}{2}\, \pi^{-(n+1)/2}\, \Gamma\Big( \frac{n-1}{2} \Big).$

This has been verified for real $y > 0$. But (5.38) is holomorphic in $y$, with values
in $\mathcal{S}'(\mathbb{R}^n)$, for all $y$ such that $\operatorname{Re} y > 0$. Also, it is continuous in the right
half-plane $\{y \in \mathbb{C} : \operatorname{Re} y \ge 0\}$. In view of the continuity of the Fourier transform on
$\mathcal{S}'(\mathbb{R}^n)$, we deduce that if


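(5.41)    $\hat \Phi(t, \xi) = (2\pi)^{-n/2}\, |\xi|^{-1} e^{-it|\xi|},$

$t \in \mathbb{R}$, then

(5.42)    $\Phi(t, x) = \lim_{\varepsilon \searrow 0} c_n' \big( |x|^2 - (t - i\varepsilon)^2 \big)^{-(n-1)/2},$

the limit existing in $\mathcal{S}'(\mathbb{R}^n)$ for each $t \in \mathbb{R}$, since $\hat \Phi(t, \cdot) = \lim_{\varepsilon \searrow 0} \hat \Phi(t - i\varepsilon, \cdot)$ in
$\mathcal{S}'(\mathbb{R}^n)$. Since $\sin t|\xi| = -\operatorname{Im} e^{-it|\xi|}$, we consequently have, for the Riemann function,

(5.43)    $R(t, x) = -\lim_{\varepsilon \searrow 0} c_n' \operatorname{Im} \big( |x|^2 - (t - i\varepsilon)^2 \big)^{-(n-1)/2}.$

Note that if $|x| > |t|$, $\lim_{\varepsilon \searrow 0} \big( |x|^2 - (t - i\varepsilon)^2 \big)^{-(n-1)/2} = (|x|^2 - t^2)^{-(n-1)/2}$
is real, so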


(5.44)    $R(t, x) = 0$, for $|x| > |t|$.

This is a reflection of the finite propagation speed, which was discussed in
Chap. 2. Note also that if $n$ is odd, then $(n-1)/2$ is an integer, so (5.43)
vanishes also for $|x| < |t|$. In other words,

(5.45)    $n \ge 3$ odd $\Longrightarrow \operatorname{supp} R(t, \cdot) \subset \{x \in \mathbb{R}^n : |x| = |t|\}.$

This is the strict Huygens principle. Of course, it does not hold when $n$ is even.
When $n = 2$, the computation of the limit in (5.43) is elementary. We have

(5.46)    $R(t, x) = c_2' (t^2 - |x|^2)^{-1/2} \cdot \operatorname{sgn}(t)$, for $|x| < |t|$,
          $R(t, x) = 0$, for $|x| > |t|$,

for $n = 2$. For $n = 3$, the Plemelj jump relation (4.29) yields

(5.47)    $R(t, x) = (4\pi t)^{-1} \delta(|x| - |t|).$

The discussion above has to be modified for $n = 1$, since (5.41) is not locally
integrable near $\xi = 0$ in that case. This simple case ($n = 1$) was treated in §1 of
Chap. 2; see (1.24)-(1.28) in that chapter.
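
For $n = 3$, the passage from the regularized expression with $t - i\varepsilon$ to the delta function in (5.47) can be illustrated numerically. The sketch below rests on explicit assumptions: it takes the Riemann function to be minus the imaginary part of $c_3' (|x|^2 - (t - i\varepsilon)^2)^{-1}$ for small $\varepsilon$ (the sign that matches (5.37) when $\hat\Phi$ carries $e^{-it|\xi|}$), pairs it with a radial Gaussian, and compares with the value $t\varphi(|t|)$ obtained by pairing (5.47) with the same function. The Gaussian is a stand-in for a test function, and `eps`, `r_max`, `m` are arbitrary numerical parameters.

```python
import math

def riemann_pairing(t, phi, eps=1e-3, r_max=6.0, m=200000):
    """Pair -c_3' Im (|x|^2 - (t - i eps)^2)^{-1} with a radial function phi on R^3,
    using polar coordinates and the midpoint rule in r."""
    c3 = 1 / (2 * math.pi ** 2)            # c_n' from (5.40) with n = 3
    z2 = complex(t, -eps) ** 2             # (t - i eps)^2
    h = r_max / m
    total = 0.0
    for j in range(m):
        r = (j + 0.5) * h
        kernel = -(1 / (r * r - z2)).imag
        total += kernel * phi(r) * 4 * math.pi * r * r
    return c3 * total * h

phi = lambda r: math.exp(-r * r)
forward = riemann_pairing(1.0, phi)        # compare with  1.0 * phi(1.0)
backward = riemann_pairing(-1.0, phi)      # compare with -1.0 * phi(1.0)
print(forward, backward)
```

The agreement is only up to an error of order `eps`, since the limit $\varepsilon \searrow 0$ has been replaced by a fixed small value.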

The solution to (5.32)-(5.33) given by (5.36) can be expressed as

(5.48)    $u(t, \cdot) = R(t, \cdot) * g + \frac{\partial}{\partial t} R(t, \cdot) * f.$

We record our result on solutions to the wave equation.


Proposition 5.4. Given $f, g \in \mathcal{S}'(\mathbb{R}^n)$, there is a unique solution $u \in C^\infty(\mathbb{R}, \mathcal{S}'(\mathbb{R}^n))$
to the initial-value problem (5.32)-(5.33). It is given by (5.48).


Proof. The only point remaining to be established is the uniqueness. Suppose
$u \in C^\infty(\mathbb{R}, \mathcal{S}'(\mathbb{R}^n))$ solves (5.32)-(5.33) with $f = g = 0$. Then

$v(t, \cdot) = u(t, \cdot) * \varphi$

solves the same equation, for any $\varphi \in C_0^\infty(\mathbb{R}^n)$; $v = v(t, x)$. We have $v \in
C^\infty(\mathbb{R} \times \mathbb{R}^n)$. Thus the energy estimates of Chap. 2 are applicable to $v$, and we
have $v = 0$ everywhere. Taking a sequence $\varphi_j \in C_0^\infty(\mathbb{R}^n)$ approaching $\delta$, we
have $v_j \to u$; since each $v_j = 0$, it follows that $u = 0$, and the proof is complete.


We note that the argument above yields uniqueness for $u$ in the class
$C^\infty(\mathbb{R}, \mathcal{D}'(\mathbb{R}^n))$. We also remark that any $u \in \mathcal{D}'(\mathbb{R} \times \mathbb{R}^n)$ solving (5.32)
actually belongs to $C^\infty(\mathbb{R}, \mathcal{D}'(\mathbb{R}^n))$, and that (5.48) gives the unique solution to
(5.32)-(5.33) for any $f, g \in \mathcal{D}'(\mathbb{R}^n)$. Justification of these statements is left as an
exercise.

Returning to the operator notation (5.26), we have

(5.49)    $R(t, x) = (-\Delta)^{-1/2} \sin t\sqrt{-\Delta}\ \delta(x)$

and

(5.50)    $\frac{\partial}{\partial t} R(t, x) = \cos t\sqrt{-\Delta}\ \delta(x).$

We also denote (5.49) by $R(t)$ and (5.50) by $R'(t)$.

Having introduced in (5.28) the notion of synthesizing some operators from
other operators, we want to mention the particular desirability of synthesizing
functions of the Laplace operator from the fundamental solution of the wave
equation. If $\varphi(s)$ is an even function, the Fourier inversion formula implies

(5.51)    $\varphi(\sqrt{-\Delta}) = (2\pi)^{-1/2} \int_{-\infty}^{\infty} \hat \varphi(t) \cos t\sqrt{-\Delta}\ dt.$

Note that

(5.52)    $\cos t\sqrt{-\Delta}\ u = R'(t) * u,$

where $R(t)$ is the Riemann function constructed above. We have the following
rather general calculation of $\varphi(\sqrt{-\Delta})\delta$, using the formula (5.37) for the Riemann
function on $\mathbb{R}^n$.


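Proposition 5.5. Let $\varphi \in \mathcal{S}(\mathbb{R})$ be even. Then, on $\mathbb{R}^n$, we have

(5.53)    $\varphi(\sqrt{-\Delta})\, \delta(x) = \frac{1}{\sqrt{2\pi}} \Big[ -\frac{1}{2\pi r} \frac{\partial}{\partial r} \Big]^k \hat \varphi(r)$

if $n = 2k + 1$ is odd, and

(5.54)    $\varphi(\sqrt{-\Delta})\, \delta(x) = \Big( \frac{2}{\pi} \Big)^{1/2} \int_r^\infty \Big[ -\frac{1}{2\pi s} \frac{\partial}{\partial s} \Big]^k \hat \varphi(s)\, \frac{s}{\sqrt{s^2 - r^2}}\, ds$

if $n = 2k$ is even. Here, $r = |x|$.


Proof. When $n = 3$, we have

(5.55)    $\varphi(\sqrt{-\Delta})\, \delta = -\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat \varphi'(t)\, (4\pi t)^{-1} \delta(r - |t|)\, dt = -\frac{1}{\sqrt{2\pi}} \frac{1}{2\pi r}\, \hat \varphi'(r),$

giving (5.53) in this case. When $n = 2$, we have, from (5.46),

(5.56)    $\varphi(\sqrt{-\Delta})\, \delta = -\frac{2 c_2'}{\sqrt{2\pi}} \int_r^\infty \hat \varphi'(t)\, (t^2 - r^2)^{-1/2}\, dt,$

giving (5.54) in this case, once one checks that $c_2' = (1/2) \pi^{-3/2} \Gamma(1/2) = 1/2\pi$.
To pass to the general case, we note that if $R_n(t, r)$ denotes the formula (5.43)
for the Riemann function, in view of the evaluation of $c_n'$ in (5.40), we have the
formal relation

(5.57)    $R_{n+2}(t, r) = -\frac{1}{2\pi r} \frac{\partial}{\partial r} R_n(t, r),$

so (5.53) and (5.54) follow by induction.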


It is clear that Proposition 5.5 holds for a more general class of even functions
$\varphi$ than those in $\mathcal{S}(\mathbb{R})$, by simple limiting arguments. For example, the function
$\varphi(s) = (\lambda^2 + s^2)^{-1}$, $\lambda > 0$, giving the resolvent of $\Delta$, can be treated. We
leave the formulation of general results on classes of $\varphi$ which can be treated as an
exercise.

In the case of using Proposition 5.5 to treat the resolvent of $\Delta$, we have the
following formula. With $\varphi(s) = (\lambda^2 + s^2)^{-1}$, from (5.30) we have

(5.58)    $\hat \varphi(t) = \Big( \frac{\pi}{2} \Big)^{1/2} \lambda^{-1} e^{-\lambda|t|}$

to plug into (5.53)-(5.54). For example, for $n = 3$, we have

(5.59)    $(\lambda^2 - \Delta)^{-1} \delta = (4\pi |x|)^{-1} e^{-\lambda|x|}$ on $\mathbb{R}^3$.

Note that computing $(\lambda^2 - \Delta)^{-1} \delta$ by evaluating the right side of (5.28) gives in
this case

(5.60)    $(\lambda^2 - \Delta)^{-1} \delta = \int_0^\infty e^{-|x|^2/4t}\, e^{-\lambda^2 t}\, (4\pi t)^{-3/2}\, dt;$

comparing (5.59) and (5.60) again reveals the subordination identity, in the
original form (5.22).

The fact that the answer comes out “in closed form” for $n$ odd is a consequence
of the strict Huygens principle. For $n$ even, one tends not to get elementary
functions. Note that, for $n = 2$, the formula (5.54) gives

(5.61)    $(\lambda^2 - \Delta)^{-1} \delta = \frac{1}{2\pi} \int_r^\infty e^{-\lambda s} (s^2 - r^2)^{-1/2}\, ds$ on $\mathbb{R}^2$;

the formula (5.28) gives

(5.62)    $(\lambda^2 - \Delta)^{-1} \delta = \frac{1}{4\pi} \int_0^\infty e^{-|x|^2/4t - \lambda^2 t}\, t^{-1}\, dt$ on $\mathbb{R}^2$.

Both of these integrals can be expressed in terms of the modified Bessel function
$K_0$; we say a little more about this in the next section; see (6.46)-(6.54).
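
The agreement between the wave-equation route and the heat-equation route to the resolvent kernel can be checked numerically. The following sketch compares (5.59) with (5.60) for $n = 3$ and (5.61) with (5.62) for $n = 2$ at one sample point. The substitutions $t = e^u$ and $s = r\cosh\theta$, the truncation limits, and the helper names are choices made for this illustration only.

```python
import math

def trapezoid(f, a, b, m):
    """Composite trapezoid rule for f on [a, b] with m subintervals."""
    h = (b - a) / m
    return h * (0.5 * f(a) + sum(f(a + j * h) for j in range(1, m)) + 0.5 * f(b))

def resolvent_heat(n, lam, r):
    """Right side of (5.28) applied to delta, i.e. (5.60) or (5.62), with t = e^u."""
    def g(u):
        t = math.exp(u)
        return math.exp(-r * r / (4 * t) - lam * lam * t) * (4 * math.pi * t) ** (-n / 2) * t
    return trapezoid(g, -40.0, 10.0, 20000)

def resolvent_wave_2d(lam, r):
    """Formula (5.61), after the substitution s = r cosh(theta)."""
    f = lambda th: math.exp(-lam * r * math.cosh(th))
    return trapezoid(f, 0.0, 12.0, 6000) / (2 * math.pi)

def resolvent_3d(lam, r):
    """Closed form (5.59)."""
    return math.exp(-lam * r) / (4 * math.pi * r)

diff3 = abs(resolvent_heat(3, 1.0, 0.5) - resolvent_3d(1.0, 0.5))
diff2 = abs(resolvent_heat(2, 1.0, 0.5) - resolvent_wave_2d(1.0, 0.5))
print(diff3, diff2)
```

Both differences should be at rounding level, reflecting that each pair of formulas represents the same distribution away from the origin.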

In general, the use of results on the wave equation together with (5.51) provides 
a tool of tremendous power and flexibility in the analysis of numerous functions 
of the Laplace operator. We will see more of this in Chap. 8. 

To end this section, we re-derive formulas for the solution to the wave
equation (5.32)-(5.33), and then re-derive the formulas (5.53)-(5.54). Recall that the
solution to the wave equation on $\mathbb{R} \times \mathbb{R}^n$ is given by

(5.63)    $u(t, x) = \cos t\sqrt{-\Delta}\ f(x) + \frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g(x).$

We first derive formulas for these solution operators, in case

(5.64)    $n = 2k + 1,$

by comparing two formulas for $e^{t\Delta} f(x)$. This approach follows material in [PT].
The first formula for $e^{t\Delta}$ is

(5.65)    $e^{t\Delta} f(x) = (4\pi t)^{-n/2} \int_{\mathbb{R}^n} e^{-|y|^2/4t} f(x - y)\, dy = (4\pi t)^{-n/2} A_{n-1} \int_0^\infty \bar f_x(r)\, r^{n-1} e^{-r^2/4t}\, dr,$

where $A_{n-1}$ is the area of the unit sphere $S^{n-1}$ in $\mathbb{R}^n$, and

(5.66)    $\bar f_x(r) = \frac{1}{A_{n-1}} \int_{S^{n-1}} f(x + r\omega)\, dS(\omega).$

Note that $\bar f_x(r)$ is well defined for all $r \in \mathbb{R}$, and $\bar f_x(-r) = \bar f_x(r)$. The first
identity in (5.65) follows, via Fourier analysis, from the evaluation of the Gaussian
integral

(5.67)    $\int_{\mathbb{R}^n} e^{-t|\xi|^2} e^{ix \cdot \xi}\, d\xi = \Big( \frac{\pi}{t} \Big)^{n/2} e^{-|x|^2/4t}.$

The second identity in (5.65) follows by switching to spherical polar coordinates,
$y = r\omega$, and using $dy = r^{n-1}\, dr\, dS(\omega)$.

The second formula for $e^{t\Delta}$ is

(5.68)    $e^{t\Delta} f(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat h_t(s) \cos s\sqrt{-\Delta}\ f(x)\, ds,$

with $h_t(\sigma) = e^{-t\sigma^2}$, hence, by (5.67), $\hat h_t(s) = (2t)^{-1/2} e^{-s^2/4t}$. This is a special
case of (5.51).

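Setting $4t = 1/\lambda$ and comparing the formulas (5.65) and (5.68), we have (with
$u(s, x) = \cos s\sqrt{-\Delta}\ f(x)$)

(5.69)    $\int_0^\infty u(s, x)\, e^{-\lambda s^2}\, ds = \frac{A_{n-1}}{2} \Big( \frac{\lambda}{\pi} \Big)^{(n-1)/2} \int_0^\infty \bar f_x(r)\, r^{n-1} e^{-\lambda r^2}\, dr,$

for all $\lambda > 0$. The key to getting a formula for $u(s, x)$ from this is to make the
factor $\lambda^{(n-1)/2}$ on the right side of (5.69) disappear.
Bringing in the hypothesis (5.64), we use the identity

(5.70)    $-\frac{1}{2r} \frac{d}{dr}\, e^{-\lambda r^2} = \lambda\, e^{-\lambda r^2}$

to write the right side of (5.69) as

(5.71)    $C_n \int_0^\infty r^{2k} \bar f_x(r) \Big( -\frac{1}{2r} \frac{d}{dr} \Big)^k e^{-\lambda r^2}\, dr.$

Repeated integration by parts shows that this is equal to

(5.72)    $C_n \int_0^\infty \Big[ r \Big( \frac{1}{2r} \frac{d}{dr} \Big)^k \big( r^{2k-1} \bar f_x(r) \big) \Big] e^{-\lambda r^2}\, dr.$

Now it follows from uniqueness of Laplace transforms (see the exercises) that

(5.73)    $\cos t\sqrt{-\Delta}\ f(x) = C_n\, t \Big( \frac{1}{2t} \frac{d}{dt} \Big)^k \big[ t^{2k-1} \bar f_x(t) \big],$

for well behaved functions $f$ on $\mathbb{R}^n$, when $n = 2k + 1$. By (5.69), we have

(5.74)    $C_n = \frac{1}{2}\, \pi^{-(n-1)/2} A_{n-1}.$

We can also compute $C_n$ directly in (5.73), by considering $f = 1$. Then $\bar f_x = 1$
and $u = 1$, so

(5.75)    $1 = C_n\, t \Big( \frac{1}{2t} \frac{d}{dt} \Big)^k t^{2k-1} = C_n \Big( k - \frac{1}{2} \Big) \Big( k - \frac{3}{2} \Big) \cdots \frac{1}{2},$

i.e.,

(5.76)    $C_n = \frac{1}{(k - \frac{1}{2})(k - \frac{3}{2}) \cdots \frac{1}{2}}, \quad n = 2k + 1.$

This simply means

(5.77)    $A_{2k} = \frac{2\pi^k}{(k - \frac{1}{2})(k - \frac{3}{2}) \cdots \frac{1}{2}},$

a formula that is frequently derived by looking at Gaussian integrals. See formulas
(A.4)-(A.9) at the end of this chapter.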
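
The formula (5.77) can be compared with the gamma-function expression for sphere areas. The sketch below assumes the standard formula $A_m = 2\pi^{(m+1)/2}/\Gamma((m+1)/2)$ for the area of the unit sphere $S^m \subset \mathbb{R}^{m+1}$ as the reference value; the function names are hypothetical.

```python
import math

def area_via_577(k):
    """A_{2k} from (5.77): 2 pi^k / [(k - 1/2)(k - 3/2)...(1/2)]."""
    prod = 1.0
    for j in range(k):
        prod *= j + 0.5
    return 2 * math.pi ** k / prod

def area_via_gamma(m):
    """Area of the unit sphere S^m in R^{m+1} (assumed reference formula)."""
    return 2 * math.pi ** ((m + 1) / 2) / math.gamma((m + 1) / 2)

for k in range(6):
    print(k, area_via_577(k), area_via_gamma(2 * k))
```

For $k = 1$ both expressions give $4\pi$, the area of $S^2$, and for $k = 0$ both give 2, the two points of $S^0$.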


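To compute $(\sin t\sqrt{-\Delta})/\sqrt{-\Delta}$, we use

(5.78)    $\frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g(x) = \int_0^t \cos s\sqrt{-\Delta}\ g(x)\, ds.$

From (5.73), if $k \ge 1$,

(5.79)    $\cos s\sqrt{-\Delta}\ g(x) = \frac{C_n}{2} \frac{d}{ds} \Big( \frac{1}{2s} \frac{d}{ds} \Big)^{k-1} \big[ s^{2k-1} \bar g_x(s) \big],$

so (5.78) becomes

(5.80)    $\frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g(x) = \frac{C_n}{2} \Big( \frac{1}{2t} \frac{d}{dt} \Big)^{k-1} \big[ t^{2k-1} \bar g_x(t) \big].$

The formulas (5.73) and (5.80) are for $t > 0$. For arbitrary $t \in \mathbb{R}$, use

(5.81)    $\cos t\sqrt{-\Delta} = \cos(-t)\sqrt{-\Delta}, \quad \sin t\sqrt{-\Delta} = -\sin(-t)\sqrt{-\Delta}.$

The case $k = 0$ is exceptional. Then (5.79) does not work. Instead, we have

(5.82)    $\cos t\sqrt{-\Delta}\ g(x) = \frac{1}{2} \big[ g(x + t) + g(x - t) \big],$

and (5.78) gives

(5.83)    $\frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g(x) = \frac{1}{2} \int_0^t \big[ g(x + s) + g(x - s) \big]\, ds = \frac{1}{2} \int_{-t}^{t} g(x + s)\, ds$

for $n = 1$.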
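
The one-dimensional formula (5.83) is easy to test on an explicit example. The sketch below takes the hypothetical initial velocity $g(x) = e^{-x^2}$, for which the integral in (5.83) can be written with the error function, and checks by finite differences that the result solves the wave equation with $u(0, x) = 0$ and $u_t(0, x) = g(x)$.

```python
import math

def u(t, x):
    """(5.83) with g(x) = exp(-x^2): half the integral of g over [x - t, x + t]."""
    return 0.25 * math.sqrt(math.pi) * (math.erf(x + t) - math.erf(x - t))

def wave_residual(t, x, h=1e-3):
    """Central-difference value of u_tt - u_xx."""
    utt = (u(t + h, x) - 2 * u(t, x) + u(t - h, x)) / (h * h)
    uxx = (u(t, x + h) - 2 * u(t, x) + u(t, x - h)) / (h * h)
    return utt - uxx

def initial_velocity(x, h=1e-5):
    """Central-difference value of u_t(0, x)."""
    return (u(h, x) - u(-h, x)) / (2 * h)

print(wave_residual(0.8, 0.3), initial_velocity(0.3), math.exp(-0.09))
```

The same test function could be replaced by any smooth $g$ with a known antiderivative.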


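Let us take another look at the case $k = 1$, when $n = 3$. From (5.76), $C_3 = 2$,
and then (5.80) gives

(5.84)    $\frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g(x) = t\, \bar g_x(t) = \frac{t}{4\pi} \int_{S^2} g(x + t\omega)\, dS(\omega) = \frac{1}{4\pi t} \int_{|y| = |t|} g(x + y)\, dS(y),$

which is equivalent to (5.47).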
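To solve the wave equation (5.32)-(5.33) for $u = u(t, x)$, $t \in \mathbb{R}$, $x \in \mathbb{R}^n$,
$n = 2k$, we can use the following device, known as the method of descent. Set

(5.85)    $F(x, x_{n+1}) = f(x), \quad G(x, x_{n+1}) = g(x),$

and solve for $U = U(t, x, x_{n+1})$ the wave equation

(5.86)    $\partial_t^2 U - \Delta_{n+1} U = 0, \quad U(0) = F, \quad \partial_t U(0) = G.$

Then $U$ is independent of $x_{n+1}$ and

(5.87)    $u(t, x) = U(t, x, 0).$

In particular,

(5.88)    $\cos t\sqrt{-\Delta}\ f(x) = \cos t\sqrt{-\Delta_{n+1}}\ F(x, 0),$

and

(5.89)    $\frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g(x) = \frac{\sin t\sqrt{-\Delta_{n+1}}}{\sqrt{-\Delta_{n+1}}}\, G(x, 0).$

Using the formula (5.73) for waves on $\mathbb{R}^{n+1} = \mathbb{R}^{2k+1}$, we have, for $v(t, x) =
\cos t\sqrt{-\Delta}\ f(x)$,

(5.90)    $v(t, x) = C_{n+1}\, t \Big( \frac{1}{2t} \frac{d}{dt} \Big)^k \big[ t^{2k-1} \bar f_x^\#(t) \big],$

where

(5.91)    $\bar f_x^\#(r) = \bar F_{(x,0)}(r) = \frac{1}{A_n} \int_{S^n} F((x, 0) + r\omega)\, dS(\omega) = \frac{2}{A_n} \int_{S^n_+} f(x + r\omega^b)\, dS(\omega),$

with

(5.92)    $S^n_+ = \{\omega = (\omega^b, \omega_{n+1}) \in S^n : \omega_{n+1} \ge 0\}.$

Here $A_n$ is the $n$-dimensional area of $S^n$ and, we recall,

(5.93)    $C_{n+1} = \frac{1}{2}\, \pi^{-n/2} A_n.$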

Note that f#(r) = f#(—r). To proceed, map B = {y € R” : |y| < 1} to S® by 


(5.94) yy, vy), vy) = V1-IlyP. 
Then 

dy 
5.95 dS(w) = /1+|V 2 dy = ——2—., 
(5.95) (w) IVo(y)|? dy Ji- we 


and we get 
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2 dy 
(5.96) fF) = / {e+ry) > 
An vi -lyl? 
lyl<1 
Using the identity 
1 
(5.97) i h(y) dy = / H(pw)p"~* dS(w) dp, 
ly|<1 © gia 


- V1-?? 
1 n—1 
(5.98) _ 9g An-1 ‘ Pig a 
an (pr) Toe p 
A 1 


Plugging this in (5.90), we get, for a function f on R” = R?*, t > 0, 


(5.99) t/—Af( )=o4m=1¢ t id [Flt 
. — aa An n+l" \ 9¢ dt coe A Jt — 82 a 
Note that 
An-1 —n/2 2 2 
s, n = a Aly: SS SS Ss SS Se 
(5.100) 2 re Ch41= 7 1 T(n/2) (i) 


Similarly, we have 


(5.101) 


. k-1 
ee Ge [#1 of ()] 
J/—A 2 2t dt % : 


where gi (|tl) is as in (5.86), with G in place of F’, hence as in (5.98), with g,,(s) 
in place of f,,(s). Consequently, for t > 0, 


sinty—A An— 1a\*" 7 srl 
(5.102) ae A oar Ga [ 30) 


and (An—1/An)Cn4i1 = 1/T(k). 
If we specialize to n = 2 (k = 1), we get

(5.103)  \frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g(x) = \int_0^t \bar g_x(s)\, \frac{s}{\sqrt{t^2 - s^2}}\, ds
         = \frac{1}{2\pi} \int_0^t \int_{S^1} g(x - s\omega)\, \frac{s}{\sqrt{t^2 - s^2}}\, dS(\omega)\, ds
         = \frac{1}{2\pi} \int_{|y| \le t} \frac{g(x - y)}{\sqrt{t^2 - |y|^2}}\, dy,


which is equivalent to (5.46). 

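We turn to a re-derivation of the formulas in Proposition 5.5. As before, we
use (5.51), plus formulas for cos t√(−Δ). This time, we combine (5.51) with the
formula (5.79), for a function f on R^n = R^{2k+1}, where C_n is given by (5.74).
We get

(5.104)  \varphi(\sqrt{-\Delta}) f(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^\infty \hat\varphi(t)\, C_n\, t \left( \frac{1}{2t} \frac{d}{dt} \right)^k \left[ t^{2k-1} \bar f_x(t) \right] dt
         = \frac{2 C_n}{\sqrt{2\pi}} \int_0^\infty \left[ \left( -\frac{1}{2t} \frac{d}{dt} \right)^k \hat\varphi(t) \right] t^{2k}\, \bar f_x(t)\, dt.

Now

(5.105)  \int_0^\infty \Phi(r)\, r^{n-1}\, \bar f_x(r)\, dr = \frac{1}{A_{n-1}} \int_0^\infty \int_{S^{n-1}} f(x - r\omega)\, \Phi(r)\, r^{n-1}\, dS(\omega)\, dr
         = \frac{1}{A_{n-1}} \int_{\mathbb{R}^n} f(x - y)\, \Phi(|y|)\, dy.

Hence, using (5.74), we obtain from (5.104) that

(5.106)  \varphi(\sqrt{-\Delta}) f(x) = \frac{1}{\sqrt{2\pi}} \int_{\mathbb{R}^n} \Phi_{2k+1}(|y|)\, f(x - y)\, dy,

for n = 2k + 1, where

(5.107)  \Phi_{2k+1}(r) = \left( -\frac{1}{2\pi r} \frac{d}{dr} \right)^k \hat\varphi(r).

Another way to write (5.106) is

(5.108)  \varphi(\sqrt{-\Delta})\, \delta(x) = \frac{1}{\sqrt{2\pi}}\, \Phi_{2k+1}(|x|),  x \in \mathbb{R}^n,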


which is equivalent to (5.53). 


Remark: The case k = 0 of (5.107) can be seen directly by the Fourier inversion
theorem, without use of the calculations (5.104)-(5.105).

We seek an analogous formula for φ(√(−Δ)) f(x) when f is a function on R^n
with n = 2k. We get this by the following extension of the method of descent,
which gives

(5.109)  \cos t\sqrt{-\Delta}\, f(x) = \cos t\sqrt{-\Delta_{n+1}}\, F(x,0),

with

(5.110)  F(x, x_{n+1}) = f(x).

From this and (5.51), we get

(5.111)  \varphi(\sqrt{-\Delta}) f(x) = \varphi(\sqrt{-\Delta_{n+1}}) F(x,0)
         = \frac{1}{\sqrt{2\pi}} \int_{\mathbb{R}^{n+1}} \Phi_{2k+1}(|(y, y_{n+1})|)\, F(x - y, -y_{n+1})\, dy\, dy_{n+1}
         = \frac{1}{\sqrt{2\pi}} \int_{\mathbb{R}^{2k+1}} \Phi_{2k+1}\big( (|y|^2 + s^2)^{1/2} \big)\, f(x - y)\, dy\, ds,

or

(5.112)  \varphi(\sqrt{-\Delta}) f(x) = \frac{1}{\sqrt{2\pi}} \int_{\mathbb{R}^n} \Phi_{2k}(|y|)\, f(x - y)\, dy,

where

(5.113)  \Phi_{2k}(r) = \int_{-\infty}^\infty \Phi_{2k+1}\big( \sqrt{r^2 + s^2} \big)\, ds,

and Φ_{2k+1} is given by (5.107). The change of variable t = √(r² + s²) gives

(5.114)  \Phi_{2k}(r) = 2 \int_r^\infty \Phi_{2k+1}(t)\, \frac{t}{\sqrt{t^2 - r^2}}\, dt.

Another way to write (5.112) is

(5.115)  \varphi(\sqrt{-\Delta})\, \delta(x) = \frac{1}{\sqrt{2\pi}}\, \Phi_{2k}(|x|),


which is equivalent to (5.54). 
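
The descent step (5.113) can be tested on a case where both answers are known in closed form. The sketch below takes φ(λ) = exp(−τλ²), so that φ(√(−Δ)) is the heat semigroup at time τ, and checks numerically that (2π)^{−1/2} Φ_3 and (2π)^{−1/2} Φ_2 reproduce the free heat kernels on R^3 and R^2. It assumes the Fourier transform convention φ̂(s) = (2π)^{−1/2} ∫ φ(λ) e^{−iλs} dλ, which gives φ̂(s) = (2τ)^{−1/2} exp(−s²/4τ); the helper names and the quadrature parameters are illustrative choices, not taken from the text.

```python
import math


def phi3(r, tau):
    """Phi_3(r) = -(1/(2 pi r)) d/dr phi_hat(r) for phi(lam) = exp(-tau lam^2).

    With phi_hat(s) = (2 tau)^(-1/2) exp(-s^2 / (4 tau)), the derivative is explicit.
    """
    return math.exp(-r * r / (4.0 * tau)) / (4.0 * math.pi * tau * math.sqrt(2.0 * tau))


def phi2_by_descent(r, tau, steps=4000):
    """Phi_2(r) = integral over s in R of Phi_3(sqrt(r^2 + s^2)), as in (5.113).

    Trapezoid rule on [-L, L]; the Gaussian tail beyond L = 14 sqrt(tau) is negligible.
    """
    half_width = 14.0 * math.sqrt(tau)
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        s = -half_width + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * phi3(math.sqrt(r * r + s * s), tau)
    return total * h


def heat_kernel(n, r, tau):
    """Free heat kernel (4 pi tau)^(-n/2) exp(-r^2 / (4 tau)) on R^n."""
    return (4.0 * math.pi * tau) ** (-n / 2) * math.exp(-r * r / (4.0 * tau))


if __name__ == "__main__":
    r, tau = 0.7, 0.3
    print(phi3(r, tau) / math.sqrt(2.0 * math.pi), heat_kernel(3, r, tau))
    print(phi2_by_descent(r, tau) / math.sqrt(2.0 * math.pi), heat_kernel(2, r, tau))
```

The integral is evaluated in the form (5.113) rather than (5.114) because the integrand in (5.113) is smooth, while (5.114) has an integrable singularity at t = r.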


Exercises 


1. A function g ∈ C^∞(R^n) is said to be in the Gevrey class G^σ(R^n) provided, for each
compact K ⊂ R^n, there exist C and R such that

(5.116)  |D^\alpha g(x)| \le C R^k k^{\sigma k},  |\alpha| = k,  x \in K.

The class G^1(R^n) is equal to the space of real-analytic functions on R^n. If σ > 1, show
that there exist compactly supported elements of G^σ(R^n), not identically zero.
Remark: This is part of the Denjoy-Carleman theorem; see [Ru].

2. Consider the following “sideways heat equation” for u = u(t,x) on R × R:

u_t = u_{xx},  u(t,0) = g(t),  u_x(t,0) = 0.

Show that if g ∈ G^σ(R) for some σ ∈ (1,2), then a solution on all of R × R is given
by the convergent series

(5.117)  u(t,x) = \sum_{k=0}^\infty \frac{x^{2k}}{(2k)!}\, g^{(k)}(t),

which is the power series for the “formal” object (cos x√(−∂/∂t)) g. Using Exercise 1,
find nontrivial solutions to u_t = u_{xx} which are supported in a strip a < t < b. This
construction is due to J. Rauch.

(Hint: To prove convergence, use Stirling’s formula:

(5.118)  n! \sim (2\pi n)^{1/2} e^{-n} n^n  as n \to \infty.)


3. Given the formula (5.37) for R̂(t,ξ), that is,

\hat R(t,\xi) = (2\pi)^{-n/2} |\xi|^{-1} \sin t|\xi|,  \xi \in \mathbb{R}^n,

show that the fact that R(t,·) ∈ S'(R^n) is supported in B_{|t|} = {x ∈ R^n : |x| ≤ |t|}
follows from the Paley-Wiener theorem for distributions, given in Exercise 16 of §4.
Exercises 4-7 provide justification for passing from (5.69)-(5.71) to (5.73).

4. Take v ∈ L^1(R^+) and assume

\int_0^\infty v(s) e^{-\lambda s^2}\, ds = 0,  \forall\, \lambda > 0.

Deduce that v = 0. (Hint. Use the Stone-Weierstrass theorem to show that if e_λ(s) =
e^{−λs²}, then the linear span of {e_λ : λ > 0} is dense in C_*(R^+), the space of continuous
functions on R^+ = [0, ∞) that vanish at ∞. Hence the hypothesis implies
∫ v(s) f(s) ds = 0 for all f ∈ C_*(R^+).)

5. Show that if instead we assume v ∈ L^∞(R^+), the result of Exercise 4 still holds. (Hint.
Consider v_ε(s) = e^{−εs²} v(s).)

6. Show that if f, g ∈ S(R^n), then the solution u to (5.32)-(5.33) is bounded and continuous
on R × R^n. Hence deduce the validity of (5.73) for f ∈ S(R^n).

7. Extend the range of validity of (5.73) from f ∈ S(R^n) to other function spaces, including
f ∈ C^∞(R^n).


6. Radial distributions, polar coordinates, and Bessel 
functions 


The rotational invariance of the Laplace operator on R” directly suggests the use 
of polar coordinates; one has 


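(6.1)  \Delta = \frac{\partial^2}{\partial r^2} + \frac{n-1}{r} \frac{\partial}{\partial r} + \frac{1}{r^2} \Delta_S,


where Δ_S is the Laplace operator on the unit sphere S^{n−1}. This formula has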
been used in (4.56) and follows from the formula given in Chap. 2 for the Laplace 
operator in a general coordinate system; see (4.4) in that chapter. 

Related is the fact that in treating the equations of §5 via Fourier analysis, one 
computes the Fourier transforms of various radial functions, such Fourier trans- 
forms also being radial functions (or rotationally invariant tempered distributions). 
Bessel functions arise naturally in either approach, and we will develop a little of 
the theory of Bessel functions here. More results on Bessel functions will appear 
in Chap. 8, which discusses spectral theory, and in Chap. 9, which treats scattering 
theory. One can find further material on this subject in Chapter 7 of the complex 
analysis text [Tay4], and in the treatise [Wat]. 

We begin by considering the Fourier transform of a radial function, F(x) =
f(r), r = |x|. We have

(6.2)  \hat F(\xi) = (2\pi)^{-n/2} \int_0^\infty f(r)\, \psi_n(r|\xi|)\, r^{n-1}\, dr,

where

(6.3)  \psi_n(|\xi|) = \Psi_n(\xi),

with

(6.4)  \Psi_n(\xi) = \int_{S^{n-1}} e^{i\xi\cdot\omega}\, dS(\omega).

In other words, with A_{n−2} the volume of S^{n−2},

(6.5)  \psi_n(r) = A_{n-2} \int_{-1}^1 e^{irs} (1 - s^2)^{(n-3)/2}\, ds.

From (A.4) we have A_{n−2} = 2π^{(n−1)/2}/Γ((n − 1)/2). It is common to write

(6.6)  \psi_n(r) = (2\pi)^{n/2}\, r^{1-n/2}\, J_{n/2-1}(r),

where, for general ν satisfying Re ν > −1/2, the Bessel function J_ν(z) is defined
to be

(6.7)  J_\nu(z) = \left[ \Gamma\left(\tfrac12\right) \Gamma\left(\nu + \tfrac12\right) \right]^{-1} \left( \frac{z}{2} \right)^\nu \int_{-1}^1 (1 - t^2)^{\nu - 1/2}\, e^{izt}\, dt.

For example, ψ_2(r) = 2πJ_0(r). Since (1 − t²)^{ν−1/2} is even in t, one can replace
e^{izt} by cos zt in this formula. Now, (6.2) becomes

(6.8)  \hat F(\xi) = |\xi|^{1-n/2} \int_0^\infty f(r)\, J_{n/2-1}(r|\xi|)\, r^{n/2}\, dr.


We want to consider the ODE, known as Bessel’s equation, solved by J_ν(r).
First we consider the case ν = n/2 − 1. Since Ψ_n is the Fourier transform of a
measure supported on the unit sphere, we have that

(6.9)  (\Delta + 1)\Psi_n = 0.

Using the polar coordinate expression (6.1) for Δ, we have

(6.10)  \left( \frac{d^2}{dr^2} + \frac{n-1}{r} \frac{d}{dr} + 1 \right) \psi_n(r) = 0.

Substituting (6.6) yields Bessel’s equation

(6.11)  \left[ \frac{d^2}{dr^2} + \frac{1}{r} \frac{d}{dr} + \left( 1 - \frac{\nu^2}{r^2} \right) \right] J_\nu(r) = 0

in case ν = n/2 − 1. We want to verify this for all ν, from the integral formula
(6.7). This is an exercise, but we will present the details of one approach, which
yields further interesting identities for Bessel functions. For notational simplicity,
let us set

c(\nu) = \left[ \Gamma\left(\tfrac12\right) \Gamma\left(\nu + \tfrac12\right) \right]^{-1} 2^{-\nu}.


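Differentiating (6.7) with respect to z yields

(6.12)  \frac{d}{dz} J_\nu(z) = \nu c(\nu) z^{\nu-1} \int_{-1}^1 e^{izt} (1 - t^2)^{\nu-1/2}\, dt
        + i c(\nu) z^\nu \int_{-1}^1 e^{izt}\, t\, (1 - t^2)^{\nu-1/2}\, dt.

The first term on the right is equal to (ν/z) J_ν(z). The second is equal to

c(\nu) z^{\nu-1} \int_{-1}^1 \left( \frac{d}{dt} e^{izt} \right) t\, (1 - t^2)^{\nu-1/2}\, dt
  = -c(\nu) z^{\nu-1} \int_{-1}^1 e^{izt} \left[ (1 - t^2)^{\nu-1/2} - (2\nu - 1)\, t^2 (1 - t^2)^{\nu-3/2} \right] dt
  = -\frac{2\nu}{z} J_\nu(z) + \frac{(2\nu - 1)\, c(\nu)}{c(\nu - 1)}\, J_{\nu-1}(z).

Since c(ν)/c(ν − 1) = 1/(2ν − 1), we have the formula

(6.13)  \frac{d}{dz} J_\nu(z) = -\frac{\nu}{z} J_\nu(z) + J_{\nu-1}(z),

or

(6.14)  \left( \frac{d}{dz} + \frac{\nu}{z} \right) J_\nu(z) = J_{\nu-1}(z).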


As we have stated, the formula (6.7) for J_ν(z) is convergent for Re ν > −1/2.
The formula (6.14) provides an analytic continuation for all complex ν. In fact,
one can see directly that the integral in (6.7) is meromorphic in ν, with simple
poles at ν + 1/2 = −1, −2, .... The factor Γ(ν + 1/2)^{−1} cancels these poles.
This serves to explain the desirability of throwing in this factor in the definition
(6.7) of J_ν(z). Of course, the factor Γ(1/2)^{−1} is more arbitrary.

Next, we note that

J_{\nu+1}(z) = c(\nu + 1)\, z^{\nu+1} \int_{-1}^1 e^{izt} (1 - t^2)^{\nu+1/2}\, dt
             = -i\, c(\nu + 1)(2\nu + 1)\, z^\nu \int_{-1}^1 e^{izt}\, t\, (1 - t^2)^{\nu-1/2}\, dt,

and since c(ν + 1) = c(ν)/(2ν + 1), this is equal to the negative of the second
term on the right in (6.12). Hence we have

(6.15)  \left( \frac{d}{dz} - \frac{\nu}{z} \right) J_\nu(z) = -J_{\nu+1}(z),

complementing (6.14). Putting together (6.14) and (6.15), we have

(6.16)  \left( \frac{d}{dz} - \frac{\nu - 1}{z} \right) \left( \frac{d}{dz} + \frac{\nu}{z} \right) J_\nu(z) = -J_\nu(z),

which is equivalent to Bessel’s equation, (6.11). Note that adding and subtracting
(6.14) and (6.15) produce the identities

(6.17)  2 J_\nu'(z) = J_{\nu-1}(z) - J_{\nu+1}(z),
        \frac{2\nu}{z} J_\nu(z) = J_{\nu-1}(z) + J_{\nu+1}(z).


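Note that, by analytic continuation, J_{−ν}(z) is also a solution to Bessel’s equation.
This equation, for each ν, has a two-dimensional solution space. We will
examine when J_ν(z) and J_{−ν}(z) are linearly independent. First, we will obtain
a power-series expansion for J_ν(z). This is done by replacing e^{izt} by its power-series
expansion in (6.7). To simplify the expression for the coefficients, one uses
identities for the beta function and the gamma function established in Appendix A.
From (6.7), we have

(6.18)  J_\nu(z) = \left[ \Gamma\left(\tfrac12\right) \Gamma\left(\nu + \tfrac12\right) \right]^{-1} \left( \frac{z}{2} \right)^\nu \sum_{k=0}^\infty \frac{1}{(2k)!} (iz)^{2k} \int_{-1}^1 t^{2k} (1 - t^2)^{\nu-1/2}\, dt.

The identity (A.24) implies

\int_{-1}^1 t^{2k} (1 - t^2)^{\nu-1/2}\, dt = \frac{\Gamma(k + \tfrac12)\, \Gamma(\nu + \tfrac12)}{\Gamma(k + \nu + 1)},

so

J_\nu(z) = \frac{(z/2)^\nu}{\Gamma(\tfrac12)\, \Gamma(\nu + \tfrac12)} \sum_{k=0}^\infty \frac{(iz)^{2k}}{(2k)!}\, \frac{\Gamma(k + \tfrac12)\, \Gamma(\nu + \tfrac12)}{\Gamma(k + \nu + 1)}.

Setting (2k)! = Γ(2k + 1), and using the duplication formula (A.22), which
implies

\frac{\Gamma(k + \tfrac12)}{\Gamma(\tfrac12)\, \Gamma(2k + 1)} = \frac{2^{-2k}}{\Gamma(k + 1)},

we obtain the formula

(6.19)  J_\nu(z) = \left( \frac{z}{2} \right)^\nu \sum_{k=0}^\infty \frac{(-1)^k}{\Gamma(k + 1)\, \Gamma(k + \nu + 1)} \left( \frac{z}{2} \right)^{2k}.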


This follows from (6.18) if Re ν > −1/2, and then for general ν by analytic
continuation. In particular, we note the leading behavior as z → 0,

(6.20)  J_\nu(z) = \frac{z^\nu}{2^\nu\, \Gamma(\nu + 1)} + O(z^{\nu+1})

and

(6.21)  J_\nu'(z) = \frac{z^{\nu-1}}{2^\nu\, \Gamma(\nu)} + O(z^\nu).

The leading coefficients are nonzero as long as ν is not a negative integer (or 0,
for (6.21)).
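
The integral definition (6.7) and the power series (6.19) can be compared numerically. The following Python sketch does this with the standard library only; the function names, the number of series terms, and the quadrature (Simpson's rule after the substitution t = sin θ, which removes the endpoint singularity of (1 − t²)^{ν−1/2}) are illustrative choices rather than anything prescribed by the text. It is restricted to real z > 0 and, for the integral, to ν ≥ 0.

```python
import math


def bessel_j_series(nu, z, terms=40):
    """J_nu(z) from the power series (6.19), for real z > 0."""
    total = 0.0
    for k in range(terms):
        total += (-1) ** k * (z / 2.0) ** (2 * k) / (
            math.gamma(k + 1) * math.gamma(k + nu + 1)
        )
    return (z / 2.0) ** nu * total


def bessel_j_integral(nu, z, steps=2000):
    """J_nu(z) from the integral formula (6.7), for nu >= 0 and real z > 0.

    After t = sin(theta) the integral becomes
    int_{-pi/2}^{pi/2} cos(z sin(theta)) cos(theta)^(2 nu) d(theta),
    evaluated by Simpson's rule (steps must be even).
    """
    a = -math.pi / 2.0
    h = math.pi / steps
    total = 0.0
    for i in range(steps + 1):
        theta = a + i * h
        c = max(math.cos(theta), 0.0)  # guard against tiny negative rounding at the ends
        value = math.cos(z * math.sin(theta)) * c ** (2.0 * nu)
        if i in (0, steps):
            weight = 1.0
        elif i % 2 == 1:
            weight = 4.0
        else:
            weight = 2.0
        total += weight * value
    integral = total * h / 3.0
    return (z / 2.0) ** nu * integral / (math.gamma(0.5) * math.gamma(nu + 0.5))


if __name__ == "__main__":
    for nu, z in ((0.0, 1.0), (0.5, 2.0), (1.0, 3.0), (2.5, 1.5)):
        print(nu, z, bessel_j_series(nu, z), bessel_j_integral(nu, z))
```

For ν = 1/2 the series value can also be compared with the elementary expression (2/πz)^{1/2} sin z.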

From the expression (6.19) it is clear that J_ν(z) and J_{−ν}(z) are linearly independent
provided ν is not an integer. On the other hand, comparison of power
series shows

(6.22)  J_{-n}(z) = (-1)^n J_n(z),  n = 0, 1, 2, \ldots.

We want to construct a basis of solutions to Bessel’s equation, uniformly good for
all ν. This construction can be motivated by a calculation of the Wronskian.
Generally, for a pair of solutions u_1 and u_2 to a second-order ODE

a(z) u'' + b(z) u' + c(z) u = 0,

u_1 and u_2 are linearly independent if and only if their Wronskian

(6.23)  W(z) = W(u_1, u_2)(z) = u_1 u_2' - u_2 u_1'

is nonvanishing. Note that the Wronskian satisfies the first-order ODE

(6.24)  W'(z) = -\frac{b(z)}{a(z)}\, W(z).

In the case of Bessel’s equation, (6.11), this becomes

(6.25)  W'(z) = -\frac{W(z)}{z},

so

(6.26)  W(z) = \frac{K}{z}

for some K (independent of z, but perhaps depending on ν). If u_1 = J_ν, u_2 =
J_{−ν}, we can compute K by considering the limiting behavior of W(z) as z → 0.
From (6.20) and (6.21), we get

(6.27)  W(J_\nu, J_{-\nu})(z) = -\left[ \frac{1}{\Gamma(\nu)\Gamma(1 - \nu)} - \frac{1}{\Gamma(\nu + 1)\Gamma(-\nu)} \right] \frac{1}{z} = -\frac{2 \sin \pi\nu}{\pi z},

making use of the identity (A.10). This recaptures the observation that J_ν and
J_{−ν} are linearly independent, and consequently a basis of solutions to (6.11), if
and only if ν is not an integer.
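
The Wronskian formula (6.27) lends itself to a direct numerical check. The sketch below differentiates the series (6.19) term by term to obtain J_ν and J_ν' and then forms W(J_ν, J_{−ν}) as in (6.23); it should agree with −2 sin(πν)/(πz). This is only an illustration: the function names and the truncation at 40 terms are assumptions made here, and the code is meant for real z > 0 and non-integer ν.

```python
import math


def bessel_j_and_derivative(nu, z, terms=40):
    """Return (J_nu(z), J_nu'(z)) by termwise use of the series (6.19).

    Intended for real z > 0 and nu not an integer.
    """
    j = 0.0
    dj = 0.0
    for k in range(terms):
        coeff = (-1) ** k / (math.gamma(k + 1) * math.gamma(k + nu + 1))
        power = 2 * k + nu
        j += coeff * (z / 2.0) ** power
        # d/dz (z/2)^power = (power/2) (z/2)^(power-1)
        dj += coeff * 0.5 * power * (z / 2.0) ** (power - 1)
    return j, dj


def wronskian_j(nu, z):
    """W(J_nu, J_{-nu})(z) = J_nu J_{-nu}' - J_{-nu} J_nu', as in (6.23)."""
    j_plus, dj_plus = bessel_j_and_derivative(nu, z)
    j_minus, dj_minus = bessel_j_and_derivative(-nu, z)
    return j_plus * dj_minus - j_minus * dj_plus


if __name__ == "__main__":
    for nu in (0.3, 0.5, 1.7):
        for z in (0.8, 2.5):
            print(nu, z, wronskian_j(nu, z), -2.0 * math.sin(math.pi * nu) / (math.pi * z))
```

As ν approaches an integer, both columns tend to zero, which is the numerical counterpart of the linear dependence expressed by (6.22).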

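To construct a basis of solutions uniformly good for all ν, it is natural to set

(6.28)  Y_\nu(z) = \frac{J_\nu(z) \cos \pi\nu - J_{-\nu}(z)}{\sin \pi\nu}

when ν is not an integer, and define

(6.29)  Y_n(z) = \lim_{\nu \to n} Y_\nu(z) = \frac{1}{\pi} \left[ \frac{\partial J_\nu(z)}{\partial \nu} - (-1)^n \frac{\partial J_{-\nu}(z)}{\partial \nu} \right]_{\nu = n}.

We have

(6.30)  W(J_\nu, Y_\nu)(z) = \frac{2}{\pi z},

for all ν. Another important pair of solutions to Bessel’s equation is the pair of
Hankel functions

(6.31)  H_\nu^{(1)}(z) = J_\nu(z) + i Y_\nu(z),  H_\nu^{(2)}(z) = J_\nu(z) - i Y_\nu(z).

For H_ν^{(1)}, there is the integral formula

(6.32)  H_\nu^{(1)}(z) = \frac{2 e^{-\pi i \nu}}{i \sqrt{\pi}\, \Gamma(\nu + \tfrac12)} \left( \frac{z}{2} \right)^\nu \int_1^\infty e^{izt} (t^2 - 1)^{\nu-1/2}\, dt,

for Re ν > −1/2, Im z > 0. Another formula, valid for Re ν > −1/2 and Re
z > 0, is

(6.33)  H_\nu^{(1)}(z) = \left( \frac{2}{\pi z} \right)^{1/2} \frac{e^{i(z - \pi\nu/2 - \pi/4)}}{\Gamma(\nu + \tfrac12)} \int_0^\infty e^{-s} s^{\nu-1/2} \left( 1 - \frac{s}{2iz} \right)^{\nu-1/2} ds.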


To prove these identities, one can show as above that each of the right sides of
(6.32) and (6.33) satisfies the same recursion formulas as J_ν(z) and hence solves
the Bessel equation; thus it is a linear combination of J_ν(z) and Y_ν(z). The coefficients
can be found by examining the limiting behavior as z → 0, to establish the
asserted identity. Hankel functions are important in scattering theory; see Chap. 9.

It is worth pointing out that the Bessel functions J_{k+1/2}(z), etc., for k an
integer, are elementary functions, particularly since they arise in analysis on odd-dimensional
Euclidean space. For ν = k + 1/2, the integrand in (6.7) involves
(1 − t²)^k, so the integral can be evaluated explicitly. We have, in particular,

(6.34)  J_{1/2}(z) = \left( \frac{2}{\pi z} \right)^{1/2} \sin z.

Then (6.14) gives

(6.35)  J_{-1/2}(z) = \left( \frac{2}{\pi z} \right)^{1/2} \cos z,

which by (6.28) is equal to −Y_{1/2}(z). Applying (6.16) and (6.14) repeatedly gives

(6.36)  J_{k+1/2}(z) = (-1)^k \left\{ \prod_{j=1}^k \left( \frac{d}{dz} - \frac{j - \tfrac12}{z} \right) \right\} \frac{\sin z}{\sqrt{\pi z/2}},

and the same sort of formula for J_{−k−1/2}(z), with the (−1)^k removed, and sin z
replaced by cos z. Similarly,

(6.37)  H_{1/2}^{(1)}(z) = -i \left( \frac{2}{\pi z} \right)^{1/2} e^{iz},

with a formula for H_{k+1/2}^{(1)} similar to (6.36).
We now make contact between the formulas (6.2)-(6.6) and some of the formulas
of §5, particularly from Proposition 5.5. Note that if F(x) = f(|x|), then

(6.38)  \hat F(x) = (2\pi)^{n/2} f(\sqrt{-\Delta})\, \delta.

Thus, as in (5.51), we have, with f extended to be even in r,

(6.39)  \hat F(x) = (2\pi)^{n/2-1} \int_{-\infty}^\infty \int_{-\infty}^\infty f(r)\, e^{itr} R_n'(t,x)\, dt\, dr,

where R_n'(t,x) = (∂/∂t) R_n(t,x), and R_n(t,x) is the Riemann function given
by (5.43). Comparison with (6.2) gives

\psi_n(r|x|) = 2 (2\pi)^{n-1} r^{1-n} \int_{-\infty}^\infty e^{itr} R_n'(t,x)\, dt

or, equivalently,

(6.40)  J_{n/2-1}(r|x|) = 2 (2\pi)^{n/2-1} \left( \frac{|x|}{r} \right)^{n/2-1} \int_{-\infty}^\infty (\sin tr)\, R_n(t,x)\, dt.

Note that using R_3(t,x) = (4πt)^{−1} δ(|t| − |x|) gives again the formula (6.34) for
J_{1/2}(r). Note also that the recursive formula

R_{n+2}(t,s) = -\frac{1}{2\pi s} \frac{\partial}{\partial s} R_n(t,s),

used in the proof of Proposition 5.5, when substituted into (6.40), gives rise to the
formula (6.15), in the case ν = n/2 − 1.
Instead of synthesizing functions of Δ via the formula (5.51), we could use

(6.41)  g(-\Delta) = (2\pi)^{-1/2} \int_{-\infty}^\infty \hat g(t)\, e^{-it\Delta}\, dt,

where the operator e^{−itΔ} is obtained from the solution operator e^{tΔ} to the heat
equation by analytic continuation:

(6.42)  e^{-it\Delta} \delta(x) = (-4\pi i t)^{-n/2} e^{|x|^2/4it},  t \ne 0.

If f(r) = g(r²), with g real-valued and even, we get

(6.43)  \hat g(t) = 2 \left( \frac{2}{\pi} \right)^{1/2} \int_0^\infty f(r) (\cos r^2 t)\, r\, dr

and hence

(6.44)  g(-\Delta)\delta = \frac{2}{\pi} \int_{-\infty}^\infty \int_0^\infty (-4\pi i t)^{-n/2} (\cos r^2 t)\, e^{|x|^2/4it} f(r)\, r\, dr\, dt.


Comparison of this with (6.2) gives 


(6.45) nose / (cos 24) el#?/4ity—n/2 gp 


—co 


where, for n odd, we take t^{−n/2} = lim_{ε↘0} (t − iε)^{−n/2}. Note that (6.45) is an
improper integral near t = 0.

We will not look in detail at implications of (6.41)-(6.44), which are generally
not as incisive as those of Proposition 5.5, but we will briefly make a connection
with the idea, used in §5, of synthesizing operators from the heat semigroup.
Recall particularly the formula for the resolvent:

(6.46)  (\lambda^2 - \Delta)^{-1} = \int_0^\infty e^{-\lambda^2 t} e^{t\Delta}\, dt.

Generalizing (5.28)-(5.31), we have, for λ > 0,

(6.47)  (\lambda^2 - \Delta)^{-1} \delta = \int_0^\infty e^{-|x|^2/4t - \lambda^2 t} (4\pi t)^{-n/2}\, dt.


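A superficial resemblance with (6.45) suggests that this function is related to
Bessel functions. This is consistent with the fact that the resolvent kernel R_λ(x) =
(λ² − Δ)^{−1} δ, a function of r = |x|, satisfies the ODE

(6.48)  \left[ \frac{d^2}{dr^2} + \frac{n-1}{r} \frac{d}{dr} - \lambda^2 \right] R_\lambda = 0,

as a consequence of the formula (6.1) for the Laplace operator in polar coordinates;
this is similar to (6.10), with 1 replaced by −λ². In fact, there is the
following result. From (6.47),

(6.49)  (\lambda^2 - \Delta)^{-1} \delta(x) = (2\pi)^{-n/2} \left( \frac{|x|}{\lambda} \right)^{1-n/2} K_{n/2-1}(\lambda |x|),

where

(6.50)  K_\nu(r) = \frac12 \left( \frac{r}{2} \right)^\nu \int_0^\infty e^{-r^2/4t - t}\, t^{-1-\nu}\, dt.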


Simple manipulations of (6.50) produce the following analogues of (6.14)-(6.16):

(6.51)  \left( \frac{d}{dr} \pm \frac{\nu}{r} \right) K_\nu(r) = -K_{\nu \mp 1}(r),
        K_{\nu+1}(r) - K_{\nu-1}(r) = \frac{2\nu}{r} K_\nu(r),

so we have the ODE

(6.52)  \left[ \frac{d^2}{dr^2} + \frac{1}{r} \frac{d}{dr} - \left( 1 + \frac{\nu^2}{r^2} \right) \right] K_\nu(r) = 0,  r > 0,

which differs from Bessel’s equation (6.11) only in one sign. The ODE (6.52) is
solved by J_ν(ir) and by Y_ν(ir), so K_ν(r) must be a linear combination of these
functions. In fact,

(6.53)  K_\nu(r) = \frac12 \pi i\, e^{\pi i \nu/2} H_\nu^{(1)}(ir),  r > 0.

A proof of (6.53) can be found in [Leb], Chap. 5; see also Exercise 4 below. When
ν = 1/2, (6.53) follows from (6.33) and (6.34) together with the identity

(6.54)  K_{1/2}(r) = \left( \frac{\pi}{2r} \right)^{1/2} e^{-r},

which in turn, given (6.50), follows from the subordination identity. Then the
recursion relation (6.51) and analogues for the Bessel functions imply (6.53) for
ν of the form ν = k + 1/2, when k is a positive integer.
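
The integral formula (6.50) is straightforward to evaluate numerically, which gives an independent check of the closed form (6.54) and of the recursion in (6.51). In the Python sketch below, the substitution t = e^u and the trapezoid rule are choices made here for convenience (the transformed integrand decays doubly exponentially in both directions); the function name and the integration window are likewise assumptions, not part of the text.

```python
import math


def bessel_k(nu, r, steps=6000):
    """K_nu(r) from the integral formula (6.50), for r > 0.

    With t = exp(u) the integral becomes
    int_{-inf}^{inf} exp(-r^2 exp(-u)/4 - exp(u) - nu*u) du,
    which is negligible outside [-30, 10] for moderate r and nu.
    """
    lo, hi = -30.0, 10.0
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        u = lo + i * h
        exponent = -r * r * math.exp(-u) / 4.0 - math.exp(u) - nu * u
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(exponent)  # underflows harmlessly to 0.0 in the tails
    return 0.5 * (r / 2.0) ** nu * total * h


if __name__ == "__main__":
    for r in (0.5, 1.0, 3.0):
        print(r, bessel_k(0.5, r), math.sqrt(math.pi / (2.0 * r)) * math.exp(-r))
```

The same routine can be used to confirm K_{ν+1}(r) − K_{ν−1}(r) = (2ν/r) K_ν(r), for example with ν = 1.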

We mention that it is customary to take a second, linearly independent, solution 
to (6.52) to be 


(6.55)  I_\nu(r) = e^{-\pi i \nu/2} J_\nu(ir),  r > 0.


The functions K_ν(r) and I_ν(r) are called modified Bessel functions; also K_ν(r)
is sometimes called MacDonald’s function.


Exercises 


1. Using the integral formula (6.7), show that, for fixed ν > −1/2, as z → +∞,

(6.56)  J_\nu(z) = \left( \frac{2}{\pi z} \right)^{1/2} \cos\left( z - \frac{\pi\nu}{2} - \frac{\pi}{4} \right) + O(z^{-3/2}).

(Hint: The endpoint contributions from the integral give exponentials times Fourier
transforms of functions with simple singularities at the origin.)
Reconsider this problem after reading §§7 and 8.

Similarly, using (6.33), show that, for fixed ν > −1/2, as z → +∞,

(6.57)  H_\nu^{(1)}(z) = \left( \frac{2}{\pi z} \right)^{1/2} e^{i(z - \pi\nu/2 - \pi/4)} + O(z^{-3/2}).

2. Using the integral formula (6.50), show that, for fixed ν, as r → +∞,

K_\nu(r) = \left( \frac{\pi}{2r} \right)^{1/2} e^{-r} \left[ 1 + O(r^{-1}) \right].

(Hint: Use the Laplace asymptotic method, such as applied to the gamma function
in the appendix to this chapter; compare (A.34)-(A.39). To implement this,
rewrite (6.50) as

K_\nu(r) = 2^{-1-\nu} \int_0^\infty e^{-r(s + 1/4s)}\, s^{-1-\nu}\, ds.

Note that φ(s) = s + 1/4s has its minimum at s = 1/2.)
3. Using the definition (6.55) for I_ν(r), and plugging z = ir into the integral formula
(6.7) for J_ν(z), show that, for fixed ν > −1/2, as r → +∞,

I_\nu(r) = \left( \frac{1}{2\pi r} \right)^{1/2} e^{r} \left[ 1 + O(r^{-1}) \right].

4. Show that, for r > 0,

K_\nu(r) = \frac{\pi^{1/2}}{\Gamma(\nu + \tfrac12)} \left( \frac{r}{2} \right)^\nu \int_1^\infty e^{-rt} (t^2 - 1)^{\nu-1/2}\, dt,

by showing that the function on the right solves the modified Bessel equation (6.52) and
has the same asymptotic behavior as r → +∞ as K_ν(r) does, according to Exercise
2. Hence establish the identity (6.53).

5. For y, a > 0, ξ ∈ R^n, consider

\hat F(\xi) = e^{-y (|\xi|^2 + a^2)^{1/2}}.

Applying the subordination identity (5.22) to A² = |ξ|² + a², and taking Fourier transforms
of both sides of the resulting identity, show that

F(x) = c_n\, y \int_0^\infty e^{-(y^2 + |x|^2)/4t}\, e^{-t a^2}\, t^{-(n+3)/2}\, dt = c_n'\, y\, a^\nu r^{-\nu} K_\nu(ar),

with ν = (n + 1)/2, r² = |x|² + y².

6. Using analytic continuation involving both y and a, find an expression for the fundamental
solution to

u_{tt} + 2a u_t - \Delta u = 0,

for u = u(t,x), t ∈ R, x ∈ R^n, where a is a real number. Be explicit in the case
n = 2, using the elementary character of K_{3/2}(z).

7. Show that, under the change of variable u(r) = r^α f(cr), Bessel’s equation (6.11),
u''(r) + (1/r) u'(r) + (1 − ν²/r²) u(r) = 0, is transformed to

(6.58)  f''(r) + \frac{A}{r} f'(r) + \left( \mu^2 - \frac{\lambda^2}{r^2} \right) f(r) = 0,

with

A = 2α + 1,  μ = c,  λ² = ν² − α².

8. Suppose in particular that v(x) = f(r) w(ω), ω ∈ O ⊂ S^{n−1}, and Δ_S w = −λ² w.
Show that the equation Δv = −μ² v is equivalent to (6.58), with A = n − 1, so
α = n/2 − 1. Thus f is a linear combination of r^{1−n/2} J_ν(μr) and r^{1−n/2} H_ν^{(1)}(μr),
with ν = [λ² + (n − 2)²/4]^{1/2}.

9. Show that, complementary to (6.20), we have, for ν > 0,

H_\nu^{(1)}(z) \sim \frac{\Gamma(\nu)}{\pi i} \left( \frac{2}{z} \right)^\nu,  z \searrow 0.


7. The method of images and Poisson’s summation formula


We discuss here techniques for solving such problems as the Dirichlet problem
for the heat equation on a rectangular solid in R^n, defined by

(7.1)  \Omega = \{ x \in \mathbb{R}^n : 0 \le x_j \le a_j,\ 1 \le j \le n \}.

That is, we want to solve

(7.2)  \frac{\partial u}{\partial t} - \Delta u = 0,

for u = u(t,x), t ≥ 0, x ∈ Ω, subject to the boundary condition

(7.3)  u \big|_{\mathbb{R}^+ \times \partial\Omega} = 0

and the initial condition

(7.4)  u(0,x) = f(x),  x \in \Omega.


There are two ways of doing this. One involves using Fourier series on the torus
R^n/2Γ, where Γ is the lattice in R^n generated by a_j e_j (e_j being the standard basis
of R^n). The other is to use the solution on R^+ × R^n constructed in §5 together
with the method of images, described below. Comparing these methods provides
interesting analytical identities.

The method of images works as follows. Let u^# solve the heat equation

(7.5)  \frac{\partial u^\#}{\partial t} - \Delta u^\# = 0  on \mathbb{R}^+ \times \mathbb{R}^n,

with initial data

(7.6)  u^\#(0,x) = f^\#(x),

where f^# = f on Ω and f^# is odd with respect to reflections across the walls
of all the translates of Ω by elements of Γ. The set of such translates is a set of
rectangles tiling R^n, and f^# is uniquely determined by this prescription. Since
reflections are isometries, it follows that, for each t > 0, u^#(t,·) is odd with
respect to such reflections; since u^# is smooth for t > 0, it must therefore vanish
on all these walls. The restriction of u^#(t,x) to R^+ × Ω is hence the desired
solution to (7.2)-(7.4).
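
In one space dimension, with Ω = [0, a] and a point source as initial data, the two approaches can be compared directly. The following Python sketch builds the Dirichlet heat kernel once by the method of images (odd reflection across x = 0 and x = a, which places sources +δ at y + 2ak and −δ at −y + 2ak) and once from the Fourier sine series on [0, a], and the two should agree. The function names and truncation parameters are illustrative assumptions, and the example is a special case chosen for simplicity rather than a general implementation.

```python
import math


def free_heat_kernel(t, x):
    """Heat kernel on the line: (4 pi t)^(-1/2) exp(-x^2 / (4 t))."""
    return math.exp(-x * x / (4.0 * t)) / math.sqrt(4.0 * math.pi * t)


def dirichlet_kernel_images(t, x, y, a, reflections=30):
    """Dirichlet heat kernel on [0, a] by the method of images.

    The odd extension of a unit source at y has sources +1 at y + 2ak
    and -1 at -y + 2ak, for every integer k.
    """
    total = 0.0
    for k in range(-reflections, reflections + 1):
        total += free_heat_kernel(t, x - y - 2.0 * a * k)
        total -= free_heat_kernel(t, x + y - 2.0 * a * k)
    return total


def dirichlet_kernel_series(t, x, y, a, modes=200):
    """The same kernel from the Fourier sine series on [0, a]."""
    total = 0.0
    for n in range(1, modes + 1):
        lam = n * math.pi / a
        total += math.sin(lam * x) * math.sin(lam * y) * math.exp(-lam * lam * t)
    return 2.0 * total / a


if __name__ == "__main__":
    t, a, x, y = 0.05, 1.0, 0.3, 0.6
    print(dirichlet_kernel_images(t, x, y, a), dirichlet_kernel_series(t, x, y, a))
    # The image sum vanishes on the walls x = 0 and x = a.
    print(dirichlet_kernel_images(t, 0.0, y, a), dirichlet_kernel_images(t, a, y, a))
```

For small t the image sum converges after a handful of terms while the sine series needs many modes; for large t the situation is reversed.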
The same sort of technique works for the wave equation on R × Ω,

(7.7)  \frac{\partial^2 u}{\partial t^2} - \Delta u = 0  on \mathbb{R} \times \Omega,

with Dirichlet boundary condition

(7.8)  u(t,x) = 0,  for x \in \partial\Omega,

and initial condition

(7.9)  u(0,x) = f(x),  u_t(0,x) = g(x)  on \Omega.

One takes odd extensions f^#, g^#, as above.

One can apply the method of images to regions other than rectangular solids.
It applies when Ω is a half-space, for example; in that case, only one reflection,
across the hyperplane ∂Ω, is involved. Similarly, one can treat slabs, bounded by
parallel hyperplanes, quadrants, and so on. One can also treat different boundary
conditions. If one extends f above to be even with respect to these reflections, one
obtains solutions with Neumann boundary condition satisfied on ∂Ω.

Another type of boundary condition to impose is a periodic boundary
condition:

(7.10)  u(t,x) = u(t, x + \gamma)  if x, x + \gamma \in \partial\Omega,\ \gamma \in \Gamma.

The solution to (7.2), (7.4), (7.10) is obtained as follows. Let f^0(x) = f(x) for
x ∈ Ω, let f^0(x) = 0 for x ∉ Ω, set

(7.11)  f^\flat(x) = \sum_{\gamma \in \Gamma} f^0(x + \gamma),

and let u^♭(t,x) be the solution to

(7.12)  \frac{\partial u^\flat}{\partial t} - \Delta u^\flat = 0  on \mathbb{R}^+ \times \mathbb{R}^n,  u^\flat(0,x) = f^\flat(x).

Note that if u^0(t,x) is defined by

(7.13)  \frac{\partial u^0}{\partial t} - \Delta u^0 = 0  on \mathbb{R}^+ \times \mathbb{R}^n,  u^0(0,x) = f^0(x),

then

(7.14)  u^\flat(t,x) = \sum_{\gamma \in \Gamma} u^0(t, x + \gamma).

In this case, u(t,x) is the restriction of u^♭(t,x) to R^+ × Ω.
Let us specialize to the case Γ = (2πZ)^n, f = δ. We have the fundamental
solution, satisfying periodic boundary conditions, given by

(7.15)  H(t,x) = (4\pi t)^{-n/2} \sum_{k \in \mathbb{Z}^n} e^{-|x + 2\pi k|^2/4t}.

On the other hand, identifying R^n/(2πZ)^n with T^n, we obtain via Fourier series

(7.16)  H(t,x) = (2\pi)^{-n} \sum_{\ell \in \mathbb{Z}^n} e^{-|\ell|^2 t + i\ell\cdot x}.

Comparing these formulas gives the following important case of Poisson’s
summation formula:

(7.17)  \sum_{k \in \mathbb{Z}^n} e^{-|x + 2\pi k|^2/4t} = \left( \frac{t}{\pi} \right)^{n/2} \sum_{\ell \in \mathbb{Z}^n} e^{i\ell\cdot x - |\ell|^2 t}.
We now show how the special case of this for n = 1 implies the famous
functional equation for the Riemann zeta function. With n = 1, x = 0, and
t = π/τ, (7.17) yields the identity

(7.18)  \sum_{n=-\infty}^\infty e^{-\pi n^2 \tau} = \tau^{-1/2} \sum_{n=-\infty}^\infty e^{-\pi n^2/\tau}.

In other words, with g_1(τ) denoting the left side of (7.18), we have g_1(τ) =
τ^{−1/2} g_1(1/τ). This is a transformation formula of Jacobi. It follows that if

(7.19)  g(t) = \sum_{n=1}^\infty e^{-\pi n^2 t},

then

(7.20)  g(t) = -\frac12 + \frac12 t^{-1/2} + t^{-1/2} g\left( \frac{1}{t} \right).
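
Both the n = 1 case of Poisson's summation formula (7.17) and the Jacobi relation (7.20) are easy to confirm numerically, since all the sums involved converge extremely fast. The Python sketch below is a minimal illustration; the function names and truncation lengths are assumptions made here, and the right side of (7.17) is written in real form using e^{iℓx} + e^{−iℓx} = 2 cos ℓx.

```python
import math


def theta_left(x, t, terms=50):
    """Left side of (7.17) for n = 1: sum over k of exp(-(x + 2 pi k)^2 / (4 t))."""
    return sum(
        math.exp(-(x + 2.0 * math.pi * k) ** 2 / (4.0 * t))
        for k in range(-terms, terms + 1)
    )


def theta_right(x, t, terms=50):
    """Right side of (7.17) for n = 1: (t/pi)^(1/2) sum over l of exp(i l x - l^2 t)."""
    total = 1.0
    for ell in range(1, terms + 1):
        total += 2.0 * math.cos(ell * x) * math.exp(-ell * ell * t)
    return math.sqrt(t / math.pi) * total


def g(t, terms=200):
    """g(t) = sum over n >= 1 of exp(-pi n^2 t), as in (7.19)."""
    return sum(math.exp(-math.pi * n * n * t) for n in range(1, terms + 1))


if __name__ == "__main__":
    print(theta_left(0.4, 0.3), theta_right(0.4, 0.3))
    t = 0.5
    print(g(t), -0.5 + 0.5 * t ** -0.5 + t ** -0.5 * g(1.0 / t))
```

The relation (7.20) is what allows g(t) for small t to be computed from the rapidly convergent values of g at 1/t.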


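Now (7.19) is related to the Riemann zeta function

(7.21)  \zeta(s) = \sum_{n=1}^\infty n^{-s}  (Re s > 1)

via the Mellin transform, discussed briefly in Appendix A, at the end of this
chapter. Indeed, we have

(7.22)  \int_0^\infty g(t)\, t^{s-1}\, dt = \sum_{n=1}^\infty \int_0^\infty e^{-n^2 \pi t}\, t^{s-1}\, dt
        = \sum_{n=1}^\infty n^{-2s} \pi^{-s} \int_0^\infty e^{-t}\, t^{s-1}\, dt
        = \zeta(2s)\, \pi^{-s}\, \Gamma(s).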


Consequently, for Re s > 1,

(7.23)  \Gamma\left( \frac{s}{2} \right) \pi^{-s/2} \zeta(s) = \int_0^\infty g(t)\, t^{s/2-1}\, dt
        = \int_0^1 g(t)\, t^{s/2-1}\, dt + \int_1^\infty g(t)\, t^{s/2-1}\, dt.

Now, into the integral over [0,1], substitute the right side of (7.20) for g(t), to
obtain

(7.24)  \Gamma\left( \frac{s}{2} \right) \pi^{-s/2} \zeta(s) = \int_0^1 \left( -\frac12 + \frac12 t^{-1/2} \right) t^{s/2-1}\, dt
        + \int_0^1 g(t^{-1})\, t^{s/2-3/2}\, dt + \int_1^\infty g(t)\, t^{s/2-1}\, dt.

We evaluate the first integral on the right, and replace t by 1/t in the second
integral, to obtain, for Re s > 1,

(7.25)  \Gamma\left( \frac{s}{2} \right) \pi^{-s/2} \zeta(s) = \frac{1}{s - 1} - \frac{1}{s} + \int_1^\infty \left[ t^{s/2} + t^{(1-s)/2} \right] g(t)\, t^{-1}\, dt.


Note that g(t) ≤ Ce^{−πt} for t ∈ [1, ∞), so the integral on the right is
an entire analytic function of s. Since 1/Γ(s/2) is entire, with simple zeros at
s = 0, −2, −4, ..., as shown in Appendix A at the end of this chapter, this implies
that ζ(s) is continued as a meromorphic function on C, with one simple pole, at
s = 1. The punch line is this: The right side of (7.25) is invariant under replacing
s by 1 − s. Thus we have Riemann’s functional equation

(7.26)  \Gamma\left( \frac{s}{2} \right) \pi^{-s/2} \zeta(s) = \Gamma\left( \frac{1 - s}{2} \right) \pi^{-(1-s)/2} \zeta(1 - s).
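
As an illustration of how (7.26) determines ζ at points outside the half-plane Re s > 1, the sketch below computes ζ(s) for real s > 1 from a partial sum with a few Euler-Maclaurin tail terms and then solves (7.26) for ζ(1 − s). The tail correction and the function names are choices made for this example and are not taken from the text. For s = 2 and s = 4 the results can be compared with the classical values ζ(−1) = −1/12 and ζ(−3) = 1/120.

```python
import math


def zeta(s, cutoff=1000):
    """zeta(s) for real s > 1: partial sum plus Euler-Maclaurin tail terms."""
    head = sum(n ** (-s) for n in range(1, cutoff))
    tail = cutoff ** (1.0 - s) / (s - 1.0) + 0.5 * cutoff ** (-s)
    tail += s * cutoff ** (-s - 1.0) / 12.0
    return head + tail


def xi_left(s):
    """Left side of (7.26): Gamma(s/2) pi^(-s/2) zeta(s), for real s > 1."""
    return math.gamma(s / 2.0) * math.pi ** (-s / 2.0) * zeta(s)


def zeta_reflected(s):
    """zeta(1 - s), solved from (7.26).

    Requires (1 - s)/2 not to be 0, -1, -2, ..., where Gamma has poles.
    """
    denominator = math.gamma((1.0 - s) / 2.0) * math.pi ** (-(1.0 - s) / 2.0)
    return xi_left(s) / denominator


if __name__ == "__main__":
    print(zeta(2.0), math.pi ** 2 / 6.0)
    print(zeta_reflected(2.0), -1.0 / 12.0)
    print(zeta_reflected(4.0), 1.0 / 120.0)
```

At odd integers s = 3, 5, ... the factor Γ((1 − s)/2) has a pole, corresponding to the zeros of ζ at −2, −4, ...; the function above is not meant to be called there.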


The functional equation is often written in an alternative form, obtained by
multiplying both sides by Γ((1 + s)/2), and using the identities

(7.27)  \Gamma\left( \frac{1 - s}{2} \right) \Gamma\left( \frac{1 + s}{2} \right) = \frac{\pi}{\sin \frac12 \pi (1 - s)},
        \Gamma\left( \frac{s}{2} \right) \Gamma\left( \frac{s + 1}{2} \right) = 2^{1-s} \pi^{1/2}\, \Gamma(s),

which follow from (A.10) and (A.22). We obtain

(7.28)  \zeta(1 - s) = 2^{1-s} \pi^{-s} \left( \cos \frac{\pi s}{2} \right) \Gamma(s)\, \zeta(s).


Exercises


1. Apply the method of images to find the solution to the heat equation on a half-line: 


2. Similarly, treat the wave equation on a half-space:

u_{tt} - \Delta u = 0,  t \in \mathbb{R},\ x_1 > 0,
u(0,x) = f(x),  u_t(0,x) = g(x),  u(t,0,x') = 0,

where x = (x_1, x') ∈ R^n.

3. Given u ∈ S(R^n), define f ∈ C^∞(T^n) by f(x) = Σ_{ν∈Z^n} u(x + 2πν). Show that,
for ℓ ∈ Z^n, we have f̂(ℓ) = (2π)^{−n/2} û(ℓ) and hence

(7.29)  \sum_k u(x + 2\pi k) = (2\pi)^{-n/2} \sum_\ell \hat u(\ell)\, e^{i\ell\cdot x}.

Show that this generalizes the identity (7.17).

4. Let (a_ℓ) be polynomially bounded, and consider v = Σ_{ℓ∈Z^n} a_ℓ e^{iℓ·x}, pictured as
a 2πZ^n-periodic (tempered) distribution on R^n rather than as a distribution on T^n;
v ∈ S'(R^n). Show that

(7.30)  \hat v = (2\pi)^{n/2} \sum_{\ell \in \mathbb{Z}^n} a_\ell\, \delta_\ell \in \mathcal{S}'(\mathbb{R}^n).

Relate this to the result in Exercise 3.

5. Show that ζ(s) satisfies the identity

\zeta(s) = \prod_p \left( 1 - p^{-s} \right)^{-1},  Re s > 1,

the product taken over all the primes. This is known as the Euler product formula.


8. Homogeneous distributions and principal value 
distributions 


Recall from §4 that the fundamental solution of the Laplace operator Δ on R^n is
c_n r^{2−n} (if n ≥ 3), which is homogeneous. It is useful to consider homogeneous
distributions in general. The notion of homogeneity is determined by the action
of the group of dilations,

(8.1)  D(t) f(x) = f(tx),  t > 0.

Note that D(t) : S(R^n) → S(R^n). Also, if f, g ∈ S(R^n), a change of variable
gives

(8.2)  \int g(x)\, D(t) f(x)\, dx = t^{-n} \int f(x)\, D(t^{-1}) g(x)\, dx.


Thus we can define

(8.3)  D(t) : \mathcal{S}'(\mathbb{R}^n) \to \mathcal{S}'(\mathbb{R}^n)

by

(8.4)  \langle f, D(t) u \rangle = t^{-n} \langle D(t^{-1}) f, u \rangle

for f ∈ S(R^n), u ∈ S'(R^n). We say that u ∈ S'(R^n) is homogeneous of degree
m if

(8.5)  D(t) u = t^m u,  for all t > 0.

Here, m can be any complex number. Let us denote the space of elements of
S'(R^n) which are homogeneous of degree m by H_m(R^n). It is easy to see that if
ℱ is the Fourier transform, then

(8.6)  \mathcal{F} D(t) = t^{-n} D(t^{-1}) \mathcal{F},

so

(8.7)  \mathcal{F} : \mathcal{H}_m(\mathbb{R}^n) \to \mathcal{H}_{-m-n}(\mathbb{R}^n).


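Before we delve any further into H_m(R^n), we should aver that one’s real
interest is in elements of H_m(R^n) which are smooth outside the origin, so we
consider

(8.8)  \mathcal{H}_m^\#(\mathbb{R}^n) = \{ u \in \mathcal{H}_m(\mathbb{R}^n) : u \in C^\infty(\mathbb{R}^n \setminus 0) \}.

It is easy to see that

(8.9)  u \in \mathcal{H}_m^\#(\mathbb{R}^n) \Longrightarrow D^\alpha u \in \mathcal{H}_{m-|\alpha|}^\#(\mathbb{R}^n)  and  x^\alpha u \in \mathcal{H}_{m+|\alpha|}^\#(\mathbb{R}^n).

We claim (8.7) can be strengthened as follows.


Proposition 8.1. We have

(8.10)  \mathcal{F} : \mathcal{H}_m^\#(\mathbb{R}^n) \to \mathcal{H}_{-m-n}^\#(\mathbb{R}^n).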

The only point left to prove is that if u ∈ H_m^#(R^n), then û is smooth on R^n \ 0.
Taking φ ∈ C_0^∞(R^n), φ(x) = 1 for |x| < 1, we can write u = φu + (1 − φ)u
= u_1 + u_2 with u_1 ∈ E'(R^n) and u_2 ∈ C^∞(R^n), homogeneous for |x| large.
We know that û_1 ∈ C^∞(R^n), so it suffices to show that û_2 ∈ C^∞(R^n \ 0). This
is a special case of the following important result.
For m ∈ R, we define the class S_1^m(R^n) of C^∞-functions by

(8.11)  p \in S_1^m(\mathbb{R}^n) \Longleftrightarrow |D_x^\alpha p(x)| \le C_\alpha \langle x \rangle^{m-|\alpha|},  for all \alpha \ge 0.

Clearly, S_1^m(R^n) ⊂ S'(R^n). It is also clear that u_2 ∈ S_1^{Re m}(R^n), so the proof of
Proposition 8.1 is finished once we establish the following.


Proposition 8.2. If p ∈ S_1^m(R^n), then p̂ ∈ C^∞(R^n \ 0). Also, if φ ∈ C_0^∞(R^n),
and φ(x) = 1 for |x| < a (a > 0), then (1 − φ)p̂ ∈ S(R^n).


Proof. We will show that if |β| is large, then x^β p̂ is bounded and continuous, and
so are lots of derivatives, which will suffice. Clearly,

\mathcal{F} : S_1^\mu(\mathbb{R}^n) \to L^\infty(\mathbb{R}^n) \cap C(\mathbb{R}^n),  for \mu < -n.

Now, given p ∈ S_1^m(R^n), then D_x^β p ∈ S_1^{m−|β|}(R^n), so

x^\beta \hat p = \mathcal{F}(D^\beta p) \in L^\infty \cap C,  for |\beta| > m + n,

and more generally x^α D^β p ∈ S_1^{m−|β|+|α|}(R^n), so

(8.12)  D^\alpha (x^\beta \hat p) = \mathcal{F}(x^\alpha D^\beta p) \in L^\infty \cap C,  for |\beta| > m + n + |\alpha|.

This proves Proposition 8.2.


Generally, there is going to be a singularity at the origin for an element of
H_m^#(R^n). In fact, there is the following result, whose proof we leave as an
exercise.


Proposition 8.3. If there is a nonzero u ∈ H_m^#(R^n) ∩ C^∞(R^n), then m is a
nonnegative integer and u is a homogeneous polynomial.


Let us consider other examples of homogeneous distributions. It is easy to see
from the definition (8.4) of the action of D(t) that

(8.13)  \delta \in \mathcal{H}_{-n}^\#(\mathbb{R}^n).

Of course, δ is zero on R^n \ 0! Since ℱδ = (2π)^{−n/2} ∈ H_0^#(R^n), this result is
consistent with Proposition 8.1. For more examples, choose any

(8.14)  w \in C^\infty(S^{n-1}),

and consider, for any m ∈ C,

(8.15)  u_m(x) = |x|^m\, w(|x|^{-1} x),  x \in \mathbb{R}^n \setminus 0.

If Re m > −n, then u_m ∈ L^1_loc(R^n), so it defines in a natural manner an element
of S'(R^n), which belongs to H_m^#(R^n). Thus

(8.16)  D^\alpha u_m \in \mathcal{H}_{m-|\alpha|}^\#(\mathbb{R}^n)  (Re m > -n).

If Re m ≤ −n, then u_m ∉ L^1_loc(R^n). In the borderline case m = −n, it is
significant that there is a natural identification of u_m with an element of S'(R^n),
under the further condition that

(8.17)  \int_{S^{n-1}} w(x)\, dS = 0.

The element of S'(R^n) is called a principal value distribution and is denoted
PV u_{−n}. We establish this as follows. Pick any radial φ ∈ S(R^n) such that φ(0) =
1, such as φ(x) = e^{−|x|²}. Then, for any v ∈ S(R^n), with u_{−n} as in (8.15),
u_{−n}(x)[v(x) − v(0)φ(x)] belongs to L^1(R^n), so we can define

(8.18)  \langle v, PV\, u_{-n} \rangle = \int_{\mathbb{R}^n} u_{-n}(x) \left[ v(x) - v(0)\varphi(x) \right] dx.


Note that (8.17) is precisely what is required to guarantee that the right side of
(8.18) is independent of the choice of φ (satisfying the conditions given above).
Thus we can write, for any t > 0,

(8.19)  \langle D(t) v, PV\, u_{-n} \rangle = \int_{\mathbb{R}^n} u_{-n}(x) \left[ v(tx) - v(0)\varphi(tx) \right] dx
        = t^{-n} \int_{\mathbb{R}^n} u_{-n}(x/t) \left[ v(x) - v(0)\varphi(x) \right] dx
        = \langle v, PV\, u_{-n} \rangle.

In light of (8.4), this implies

(8.20)  PV\, u_{-n} \in \mathcal{H}_{-n}^\#(\mathbb{R}^n),

provided (8.17) holds. By Proposition 8.1, we have

(8.21)  \mathcal{F}(PV\, u_{-n}) \in \mathcal{H}_0^\#(\mathbb{R}^n).

In particular, this Fourier transform is bounded. Consequently, the convolution
operator

(8.22)  T v = (PV\, u_{-n}) * v,

a priori taking S(R^n) to S'(R^n), has the property that

(8.23)  T : L^2(\mathbb{R}^n) \to L^2(\mathbb{R}^n).

Continuity properties of such a convolution operator on L^p(R^n), for 1 < p < ∞,
will be demonstrated in Chap. 13.
The special one-dimensional case of a principal value distribution has been 


discussed in §4. In analogy with (4.34), we have the following. 


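Proposition 8.4. Under the hypothesis (8.17), we have, for v ∈ S(R^n),

(8.24)  \langle v, PV\, u_{-n} \rangle = \lim_{\varepsilon \to 0} \int_{\mathbb{R}^n \setminus B_\varepsilon} v(x)\, u_{-n}(x)\, dx,

where B_ε = {x ∈ R^n : |x| < ε}.


Proof. Since u_{−n}(x)[v(x) − v(0)φ(x)] ∈ L^1(R^n), via (8.18), we have

\langle v, PV\, u_{-n} \rangle = \lim_{\varepsilon \to 0} \int_{\mathbb{R}^n \setminus B_\varepsilon} u_{-n}(x) \left[ v(x) - v(0)\varphi(x) \right] dx,

so (8.24) follows from the observation that if (8.17) holds, then

\int_{\mathbb{R}^n \setminus B_\varepsilon} u_{-n}(x)\, \varphi(x)\, dx = 0,  for all \varepsilon > 0,

for any radial φ ∈ S(R^n).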


In general, if u(x) has the form (8.15) with m = −n, then u is a sum of a
term to which (8.17) applies and a constant times r^{−n}. Now one can still define a
distribution in S'(R^n), equal to r^{−n} on R^n \ 0, by the prescription

(8.25)  \langle v, E_\varphi r^{-n} \rangle = \int_{\mathbb{R}^n} r^{-n} \left[ v(x) - v(0)\varphi(x) \right] dx,

for any given radial φ ∈ S(R^n) satisfying φ(0) = 1. This time, E_φ r^{−n} ∈ S'(R^n)
depends on the choice of φ. One has

(8.26)  E_\varphi r^{-n} - E_\psi r^{-n} = \left( \int \left[ \psi(x) - \varphi(x) \right] r^{-n}\, dx \right) \delta.


Also, $E_\varphi r^{-n}$ is not homogeneous. Instead, one has

(8.27) $D(t)(E_\varphi r^{-n}) = t^{-n} E_{D(t)\varphi}\, r^{-n},$

and by (8.26) this yields, after a brief calculation,

(8.28) $D(t)(E_\varphi r^{-n}) = t^{-n} E_\varphi r^{-n} + A_{n-1}\, t^{-n} (\log t)\,\delta,$

where $A_{n-1} = \operatorname{vol}(S^{n-1})$. Since $\mathcal{F} D(t) = t^{-n} D(1/t)\mathcal{F}$ and $\mathcal{F}\delta = (2\pi)^{-n/2}$, this implies for the Fourier transform of $E_\varphi r^{-n}$ that

(8.29) $D(t)\mathcal{F}(E_\varphi r^{-n}) = \mathcal{F}(E_\varphi r^{-n}) - (2\pi)^{-n/2} A_{n-1} \log t,$

which, in view of rotational invariance, implies that

(8.30) $\mathcal{F}(E_\varphi r^{-n})(\xi) = -(2\pi)^{-n/2} A_{n-1} \log|\xi| + B,$

where $B$ is a constant, depending in an affine manner on $\varphi$. A "canonical" choice
of $E_\varphi r^{-n}$ would be one for which $B = 0$; such a distribution $E_\varphi r^{-n} \in \mathcal{S}'(\mathbb{R}^n)$
is denoted $PF\, r^{-n}$ (for "finite part"); we have

(8.31) $\mathcal{F}(PF\, r^{-n})(\xi) = -(2\pi)^{-n/2} A_{n-1} \log|\xi|.$

Note that this is consistent with (4.59) when $n = 2$.
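
The "brief calculation" leading to (8.28) can be sketched as follows. This is one possible route, assuming only that $\varphi$ is radial, belongs to $\mathcal{S}(\mathbb{R}^n)$, and satisfies $\varphi(0) = 1$; it uses the scaling rule $D(t)(E_\varphi r^{-n}) = t^{-n}E_{D(t)\varphi}r^{-n}$ together with the difference formula (8.26) and the classical Frullani integral.

```latex
\begin{aligned}
D(t)(E_\varphi r^{-n}) - t^{-n} E_\varphi r^{-n}
  &= t^{-n}\bigl(E_{D(t)\varphi}\, r^{-n} - E_\varphi r^{-n}\bigr) \\
  &= t^{-n}\Bigl(\int_{\mathbb{R}^n} [\varphi(x) - \varphi(tx)]\, r^{-n}\, dx\Bigr)\,\delta \\
  &= t^{-n} A_{n-1}\Bigl(\int_0^\infty \frac{\varphi(r) - \varphi(tr)}{r}\, dr\Bigr)\,\delta \\
  &= A_{n-1}\, t^{-n} (\log t)\,\delta ,
\end{aligned}
\qquad\text{since}\qquad
\int_0^\infty \frac{\varphi(r) - \varphi(tr)}{r}\, dr = \varphi(0)\,\log t .
```

The third line passes to polar coordinates, where the factor $r^{-n}$ cancels against $r^{n-1}\,dr$ up to $dr/r$; the Frullani identity applies because $\varphi$ is continuous at $0$ and decays at infinity.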

It turns out that $r^m$, which is holomorphic in $\{m \in \mathbb{C} : \operatorname{Re} m > -n\}$,
with values in $\mathcal{S}'(\mathbb{R}^n)$, has a meromorphic continuation. This can be perceived as
follows. First note that if $-n < \operatorname{Re} m < 0$, then both $r^m$ and $r^{-n-m}$ belong to
$L^1_{\mathrm{loc}}(\mathbb{R}^n)$, so from Proposition 8.1 and rotational invariance we deduce that

(8.32) $\mathcal{F}(r^m) = c(m)\, r^{-m-n}.$

In fact,

(8.33) $\mathcal{F}(r^m) = 2^{m+n/2}\, \dfrac{\Gamma\big(\tfrac{1}{2}(m+n)\big)}{\Gamma\big(-\tfrac{1}{2}m\big)}\, r^{-m-n},$

for $-n < \operatorname{Re} m < 0$. This can be deduced from (8.32) and Parseval's identity,
which gives

(8.34) $\langle u, r^m\rangle = c(m)\,\langle \hat{u}, r^{-m-n}\rangle.$

If we plug in $u(x) = e^{-|x|^2/2} = \hat{u}(x)$, both sides of (8.34) can be evaluated by
integrating in polar coordinates. The left side is

(8.35) $A_{n-1}\int_0^\infty r^{m+n-1} e^{-r^2/2}\,dr = 2^{(m+n)/2-1} A_{n-1}\int_0^\infty s^{(m+n)/2-1} e^{-s}\,ds$
$\qquad = 2^{(m+n)/2-1}\, \Gamma\big(\tfrac{1}{2}(m+n)\big)\, A_{n-1},$

and the right side of (8.34) is similarly evaluated, giving (8.33).

Now the left side of (8.33) extends to be holomorphic in $\operatorname{Re} m > -n$, with
values in $\mathcal{S}'(\mathbb{R}^n)$, while the right side extends to be meromorphic in $\operatorname{Re} m < 0$,
with poles at $m = -n, -n-2, -n-4, \dots$, due to the factor $\Gamma((m+n)/2)$.
Thus we have the desired meromorphic continuation. With $r^m$ so defined,

(8.36) $r^m \in \mathcal{H}^\#_m(\mathbb{R}^n), \quad m \ne -n, -n-2, -n-4, \dots;$

indeed, $D(t)r^m - t^m r^m$ is a meromorphic function of $m$ which vanishes on a
nonempty open set. As we have seen, $PF\, r^{-n}$ can be defined by a "renormalization,"
though it does not belong to $\mathcal{H}^\#_{-n}(\mathbb{R}^n)$.

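Let us now consider the possibility of extending $u_m$, of the form (8.15), to an
element of $\mathcal{S}'(\mathbb{R}^n)$, in case

(8.37) $m = -n - j, \quad j = 1, 2, 3, \dots.$

In analogy with (8.18) and (8.25), we can define $E_{j,\varphi} u_m$ in this case by

(8.38) $\langle v, E_{j,\varphi} u_m\rangle = \int_{\mathbb{R}^n} u_m(x)\Big[v(x) - \sum_{|\alpha| \le j} \frac{v^{(\alpha)}(0)}{\alpha!}\, x^\alpha \varphi(x)\Big]\,dx,$

provided $\varphi \in \mathcal{S}(\mathbb{R}^n)$ is a radial function such that $\varphi(0) = 1$ and $1 - \varphi$ vanishes
to order at least $j$ at 0; for example, we could require $\varphi(x) = 1$ for $|x| < c$. The
dependence on $\varphi$ is given by

(8.39) $E_{j,\varphi} u_m - E_{j,\psi} u_m = \sum_{|\alpha| \le j} \beta_\alpha(\varphi - \psi)\, \delta^{(\alpha)},$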


where

(8.40) $\beta_\alpha(\varphi - \psi) = \dfrac{(-1)^{|\alpha|}}{\alpha!} \int_{\mathbb{R}^n} x^\alpha\, [\psi(x) - \varphi(x)]\, u_m(x)\,dx.$


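In analogy with (8.27), we have

(8.41) $D(t) E_{j,\varphi} u_m = t^m E_{j,D(t)\varphi}\, u_m,$

and hence, given (8.37), by a calculation similar to that establishing (8.28),

(8.42) $D(t)(E_{j,\varphi} u_m) = t^m E_{j,\varphi} u_m + t^m \sum_{|\alpha| < j} \gamma_\alpha\, (t^{j-|\alpha|} - 1)\, \delta^{(\alpha)} + t^m \log t \sum_{|\alpha| = j} \gamma_\alpha\, \delta^{(\alpha)},$

for certain constants $\gamma_\alpha$, which depend in an affine fashion on $\varphi$. Consequently, if
we set

(8.43) $E u_m = E_{j,\varphi} u_m - \sum_{|\alpha| < j} \gamma_\alpha\, \delta^{(\alpha)},$

we have another element of $\mathcal{S}'(\mathbb{R}^n)$ which agrees with $u_m$ on $\mathbb{R}^n \setminus 0$, and

(8.44) $D(t)(E u_m) = t^m E u_m + t^m (\log t) \sum_{|\alpha| = j} \gamma_\alpha\, \delta^{(\alpha)}.$

It follows for the Fourier transform $\mathcal{F}(E u_m)$ that

$D(t)\mathcal{F}(E u_m) = t^j \mathcal{F}(E u_m) + t^j (\log t) \sum_{|\alpha| = j} \gamma'_\alpha\, \xi^\alpha.$

Consequently, if $\mathcal{F}(E u_m) = w(\xi)$ for $|\xi| = 1$, we have

$\mathcal{F}(E u_m)(t\xi) = t^j w(\xi) + (\log t) \sum_{|\alpha| = j} \gamma'_\alpha\, (t\xi)^\alpha, \quad \text{for } |\xi| = 1,$

and hence

(8.45) $\mathcal{F}(E u_m)(\xi) = w_j(\xi) + p_j(\xi) \log|\xi|,$

where

(8.46) $w_j \in \mathcal{H}^\#_j(\mathbb{R}^n)$ and $p_j$ is a homogeneous polynomial, of degree $j$.

We leave it as an exercise to the reader to construct a similar extension of $u_m$
to an element of $\mathcal{S}'(\mathbb{R}^n)$, when $\operatorname{Re} m < -n$ and $m$ is not an integer. In such a
case one can produce an element of $\mathcal{H}^\#_m(\mathbb{R}^n)$; log terms do not arise.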


Exercises 
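1. More generally than $S^m_1(\mathbb{R}^n)$, for $0 < \rho \le 1$, define $S^m_\rho(\mathbb{R}^n)$ by

$p \in S^m_\rho(\mathbb{R}^n) \iff |D^\alpha p(x)| \le C_\alpha \langle x\rangle^{m - \rho|\alpha|}, \quad \text{for all } \alpha \ge 0.$

Show that $\hat{p} \in C^\infty(\mathbb{R}^n \setminus 0)$ in this case, as in Proposition 8.2.
2. Define $p \in C^\infty(\mathbb{R}^n \setminus 0)$ by $p(\xi) = (i\xi_1 + |\xi'|^2)^{-1}$, $\xi = (\xi_1, \xi_2, \dots, \xi_n) = (\xi_1, \xi')$.
Show that $p(\xi)$ agrees outside any neighborhood of the origin with a member of
$S^{-1}_{1/2}(\mathbb{R}^n)$.
3. Prove Proposition 8.3.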
4. If $-n < \operatorname{Re} m < 0$ and $u_m$ is of the form (8.15), then $u_m$ and $\hat{u}_m$ belong to $L^1_{\mathrm{loc}}(\mathbb{R}^n)$,
with $u_m \in \mathcal{H}^\#_m$, $\hat{u}_m \in \mathcal{H}^\#_{-m-n}$. Hence

$\hat{u}_m(x) = |x|^{-m-n}\, W_m(|x|^{-1}x), \quad W_m \in C^\infty(S^{n-1}).$

Study the transformation $w \mapsto W_m$. Use this to produce a meromorphic continuation
of $u_m$.

5. Study the residue of the meromorphic distribution-valued function $r^z$ at $z = -n$, and
relate this to the failure of $PF\, r^{-n}$ to be homogeneous.

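6. In case $n = 1$ and $m = -s$, the formula (8.33) says

$\mathcal{F}(r^{-s}) = 2^{1/2-s}\, \dfrac{\Gamma\big(\frac{1-s}{2}\big)}{\Gamma\big(\frac{s}{2}\big)}\, r^{s-1}, \quad \text{for } 0 < \operatorname{Re} s < 1,$

while Riemann's functional equation (7.26) can be written

$\dfrac{\zeta(s)}{\zeta(1-s)} = \pi^{s-1/2}\, \dfrac{\Gamma\big(\frac{1-s}{2}\big)}{\Gamma\big(\frac{s}{2}\big)}.$

Is this a coincidence? (See [Pat], Chap. 2.) Note that these formulas yield

$\dfrac{(2\pi)^{s/2}}{\zeta(s)}\, \mathcal{F}(r^{-s}) = \dfrac{(2\pi)^{(1-s)/2}}{\zeta(1-s)}\, r^{s-1}, \quad 0 < \operatorname{Re} s < 1.$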


9. Elliptic operators 
A partial differential operator $P(D)$ of order $m$,

(9.1) $P(D) = \sum_{|\alpha| \le m} a_\alpha D^\alpha,$

is said to be elliptic provided

(9.2) $|P(\xi)| \ge C|\xi|^m, \quad \text{for } |\xi| \text{ large}.$

Here $P(\xi) = \sum a_\alpha \xi^\alpha$. The paradigm example is the Laplace operator
$\Delta = P(D)$, with $P(\xi) = -|\xi|^2$, which is elliptic of order 2. In this section
we consider some important properties of solutions to

(9.3) $P(D)u = f$

when $P(D)$ is elliptic.
The hypothesis (9.2) implies the following. If (9.2) holds for $|\xi| \ge C_1$, and if
$\psi \in C_0^\infty(\mathbb{R}^n)$ is equal to 1 for $|\xi| \le C_1$, then

(9.4) $q(\xi) = (1 - \psi(\xi))\, P(\xi)^{-1} \in S^{-m}_1(\mathbb{R}^n),$
where $S^m_1(\mathbb{R}^n)$ is the space defined by (8.11); we call it a space of "symbols."
Now consider

(9.5) $E = (2\pi)^{-n/2}\, \tilde{q} \in \mathcal{S}'(\mathbb{R}^n).$

By Proposition 8.2, we know that $E$ is smooth on $\mathbb{R}^n \setminus 0$ and rapidly decreasing
as $|x| \to \infty$. If we set

(9.6) $v = P(D)E,$

then

(9.7) $\hat{v}(\xi) = (2\pi)^{-n/2}\, (1 - \psi(\xi)).$

In other words,

(9.8) $P(D)E = \delta + w,$

with

(9.9) $w = -(2\pi)^{-n/2}\, \tilde{\psi} \in \mathcal{S}(\mathbb{R}^n).$


We say $E$ is a parametrix for $P(D)$. It is almost as useful as a fundamental
solution, for some qualitative purposes. For example, it enables us to say a great
deal about the singular support of a solution $u$ to (9.3), given $f \in \mathcal{D}'(\mathbb{R}^n)$. The
singular support of a general distribution $u \in \mathcal{D}'(\mathbb{R}^n)$ is defined as follows. Let
$\Omega \subset \mathbb{R}^n$ be open. We say $u$ is smooth on $\Omega$ if there exists $v \in C^\infty(\Omega)$ such that
$u = v$ on $\Omega$. The smallest set $K$ for which $u$ is smooth on $\mathbb{R}^n \setminus K$ is the singular
support of $u$, denoted

sing supp $u$.

For example, sing supp $\delta = \{0\}$; also sing supp $|x|^{2-n} = \{0\}$, if $n \ne 2$. Now,
suppose that $u \in \mathcal{E}'(\mathbb{R}^n)$ and (9.3) holds. Then

(9.10) $E * f = E * P(D)u = (P(D)E) * u = u + w * u$

and, of course,

$w * u \in C^\infty(\mathbb{R}^n).$

On the other hand, it is easy to see that, for any $f \in \mathcal{E}'(\mathbb{R}^n)$,

(9.11) sing supp $E * f \subset$ sing supp $f$,

provided sing supp $E \subset \{0\}$. More generally, for any $f_1, f_2 \in \mathcal{E}'(\mathbb{R}^n)$, if sing
supp $f_j \subset K_j$, then

(9.12) sing supp $f_1 * f_2 \subset K_1 + K_2$,

a result we leave as an exercise.

Noting that we can multiply distributions by cut-offs $\chi \in C_0^\infty(\mathbb{R}^n)$, equal to
1 on an arbitrarily large set, we deduce the following result, known as elliptic
regularity.

Proposition 9.1. For any $u \in \mathcal{D}'(\mathbb{R}^n)$, if (9.3) holds with $P(D)$ elliptic, then

(9.13) sing supp $u$ = sing supp $f$.


Finally, we want to make a detailed analysis of the behavior of the singularity
at the origin of the parametrix $E$ for an elliptic operator $P(D)$. Since $E$ is given
by (9.4) and (9.5), with $P(\xi) = \sum_{|\alpha| \le m} a_\alpha \xi^\alpha$ a polynomial, it follows that, for
$|\xi|$ large,

(9.14) $q(\xi) \sim \sum_{j \ge 0} q_j(\xi),$

where each $q_j \in C^\infty(\mathbb{R}^n)$, and, for $|\xi| \ge C$, $q_j(\xi)$ is homogeneous in $\xi$ of degree
$-m - j$. The meaning of (9.14) is that, for any $N$,

(9.15) $q(\xi) - \sum_{j=0}^{N-1} q_j(\xi) = r_N(\xi) \in S^{-m-N}_1(\mathbb{R}^n).$

Consequently,

(9.16) $E \sim (2\pi)^{-n/2} \sum_{j \ge 0} \tilde{q}_j,$

in the sense that, for any $K$, one can take $N$ large enough that

(9.17) $E - (2\pi)^{-n/2} \sum_{j < N} \tilde{q}_j = (2\pi)^{-n/2}\, \tilde{r}_N \in C^K(\mathbb{R}^n).$

Now, we can replace each $q_j$ by $q_j^b \in C^\infty(\mathbb{R}^n \setminus 0)$, equal to $q_j$ for $|\xi|$ large, and
homogeneous of degree $-m - j$ on $\mathbb{R}^n \setminus 0$, and replace each $q_j^b$ by $q_j^\# \in \mathcal{S}'(\mathbb{R}^n)$,
equal to $q_j^b$ on $\mathbb{R}^n \setminus 0$, such that $q_j^\# \in \mathcal{H}^\#_{-m-j}$ if $m + j < n$, or in any event satisfying
the counterpart of (8.44)-(8.46). Note that, for each $j$, $q_j - q_j^\# \in \mathcal{E}'(\mathbb{R}^n)$, so
the Fourier transform of the difference belongs to $C^\infty(\mathbb{R}^n)$. We have established
the following.

Proposition 9.2. A parametrix $E$ for an elliptic operator $P(D)$ of order $m$
satisfies the condition that $E \in C^\infty(\mathbb{R}^n \setminus 0)$, and the singularity is given by

(9.18) $E \sim \sum_{\ell \ge 0} \big(E_\ell + p_\ell(x) \log|x|\big),$

where

(9.19) $E_\ell \in \mathcal{H}^\#_{m-n+\ell}(\mathbb{R}^n)$

and $p_\ell(x)$ is a polynomial homogeneous of degree $m - n + \ell$; these log coefficients
appear only for $\ell \ge n - m$.
More generally, this result holds for $E = (2\pi)^{-n/2}\, \tilde{q}$ whenever $q \in S^{-m}_1(\mathbb{R}^n)$ has an
expansion of the form (9.14), for any $m \in \mathbb{R}$, and log terms do not arise if $m$ is
not an integer.
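
As an illustration of Proposition 9.2, consider $P(D) = \Delta$, so that $m = 2$. The fundamental solutions below are quoted as standard facts rather than derived from the text above, with $A_{n-1}$ denoting the area of $S^{n-1}$. Any parametrix for $\Delta$ differs from them by a $C^\infty$ function, by elliptic regularity.

```latex
E(x) = -\frac{1}{(n-2)\,A_{n-1}}\, |x|^{2-n} \quad (n \ge 3),
\qquad
E(x) = \frac{1}{2\pi}\, \log|x| \quad (n = 2).
```

For $n \ge 3$ the singularity is the single term $E_0 \in \mathcal{H}^\#_{2-n}(\mathbb{R}^n)$, in agreement with (9.19) at $\ell = 0$, and no logarithm is needed. For $n = 2$ the logarithm appears at the first admissible index $\ell = n - m = 0$, with $p_0 = 1/2\pi$ a polynomial of degree $m - n + \ell = 0$.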


Exercises 


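1. Using Exercises 1 and 2 of §8, establish an analogue of the regularity result in Proposition 9.1
when $P(D)$ is the (nonelliptic) "heat operator":

$\dfrac{\partial}{\partial x_1} - \Big(\dfrac{\partial^2}{\partial x_2^2} + \cdots + \dfrac{\partial^2}{\partial x_n^2}\Big).$

2. Give a detailed proof of (9.12), in order to deduce (9.11).
(Hint: Use

$f \in \mathcal{E}'(\mathbb{R}^n),\ g \in C^\infty(\mathbb{R}^n) \Longrightarrow f * g \in C^\infty(\mathbb{R}^n).$

Break up $f_1$ and $f_2$ into pieces. For nonsmooth pieces, establish and use

supp $g_j \subset K_j \Longrightarrow$ supp $g_1 * g_2 \subset K_1 + K_2$.)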


10. Local solvability of constant-coefficient PDE 


In the previous sections we have mainly used Fourier analysis as a tool to provide 
explicit solutions to the classical linear PDEs. Here we use Fourier series to prove 
an existence theorem for solutions to a general constant-coefficient linear PDE 


(10.1) P(D)u= f. 


We show that, given any $f \in \mathcal{D}'(\mathbb{R}^n)$, and any $R < \infty$, there exists $u \in \mathcal{D}'(\mathbb{R}^n)$
solving (10.1) on the ball $|x| < R$. This result was originally established by
Malgrange and Ehrenpreis. If $f \in C^\infty(\mathbb{R}^n)$, we produce $u \in C^\infty(\mathbb{R}^n)$. We
do not produce a global solution, and other references, particularly [H] and [Tre], 
contain much more information on solutions to (10.1) than is presented here. Our 
method, due to Dadok and Taylor [DT], does have the advantage of being fairly 
straightforward and short. 
For any $\alpha \in \mathbb{R}^n$, solving (10.1) on $B_R = \{x \in \mathbb{R}^n : |x| < R\}$ is equivalent to
solving

(10.2) $P(D + \alpha)v = g,$

where $v = e^{-i\alpha\cdot x}u$ and $g = e^{-i\alpha\cdot x}f$. To solve (10.2) on $B_R$, we can cut off
$g$ to be supported on $B_{3R/2}$ and work on $\mathbb{R}^n/2R\mathbb{Z}^n$. Without loss of generality
(altering $P(D)$), we can rescale and suppose $R = \pi$, so $\mathbb{R}^n/2R\mathbb{Z}^n = \mathbb{T}^n$. The
following result then implies solvability on $B_R$.

Proposition 10.1. For almost every $\alpha \in A = \{(\alpha_1, \dots, \alpha_n) : 0 \le \alpha_\nu \le 1\}$,

(10.3) $P(D + \alpha) : \mathcal{D}'(\mathbb{T}^n) \to \mathcal{D}'(\mathbb{T}^n)$

is an isomorphism, as is $P(D + \alpha) : C^\infty(\mathbb{T}^n) \to C^\infty(\mathbb{T}^n)$.


In view of the characterizations of Fourier series of elements of $\mathcal{D}'(\mathbb{T}^n)$ and of
$C^\infty(\mathbb{T}^n)$, it suffices to establish the following.

Proposition 10.2. Let $P(\xi)$ be a polynomial of order $m$ on $\mathbb{R}^n$. For almost all
$\alpha \in A$, there are constants $C$, $N$ such that

(10.4) $|P(k + \alpha)^{-1}| \le C\langle k\rangle^N, \quad \text{for all } k \in \mathbb{Z}^n.$

We will prove this using the following elementary fact about the behavior of a
polynomial near its zero set.

Lemma 10.3. Let $P(\xi)$ be a polynomial of order $m$ on $\mathbb{R}^n$, not identically zero.
Then there exists $\delta > 0$ such that

(10.5) $|P(\xi)|^{-\delta} \in L^1_{\mathrm{loc}}(\mathbb{R}^n).$
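Before proving Lemma 10.3, we show how it yields (10.4). First, we claim
that, for any polynomial of order $m$ on $\mathbb{R}^n$, not identically zero, there exist $\delta > 0$
and $M$ such that

(10.6) $\int_{\mathbb{R}^n} |P(\xi)|^{-\delta} \langle\xi\rangle^{-M}\,d\xi < \infty.$

Indeed, Lemma 10.3 guarantees

$\int_{|\xi| \le 1} |P(\xi)|^{-\delta}\,d\xi < \infty,$

while, for $M$ sufficiently large, the inversion $\xi \mapsto |\xi|^{-2}\xi$ gives

$\int_{|\xi| \ge 1} |P(\xi)|^{-\delta} \langle\xi\rangle^{-M}\,d\xi \le \int_{|\xi| \le 1} \big|P(|\xi|^{-2}\xi)\big|^{-\delta}\,d\xi,$

and Lemma 10.3 also implies that, for $\delta > 0$ small enough,

$\big|P(|\xi|^{-2}\xi)\big|^{-\delta} \in L^1_{\mathrm{loc}}(\mathbb{R}^n).$

Now, using (10.6), note that

(10.7) $\int_A \sum_{k \in \mathbb{Z}^n} |P(k + \alpha)|^{-\delta} \langle k\rangle^{-M}\,d\alpha \le C \int_{\mathbb{R}^n} |P(\xi)|^{-\delta} \langle\xi\rangle^{-M}\,d\xi < \infty.$

Thus, for almost all $\alpha \in A$,

(10.8) $\sum_{k \in \mathbb{Z}^n} |P(k + \alpha)|^{-\delta} \langle k\rangle^{-M} < \infty,$

which immediately gives (10.4).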

We now prove the lemma for any $\delta < 1/m$. We must prove that $|P(\xi)|^{-\delta}$ is
integrable on any bounded subset of $\mathbb{R}^n$. Rotating coordinates, we can suppose
that $P(\xi_1, 0)$ is a polynomial of order exactly $m$:

(10.9) $P(\xi_1, 0) = a_m \xi_1^m + \cdots + a_0, \quad a_m \ne 0.$

It follows that, with $\xi' = (\xi_2, \dots, \xi_n)$,

(10.10) $P(\xi) = a_m \xi_1^m + \sum_{\ell=0}^{m-1} a_\ell(\xi')\, \xi_1^\ell,$

where $a_\ell(\xi')$ is a polynomial on $\mathbb{R}^{n-1}$ of order $\le m - \ell$. Consequently, we have

(10.11) $P(\xi) = a_m \prod_{j=1}^{m} \big(\xi_1 - \lambda_j(\xi')\big).$

Hence it is clear that, for any $C_1 < \infty$, there is a $C_2 < \infty$ such that if $\delta < 1/m$,

(10.12) $\int_{-C_1}^{C_1} |P(\xi)|^{-\delta}\,d\xi_1 \le C_2, \quad \text{for } |\xi'| \le C_1.$

This completes the proof.
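
One way to see why (10.12) is "clear," and why the threshold $\delta < 1/m$ is natural, is the following sketch. It is a possible argument rather than necessarily the one intended: it applies the generalized Hölder inequality to the $m$ factors in (10.11) and uses $|\xi_1 - \lambda| \ge |\xi_1 - \operatorname{Re}\lambda|$ for complex roots $\lambda$.

```latex
\begin{aligned}
\int_{-C_1}^{C_1} |P(\xi)|^{-\delta}\, d\xi_1
  &= |a_m|^{-\delta} \int_{-C_1}^{C_1} \prod_{j=1}^{m} |\xi_1 - \lambda_j(\xi')|^{-\delta}\, d\xi_1 \\
  &\le |a_m|^{-\delta} \prod_{j=1}^{m}
     \Bigl( \int_{-C_1}^{C_1} |\xi_1 - \lambda_j(\xi')|^{-m\delta}\, d\xi_1 \Bigr)^{1/m} \\
  &\le |a_m|^{-\delta} \int_{-C_1}^{C_1} |s|^{-m\delta}\, ds
   = |a_m|^{-\delta}\, \frac{2\, C_1^{\,1 - m\delta}}{1 - m\delta} .
\end{aligned}
```

The last integral is finite exactly when $m\delta < 1$, and the bound does not depend on $\xi'$. The model case $P(\xi) = \xi_1^m$ shows the restriction cannot be relaxed in general, since $\int_{-C_1}^{C_1} |\xi_1|^{-m\delta}\,d\xi_1$ diverges for $\delta \ge 1/m$.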


Exercises 


1. Consider the following boundary problem on $[0, A] \times \mathbb{T}^n$:

$u_{tt} - \Delta u = 0,$
$u(0, x) = f_1(x), \quad u(A, x) = f_2(x),$

where $f_j \in C^\infty(\mathbb{T}^n)$. Show that, for almost all $A \in \mathbb{R}^+$, this has a unique solution
$u \in C^\infty([0, A] \times \mathbb{T}^n)$, for all $f_j \in C^\infty(\mathbb{T}^n)$. Show that, for a dense set of $A$, this
solvability fails.


11. The discrete Fourier transform 


When doing numerical work involving Fourier series, it is convenient to
discretize, and replace $S^1$, pictured as the group of complex numbers of modulus
1, by the group $\Gamma_n$ generated by $\omega = e^{2\pi i/n}$. One can also approximate $\mathbb{T}^d$ by
$(\Gamma_n)^d$, a product of $d$ copies of $\Gamma_n$. We will restrict attention to the case $d = 1$
here; results for general $d$ are obtained similarly.

The cyclic group $\Gamma_n$ is isomorphic to the group $\mathbb{Z}_n = \mathbb{Z}/(n)$, but we will
observe a distinction between these two groups; an element of $\Gamma_n$ is a certain
complex number of modulus 1, and an element of $\mathbb{Z}_n$ is an equivalence class of
integers. For $n$ large, we think of $\Gamma_n$ as an approximation to $S^1$ and $\mathbb{Z}_n$ as an
approximation to $\mathbb{Z}$. We note the natural dual pairing $\Gamma_n \times \mathbb{Z}_n \to \mathbb{C}$ given by
$(\omega^j, \ell) \mapsto \omega^{j\ell}$, which is well defined since $\omega^{jn} = 1$.

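Now, given a function $f : \Gamma_n \to \mathbb{C}$, its discrete Fourier transform $f^\# = \Phi_n f$,
mapping $\mathbb{Z}_n$ to $\mathbb{C}$, is defined by

(11.1) $f^\#(\ell) = \dfrac{1}{n} \sum_{\omega^j \in \Gamma_n} f(\omega^j)\, \omega^{-j\ell}.$

Similarly, given a function $g : \mathbb{Z}_n \to \mathbb{C}$, its "inverse Fourier transform" $g^b : \Gamma_n \to \mathbb{C}$ is defined by

(11.2) $g^b(\omega^j) = \sum_{\ell \in \mathbb{Z}_n} g(\ell)\, \omega^{j\ell}.$

The following is the Fourier inversion formula in this context.

Proposition 11.1. The map

(11.3) $\Phi_n : L^2(\Gamma_n) \to L^2(\mathbb{Z}_n)$

is a unitary isomorphism, with inverse defined by (11.2), so

(11.4) $f(\omega^j) = \sum_{\ell \in \mathbb{Z}_n} f^\#(\ell)\, \omega^{j\ell}.$

Here the space $L^2(\mathbb{Z}_n)$ is defined by counting measure and $L^2(\Gamma_n)$ by $1/n$
times counting measure, that is,

(11.5) $(u, v)_{L^2(\Gamma_n)} = \dfrac{1}{n} \sum_{\omega^j \in \Gamma_n} u(\omega^j)\, \overline{v(\omega^j)}.$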
Note that if we define functions $e_j$ on $\Gamma_n$ by

(11.6) $e_j(\omega^k) = \omega^{jk},$

then Proposition 11.1 is equivalent to:

Proposition 11.2. The functions $e_j$, $1 \le j \le n$, form an orthonormal basis of
$L^2(\Gamma_n)$.

Proof. Since $L^2(\Gamma_n)$ has dimension $n$, we need only check that the $e_j$'s are
mutually orthogonal. Note that

$(e_j, e_\ell)_{L^2(\Gamma_n)} = \dfrac{1}{n} \sum_{\omega^k \in \Gamma_n} \omega^{mk}, \quad m = j - \ell.$

Denote the sum by $S_m$. If we multiply by $\omega^m$, we have a sum of the same set of
powers of $\omega$, so $S_m = \omega^m S_m$. Thus $S_m = 0$ whenever $\omega^m \ne 1$, which completes
the proof. Alternatively, the series is easily summed as a finite geometrical
series.
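
The inversion formula (11.4) and the unitarity in Proposition 11.1 are easy to check numerically. The following is a minimal sketch, not an efficient implementation: it evaluates (11.1) and (11.2) directly, representing a function on $\Gamma_n$ as the list of its values at $\omega^0, \dots, \omega^{n-1}$. The function names and the sample data are illustrative choices.

```python
import cmath

def dft(f_vals):
    """Discrete Fourier transform (11.1): f#(l) = (1/n) * sum_j f(w^j) * w^(-j*l)."""
    n = len(f_vals)
    w = cmath.exp(2j * cmath.pi / n)
    return [sum(f_vals[j] * w ** (-j * l) for j in range(n)) / n for l in range(n)]

def inverse_dft(g_vals):
    """Inverse transform (11.2): g^b(w^j) = sum_l g(l) * w^(j*l)."""
    n = len(g_vals)
    w = cmath.exp(2j * cmath.pi / n)
    return [sum(g_vals[l] * w ** (j * l) for l in range(n)) for j in range(n)]

# Arbitrary sample values of f on Gamma_6.
f = [1.0, 2.0 - 1.0j, 0.5j, -3.0, 4.0, 0.25]
f_sharp = dft(f)
recovered = inverse_dft(f_sharp)

# Inversion (11.4): applying (11.2) to f# should give back f.
inversion_error = max(abs(a - b) for a, b in zip(f, recovered))

# Unitarity: L^2(Gamma_n) uses 1/n times counting measure, L^2(Z_n) counting measure.
norm_gamma = sum(abs(x) ** 2 for x in f) / len(f)
norm_z = sum(abs(x) ** 2 for x in f_sharp)
```

Both `inversion_error` and `norm_gamma - norm_z` come out at the level of floating-point rounding. The cost is $O(n^2)$ operations, which is the count that the fast Fourier transform of §12 improves on.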


Note that the functions $e_j$ in (11.6) are the restrictions to $\Gamma_n$ of $e^{ij\theta}$ (i.e., values
at $\theta = 2\pi k/n$). These restrictions depend only on the residue class of $j$ mod $n$,
which leads to the following simple but fundamental connection between Fourier
series on $S^1$ and on $\Gamma_n$.

Proposition 11.3. If $f \in C(S^1)$ has absolutely convergent Fourier series, then

(11.7) $f^\#(\ell) = \sum_{j=-\infty}^{\infty} \hat{f}(\ell + jn).$
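
The aliasing identity (11.7) can be observed directly on a trigonometric polynomial, for which the Fourier coefficients are known exactly. In this sketch the coefficients in `coeffs` and the choice $n = 4$ are arbitrary illustrative values; the frequencies $1, 5, -3, 9$ all lie in the residue class of 1 mod 4, so their coefficients should add up in $f^\#(1)$.

```python
import cmath

n = 4
# Fourier coefficients f^(k) of a trigonometric polynomial f(theta) = sum_k c_k e^{ik theta}.
coeffs = {1: 2.0, 5: 0.5, -3: 1j, 9: -1.0, 2: 0.75}

def f(theta):
    return sum(c * cmath.exp(1j * k * theta) for k, c in coeffs.items())

# Sample f on Gamma_n (theta = 2*pi*j/n) and apply the definition (11.1).
samples = [f(2 * cmath.pi * j / n) for j in range(n)]
w = cmath.exp(2j * cmath.pi / n)
f_sharp = [sum(samples[j] * w ** (-j * l) for j in range(n)) / n for l in range(n)]

# Right side of (11.7): add up the coefficients in each residue class mod n.
aliased = [sum(c for k, c in coeffs.items() if k % n == l) for l in range(n)]

max_diff = max(abs(a - b) for a, b in zip(f_sharp, aliased))
```

Here `aliased[1]` equals `1.5 + 1j` and `aliased[2]` equals `0.75`, and `f_sharp` agrees with `aliased` up to rounding error.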


We will use (11.7) as a tool to see how well a function on $S^1$ is approximated
by discretization, involving restriction to $\Gamma_n$. Precisely, we consider the operators

(11.8) $R_n : C(S^1) \to L^2(\Gamma_n), \quad E_n : L^2(\Gamma_n) \to C^\infty(S^1)$

given by

(11.9) $(R_n f)(\omega^j) = f\Big(\dfrac{2\pi j}{n}\Big),$

for $f = f(\theta)$, $0 \le \theta \le 2\pi$, and

(11.10) $E_n\Big(\sum_{\ell \in \mathbb{Z}_n} g(\ell)\, \omega^{j\ell}\Big) = \sum_{\ell=-\nu}^{\nu-1} g(\ell)\, e^{i\ell\theta}, \quad n = 2\nu.$

We assume $n = 2\nu$ is even; one can also treat $n = 2\nu + 1$, changing the upper
limit in the last sum from $\nu - 1$ to $\nu$. Clearly, $R_n E_n$ is the identity operator on
$L^2(\Gamma_n)$. The question of interest to us is: How close is $E_n R_n f$ to $f$, a function on
$S^1$? The answer depends on smoothness properties of $f$ and is expressed in terms
involving (typically) negative powers of $n$.

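We compare $E_n R_n$ and the partial summing operator

(11.11) $P_n f = \sum_{\ell=-\nu}^{\nu-1} \hat{f}(\ell)\, e^{i\ell\theta}$

for Fourier series. Note that

(11.12) $E_n R_n f(\theta) = \sum_{\ell=-\nu}^{\nu-1} f^\#(\ell)\, e^{i\ell\theta}.$

Consequently,

(11.13) $E_n R_n f = P_n f + Q_n f,$

with

(11.14) $Q_n f(\theta) = \sum_{\ell=-\nu}^{\nu-1} \big[f^\#(\ell) - \hat{f}(\ell)\big]\, e^{i\ell\theta}$

and, by (11.7),

(11.15) $f^\#(\ell) - \hat{f}(\ell) = \sum_{j \in \mathbb{Z} \setminus 0} \hat{f}(\ell + jn).$

Consequently, the sup norm of $Q_n f$ is bounded by

(11.16) $\sum_{\ell=-\nu}^{\nu-1} |f^\#(\ell) - \hat{f}(\ell)| \le \sum_{|k| \ge \nu} |\hat{f}(k)|.$

The right side also dominates the sup norm of $f - P_n f$, proving:

Proposition 11.4. If $f \in C(S^1)$ has absolutely convergent Fourier series, then

(11.17) $\|f - E_n R_n f\|_{L^\infty} \le 2 \sum_{|k| \ge \nu} |\hat{f}(k)|.$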


The estimation of various norms of $f - P_n f$ is an exercise in Fourier analysis
on $S^1$. There are many estimates involving Sobolev spaces; see Chap. 4. Here we
note a simple estimate, for $m \ge 1$:

(11.18) $\|f - P_n f\|_{C^\ell(S^1)} \le \sum_{|k| \ge \nu} |k|^\ell\, |\hat{f}(k)| \le C_{m\ell}\, \|f\|_{C^{\ell+m+1}(S^1)}\, n^{-m},$

the last inequality following from (1.49). As the reader can verify, use of the
proof of Proposition 1.3 can lead to a sharper estimate. As for an estimate of the
contribution of $Q_n$ to the discretization error, from (11.14) to (11.16) we easily
obtain

(11.19) $\|Q_n f\|_{C^\ell(S^1)} \le \Big(\dfrac{n}{2}\Big)^{\ell} \sum_{|k| \ge \nu} |\hat{f}(k)| \le C_{\ell m}\, \|f\|_{C^{\ell+m+1}(S^1)}\, n^{-m}.$

We reiterate that sharper estimates are possible.
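Recall that solutions to a number of evolution equations are given by Fourier
multipliers on $L^2(S^1)$, of the form

(11.20) $F(D)u(\theta) = \sum_{\ell=-\infty}^{\infty} F(\ell)\, \hat{u}(\ell)\, e^{i\ell\theta}.$

We want to compare such an operator with its discretized version on $L^2(\Gamma_n)$:

(11.21) $F(D_n)\Big(\sum_{\ell \in \mathbb{Z}_n} g(\ell)\, \omega^{j\ell}\Big) = \sum_{\ell=-\nu}^{\nu-1} F(\ell)\, g(\ell)\, \omega^{j\ell}.$

In fact, a simple calculation yields

(11.22) $E_n F(D_n) R_n u(\theta) = \sum_{\ell=-\nu}^{\nu-1} F(\ell)\, u^\#(\ell)\, e^{i\ell\theta}$

and hence

(11.23) $E_n F(D_n) R_n u = P_n F(D)u + \Psi_n u,$

where

(11.24) $\Psi_n u(\theta) = \sum_{\ell=-\nu}^{\nu-1} F(\ell) \Big[\sum_{j \in \mathbb{Z} \setminus 0} \hat{u}(\ell + jn)\Big]\, e^{i\ell\theta}.$

This implies the estimate

(11.25) $\|\Psi_n u\|_{L^\infty} \le \Big[\sup_{|\ell| \le \nu} |F(\ell)|\Big] \sum_{|k| \ge \nu} |\hat{u}(k)|.$

Also, as in (11.18), we have, for $m \ge 1$,

(11.26) $\|\Psi_n u\|_{C^\ell(S^1)} \le C_{\ell m} \Big[\sup_{|\ell| \le \nu} |F(\ell)|\Big]\, \|u\|_{C^{\ell+m+1}(S^1)}\, n^{-m}.$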


The significance of these statements is that, for $u$ smooth and $n$ large, the
discretized $F(D_n)$ provides a very accurate approximation to $F(D)$. This is of
practical importance for a number of numerical problems.

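Note the distinction between $D_n$ and the centered difference operator $\Delta_n$,
defined by

$(\Delta_n f)(\omega^j) = \dfrac{n}{4\pi i}\, \big[f(\omega^{j+1}) - f(\omega^{j-1})\big].$

We have, in place of (11.21),

(11.27) $F(\Delta_n)\Big(\sum_{\ell \in \mathbb{Z}_n} g(\ell)\, \omega^{j\ell}\Big) = \sum_{\ell \in \mathbb{Z}_n} F\Big(\dfrac{n}{2\pi} \sin\dfrac{2\pi\ell}{n}\Big)\, g(\ell)\, \omega^{j\ell},$

so, for $g^b \in L^2(\Gamma_n)$ given by (11.2),

(11.28) $F(\Delta_n)g^b(\omega^j) - F(D_n)g^b(\omega^j) = \sum_{\ell=-\nu}^{\nu-1} \Big[F\Big(\dfrac{n}{2\pi} \sin\dfrac{2\pi\ell}{n}\Big) - F(\ell)\Big]\, g(\ell)\, \omega^{j\ell}.$

This identity leads to a variety of estimates, of which the following is a simple
example. If $|F'(\lambda)| \le K$ for $-\nu \le \lambda \le \nu$, then

(11.29) $\|F(\Delta_n)u - F(D_n)u\|_{L^\infty} \le \dfrac{2\pi^2}{3}\, K\, n^{-2} \sum_{\ell=-\nu}^{\nu-1} |\ell|^3\, |u^\#(\ell)|,$

since, for $-\pi \le x \le \pi$, $|\sin x - x| \le (1/6)|x|^3$. The basic content of this is that
$F(\Delta_n)$ furnishes a second-order-accurate approximation to $F(D)$ (as $n \to \infty$).
This is an improvement over the first-order accuracy one would get by using a
one-sided difference operator, such as

$(\Delta_n^+ f)(\omega^j) = \dfrac{n}{2\pi i}\, \big[f(\omega^{j+1}) - f(\omega^j)\big],$

but not as good as the "infinite-order accuracy" one gets for $F(D_n)$ as a
consequence of (11.23)-(11.26).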
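
The orders of accuracy quoted above can be seen at the level of eigenvalues. Acting on the function $\omega^j \mapsto \omega^{j\ell}$, the operator $D_n$ has eigenvalue $\ell$, the centered difference operator has eigenvalue $(n/2\pi)\sin(2\pi\ell/n)$, and a one-sided difference with the normalization $n/2\pi i$ has eigenvalue $(n/2\pi i)(\omega^\ell - 1)$. The sketch below compares these for a fixed frequency as $n$ doubles; the frequency $\ell = 3$ and the grid sizes are arbitrary illustrative choices.

```python
import cmath
import math

def centered_symbol(l, n):
    """Eigenvalue of the centered difference operator on w^(j*l), cf. (11.27)."""
    return n / (2 * math.pi) * math.sin(2 * math.pi * l / n)

def one_sided_symbol(l, n):
    """Eigenvalue of the one-sided difference operator on w^(j*l)."""
    return n / (2j * math.pi) * (cmath.exp(2j * math.pi * l / n) - 1)

l = 3
sizes = (32, 64, 128)
centered_err = {n: abs(centered_symbol(l, n) - l) for n in sizes}
one_sided_err = {n: abs(one_sided_symbol(l, n) - l) for n in sizes}

# Bound coming from |sin x - x| <= |x|^3 / 6 with x = 2*pi*l/n.
bound = {n: (2 * math.pi ** 2 / 3) * l ** 3 / n ** 2 for n in sizes}

centered_ratio = centered_err[32] / centered_err[64]    # close to 4: second order
one_sided_ratio = one_sided_err[32] / one_sided_err[64]  # close to 2: first order
```

Doubling $n$ reduces the centered error by a factor close to 4 and the one-sided error by a factor close to 2, while the error of $D_n$ on this mode is zero for every $n > 2|\ell|$.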
Similar to the case of functions on $S^1$, we have, for $u \in L^2(\Gamma_n)$,

(11.30) $F(D_n)u(\omega^j) = (k_F * u)(\omega^j) = \dfrac{1}{n} \sum_{\omega^\ell \in \Gamma_n} k_F(\omega^{j-\ell})\, u(\omega^\ell),$

where

(11.31) $k_F(\omega^j) = \sum_{\ell=-\nu}^{\nu-1} F(\ell)\, \omega^{j\ell}.$

For example, with $F(\ell) = e^{-y|\ell|}$, we get the discrete version of the Poisson
kernel:

(11.32) $k_F(\omega^j) = p_y(\omega^j) = \sum_{\ell=-\nu}^{\nu-1} e^{-y|\ell|}\, \omega^{j\ell},$

which we can write as a sum of two finite geometrical series to get

(11.33) $p_y(\omega^j) = \dfrac{(1 - r^2)\,\big(1 - (-1)^j r^{\nu}\big)}{1 + r^2 - 2r\cos(2\pi j/n)},$

with $r = e^{-y}$ and, as usual, $\omega = e^{2\pi i/n}$, $n = 2\nu$. Compare with (1.30). The
reader can produce a similar formula for $n$ odd.
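
A closed form for the discrete Poisson kernel can be checked against the defining sum (11.32). Summing the two geometric series, over $0 \le \ell \le \nu - 1$ and over $-\nu \le \ell \le -1$, and using $\omega^{j\nu} = (-1)^j$ gives $(1 - r^2)(1 - (-1)^j r^\nu)/(1 + r^2 - 2r\cos(2\pi j/n))$. That expression is the result of this particular summation and may be arranged differently elsewhere. The values of `nu` and `y` below are arbitrary.

```python
import cmath
import math

def poisson_sum(j, nu, y):
    """Defining sum (11.32): sum over l = -nu, ..., nu-1 of exp(-y|l|) * w^(j*l), n = 2*nu."""
    n = 2 * nu
    w = cmath.exp(2j * cmath.pi / n)
    return sum(math.exp(-y * abs(l)) * w ** (j * l) for l in range(-nu, nu))

def poisson_closed(j, nu, y):
    """Closed form from summing the two geometric series, with r = exp(-y)."""
    n = 2 * nu
    r = math.exp(-y)
    numerator = (1 - r ** 2) * (1 - (-1) ** j * r ** nu)
    return numerator / (1 + r ** 2 - 2 * r * math.cos(2 * math.pi * j / n))

nu, y = 8, 0.3
max_diff = max(abs(poisson_sum(j, nu, y) - poisson_closed(j, nu, y))
               for j in range(2 * nu))
```

As $\nu \to \infty$ with $\theta = 2\pi j/n$ fixed, the factor $1 - (-1)^j r^\nu$ tends to 1 and the expression reduces to the classical Poisson kernel $(1 - r^2)/(1 + r^2 - 2r\cos\theta)$.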

As in the case of $S^1$, the sum (11.31) for the (discretized) heat kernel, with
$F(\ell) = e^{-t\ell^2}$, cannot generally be simplified to an expression whose size is independent
of $n$. However, when $t$ is an imaginary integer, such an evaluation can
be performed. Such expressions are called Gauss sums, and their evaluation is
regarded as one of the pearls of early nineteenth-century mathematics. We present
one such result here.

Proposition 11.5. For any $n \ge 1$, even or odd,

(11.34) $\sum_{k=0}^{n-1} e^{2\pi i k^2/n}\, e^{2\pi i \ell k/n} = \dfrac{1+i}{2}\, e^{-\pi i \ell^2/2n}\, \big[1 + (-1)^\ell\, i^{-n}\big]\, n^{1/2}.$
Proof. The sum on the left is $n \cdot f^\#(-\ell)$, where $f \in C(S^1)$ is given by

$f(\theta) = e^{in\theta^2/2\pi}, \quad 0 \le \theta \le 2\pi.$

Note that $f$ is Lipschitz on $S^1$, with a simple jump in its derivative, so $\hat{f}(k) = O(|k|^{-2})$.
Hence Proposition 11.3 applies, and (11.7) yields

(11.35) $f^\#(-\ell) = \sum_{j=-\infty}^{\infty} \int_0^1 e^{2\pi i n[y^2 + (j + \ell/n)y]}\,dy.$

To evaluate this, we use the "Gaussian integral" (convergent though not absolutely
convergent):

(11.36) $\int_{-\infty}^{\infty} e^{2\pi i n y^2}\,dy = \gamma\, n^{-1/2}, \quad \gamma = \dfrac{1}{2}(1 + i),$

obtained from (3.20) by a change of variable and analytic continuation, as
in (6.42). We will break up the real line as a countable union of intervals
$\bigcup_k (k + a, k + a + 1]$, in two different ways, and then evaluate (11.35). Note that

(11.37) $\int_{k+a}^{k+a+1} e^{2\pi i n y^2}\,dy = \int_0^1 e^{2\pi i n[y^2 + 2(k+a)y]}\,dy \cdot e^{2\pi i n(k+a)^2}.$

If we pick $a = \ell/2n$, then $2(k + a) = 2k + \ell/n$, and as $k$ runs over $\mathbb{Z}$, we get
those integrands in (11.35) for which $j$ is even. If we pick $a = -1/2 + \ell/2n$, then
$2(k + a) = 2k - 1 + \ell/n$. Furthermore, we have

(11.38) $e^{2\pi i n(k+a)^2} = e^{\pi i \ell^2/2n}$ and $e^{\pi i(\ell - n)^2/2n},$

respectively, for these two choices of $a$. Thus the sum in (11.35) is equal to $n^{-1/2}\gamma$
times $e^{-\pi i \ell^2/2n} + e^{-\pi i(\ell - n)^2/2n}$, which gives the desired formula, (11.34).


The basic case of this sum is the $\ell = 0$ case:

(11.39) $\sum_{k=0}^{n-1} e^{2\pi i k^2/n} = \dfrac{1}{2}(1 + i)(1 + i^{-n})\, n^{1/2} = \sigma_n \cdot n^{1/2},$

where $\sigma_n$ is periodic of period 4 in $n$, with

(11.40) $\sigma_0 = 1 + i, \quad \sigma_1 = 1, \quad \sigma_2 = 0, \quad \sigma_3 = i.$

This result, particularly when $n = p$ is a prime, is used as a tool to obtain fascinating
number-theoretical results. For more on this, see the exercises and the
references [Hua, Land, Rad].
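
The Gauss sum evaluation is easy to test numerically. The sketch below compares the finite sum $\sum_{k=0}^{n-1} e^{2\pi i k^2/n} e^{2\pi i \ell k/n}$ with the closed form $\frac{1+i}{2} e^{-\pi i \ell^2/2n}[1 + (-1)^\ell i^{-n}] n^{1/2}$ over a range of $n$ and $\ell$, and reads off the period-4 constants $\sigma_n$ from the $\ell = 0$ case. The function names and the range of $n$ are illustrative choices.

```python
import cmath

def gauss_sum_direct(n, l):
    """Left side of (11.34), summed term by term."""
    return sum(cmath.exp(2j * cmath.pi * (k * k + l * k) / n) for k in range(n))

def gauss_sum_closed(n, l):
    """Right side of (11.34)."""
    phase = cmath.exp(-1j * cmath.pi * l * l / (2 * n))
    return (1 + 1j) / 2 * phase * (1 + (-1) ** l * (1j) ** (-n)) * n ** 0.5

max_diff = max(abs(gauss_sum_direct(n, l) - gauss_sum_closed(n, l))
               for n in range(1, 13) for l in range(n))

# sigma_n = (l = 0 sum) / sqrt(n) for n = 4, 5, 6, 7, i.e. n = 0, 1, 2, 3 mod 4.
sigma = [gauss_sum_direct(n, 0) / n ** 0.5 for n in (4, 5, 6, 7)]
```

The list `sigma` approximates $1 + i$, $1$, $0$, $i$, matching (11.40).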
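Exercises

1. Generalize the Gauss sum identity (11.34) to

(11.41) $\sum_{k=0}^{n-1} e^{2\pi i k^2 m/n}\, e^{2\pi i \ell k/n} = \dfrac{1+i}{2} \Big(\dfrac{n}{m}\Big)^{1/2} e^{-\pi i \ell^2/2mn} \sum_{\nu=0}^{2m-1} e^{-\pi i n \nu^2/2m}\, e^{-\pi i \nu\ell/m}.$

(Hint: The left side is $n \cdot f^\#(-\ell)$, with

$f(\theta) = e^{imn\theta^2/2\pi}, \quad 0 \le \theta \le 2\pi.$

For this, one has a formula like (11.35):

$f^\#(-\ell) = \sum_{j=-\infty}^{\infty} \int_0^1 e^{2\pi i n m[y^2 + (1/m)(j + \ell/n)y]}\,dy.$

Write $j = 2m\mu + \nu$, so

$\sum_{j=-\infty}^{\infty} = \sum_{\nu=0}^{2m-1}\ \sum_{j \equiv \nu \bmod 2m}.$

For fixed $\nu$, the sum becomes a multiple of the Gaussian integral (11.36), with $n$
replaced by $nm$, and the formula (11.41) arises.)
Note the $\ell = 0$ case of this:

$\sum_{k=0}^{n-1} e^{2\pi i k^2 m/n} = \dfrac{1+i}{2} \Big(\dfrac{n}{m}\Big)^{1/2} \sum_{\nu=0}^{2m-1} e^{-\pi i n \nu^2/2m}.$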


2. Let $\Delta$ be $d^2/dx^2$ on $S^1 = \mathbb{R}/(2\pi\mathbb{Z})$. Using Fourier series, show that, for $t = 2\pi m/n$,
where $m$ and $n$ are positive integers, the solution operator $e^{-it\Delta}\delta(x) = H(t, x)$ to the
Schrödinger equation has the form

(11.42) $H\Big(\dfrac{2\pi m}{n}, x\Big) = \dfrac{1}{n} \sum_{\ell=0}^{n-1} G(m, n, \ell)\, \delta_{2\pi\ell/n}(x),$

where $G(m, n, \ell)$ is given by the left side of (11.41). On the other hand, applying $e^{-it\Delta}$,
acting on $\mathcal{S}'(\mathbb{R})$, to $\sum_\nu \delta(x - 2\pi\nu)$, show that (11.42) holds, with $G(m, n, \ell)$ given
by the right side of (11.41). Hence deduce another proof of this Gauss sum identity.

Remark. Applications of the Schrödinger equation on a flat torus to multivariable Gauss
sums are given in [Tay3].
3. Let $\#(\ell, n)$ denote the number of solutions $k \in \mathbb{Z}_n$ to

$\ell = k^2 \pmod n.$

Show that, with $\omega = e^{2\pi i/n}$,

$\sum_{k=0}^{n-1} \omega^{k^2} = \sum_{\ell=0}^{n-1} \#(\ell, n)\, \omega^{\ell}.$

4. Show that, more generally,

$\Big(\sum_{k=0}^{n-1} \omega^{k^2}\Big)^{\nu} = \sum_{\ell=0}^{n-1} \#(\ell, n; \nu)\, \omega^{\ell},$

where $\#(\ell, n; \nu)$ denotes the number of solutions $(k_1, \dots, k_\nu) \in (\mathbb{Z}_n)^\nu$ to

$\ell = k_1^2 + \cdots + k_\nu^2 \pmod n.$


5. Let $p$ be a prime. The Legendre symbol $(\ell|p)$ is defined to be $+1$ if $\ell = k^2$ mod $p$ for
some $k$ and $\ell \ne 0$, $0$ if $\ell = 0$, and $-1$ otherwise. If $p$ is an odd prime, $\#(\ell, p) = (\ell|p) + 1$.
The Legendre symbol has the useful multiplicative property: $(\ell_1\ell_2|p) = (\ell_1|p)(\ell_2|p)$.
Check this. Show that, with $\omega = e^{2\pi i/p}$, if $p$ is an odd prime,

$\sum_{k=0}^{p-1} \omega^{k^2} = \sum_{\ell=0}^{p-1} (\ell|p)\, \omega^{\ell},$

and, more generally,

$\sum_{k=0}^{p-1} \omega^{jk^2} = \sum_{\ell=0}^{p-1} (\ell|p)\, \omega^{j\ell} + p\,\delta_{j0},$

where $\delta_{j0} = 1$ if $j = 0$ (mod $p$), $0$ otherwise. (Hint: Use Exercise 3.)
6. Denoting $\sum_{k=0}^{p-1} \omega^{k^2}$ by $G_p$, $p$ an odd prime, show that

$\sum_{k=0}^{p-1} \omega^{jk^2} = (j|p) \cdot G_p + p \cdot \delta_{j0}.$

(Hint: If $1 \le j \le p - 1$, use $\sum_{\ell=0}^{p-1} (\ell|p)\, \omega^{\ell} = \sum_{\ell=0}^{p-1} (j\ell|p)\, \omega^{j\ell}$ and $(j\ell|p) = (j|p)(\ell|p)$.)

Denote by $S(m, n)$ the Gauss sum

$S(m, n) = \sum_{k=0}^{n-1} e^{2\pi i k^2 m/n}.$

Then the content of Exercise 6 is that $S(j, p) = (j|p)\,S(1, p)$, for $1 \le j \le p - 1$, when
$p$ is an odd prime.
7. Assume $p$ and $q$ are distinct odd primes. Show that

$S(1, pq) = S(q, p)\,S(p, q).$

(Hint: To resum $\sum_k e^{2\pi i k^2/pq}$, use the fact that, as $\mu$ runs over $\{0, 1, \dots, p-1\}$ and $\nu$
runs over $\{0, 1, \dots, q-1\}$, then $k = \mu q + \nu p$ runs once over $\mathbb{Z}$ mod $pq$.)
8. From Exercises 6 and 7, it follows that when $p$ and $q$ are distinct odd primes,

$(p|q)(q|p) = \dfrac{S(1, pq)}{S(1, p)\,S(1, q)}.$

Use the evaluation (11.39) of $S(1, n)$ to deduce the quadratic reciprocity law:

$(p|q)(q|p) = (-1)^{(p-1)(q-1)/4}.$

This law, together with the complementary results

$(-1|p) = (-1)^{(p-1)/2}, \quad (2|p) = (-1)^{(p^2-1)/8},$

allows for an effective computation of $(\ell|p)$, as one application, but the significance
of quadratic reciprocity goes beyond this. It and other implications of Gauss sums are
absolutely fundamental in number theory. For material on this, see [Hua, Land, Rad].


12. The fast Fourier transform 


In the last section we discussed some properties of the discrete Fourier transform

(12.1) $f^\#(\ell) = \dfrac{1}{n} \sum_{\omega^j \in \Gamma_n} f(\omega^j)\, \omega^{-j\ell},$

where $\ell \in \mathbb{Z}_n = \mathbb{Z}/(n)$ and $\Gamma_n$ is the multiplicative group of unit complex
numbers generated by $\omega = e^{2\pi i/n}$. We now turn to a discussion of the efficient
numerical computation of the discrete Fourier transform. Note that, for any fixed
$\ell$, computing the right side of (12.1) involves $n - 1$ additions and $n$ multiplications
of complex numbers, plus $n$ integer products $j\ell = m$ and looking up $\omega^m$ and
$f(\omega^j)$. If the computations for varying $\ell$ are done independently, the total effort
to compute $f^\#$ involves $n^2$ multiplications and $n(n - 1)$ additions of complex
numbers, plus some further chores. The fast Fourier transform (denoted FFT) is
a method for computing $f^\#$ in $Cn(\log n)$ steps, in case $n$ is a power of 2.

The possibility of doing this arises from observing redundancies in the
calculation of the Fourier coefficients $f^\#(\ell)$. Let us illustrate this in the case
of $\Gamma_4$. We can write

(12.2) $4f^\#(0) = [f(1) + f(i^2)] + [f(i) + f(i^3)],$
$\qquad 4f^\#(2) = [f(1) + f(i^2)] - [f(i) + f(i^3)],$

and

(12.3) $4f^\#(1) = [f(1) - f(i^2)] - i\,[f(i) - f(i^3)],$
$\qquad 4f^\#(3) = [f(1) - f(i^2)] + i\,[f(i) - f(i^3)].$

Note that each term in square brackets appears twice. Note also that (12.2) gives
the Fourier coefficients of a function on $\Gamma_2$; namely, if

(12.4) ${}^0\!f(1) = f(1) + f(-1), \quad {}^0\!f(-1) = f(i) + f(-i),$

then

(12.5) $2f^\#(2\ell) = {}^0\!f^\#(\ell), \quad \text{for } \ell = 0 \text{ or } 1.$

Similarly, if we set

(12.6) ${}^1\!f(1) = f(1) - f(-1), \quad {}^1\!f(-1) = -i\,[f(i) - f(-i)],$

then

(12.7) $2f^\#(2\ell + 1) = {}^1\!f^\#(\ell), \quad \text{for } \ell = 0 \text{ or } 1.$

This phenomenon is a special case of a more general result that leads to a fast
inductive procedure for evaluating the Fourier transform $f^\#$.

Suppose $n = 2^k$; let us use the notation $G_k = \Gamma_n$. Note that $G_{k-1}$ is a
subgroup of $G_k$. Furthermore, there is a homomorphism of $G_k$ onto $G_{k-1}$, given
by $\omega^j \mapsto \omega^{2j}$. Given $f : G_k \to \mathbb{C}$, define the following functions ${}^0\!f$ and ${}^1\!f$ on
$G_{k-1}$, with $\omega_1 = \omega^2$, generating $G_{k-1}$:

(12.8) ${}^0\!f(\omega_1^j) = f(\omega^j) + f(\omega^{j+n/2}),$

(12.9) ${}^1\!f(\omega_1^j) = \bar{\omega}^j\, \big[f(\omega^j) - f(\omega^{j+n/2})\big].$

Note that the factor $\bar{\omega}^j$ in (12.9) makes ${}^1\!f(\omega_1^j)$ well defined for $j \in \mathbb{Z}_{n/2}$, that is,
the right side of (12.9) is unchanged if $j$ is replaced by $j + n/2$. Then ${}^0\!f^\#$ and
${}^1\!f^\#$, the discrete Fourier transforms of the functions ${}^0\!f$ and ${}^1\!f$, respectively, are
functions on $\mathbb{Z}_{n/2} = \mathbb{Z}/(2^{k-1})$.


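Proposition 12.1. We have the following identities relating the Fourier transforms
of ${}^0\!f$, ${}^1\!f$, and $f$:

(12.10) $2f^\#(2\ell) = {}^0\!f^\#(\ell)$

and

(12.11) $2f^\#(2\ell + 1) = {}^1\!f^\#(\ell),$

for $\ell \in \{0, 1, \dots, n/2 - 1\}$.

Proof. Recall that we set $\omega_1 = \omega^2$. Since $\omega^n = 1$ and $\omega_1^{n/2} = 1$, we have

(12.12) $n f^\#(2\ell) = \sum_{\omega^j \in G_k} f(\omega^j)\, \bar{\omega}_1^{j\ell} = \sum_{\omega_1^j = \omega^{2j} \in G_{k-1}} \big[f(\omega^j) + f(\omega^{j+n/2})\big]\, \bar{\omega}_1^{j\ell},$

proving (12.10), and, since $\omega^{n/2} = -1$,

(12.13) $n f^\#(2\ell + 1) = \sum_{\omega^j \in G_k} f(\omega^j)\, \bar{\omega}^j\, \bar{\omega}_1^{j\ell} = \sum_{\omega_1^j = \omega^{2j} \in G_{k-1}} \bar{\omega}^j\, \big[f(\omega^j) - f(\omega^{j+n/2})\big]\, \bar{\omega}_1^{j\ell},$

proving (12.11).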


Thus the problem of computing $f^\#$, given $f \in L^2(G_k)$, is transformed, after
$n/2$ multiplications and $n$ additions of complex numbers in (12.8) and (12.9),
to the problem of computing the Fourier transforms of two functions on $G_{k-1}$.
After $n/4$ new multiplications and $n/2$ new additions for each of these functions
${}^0\!f$ and ${}^1\!f$, that is, after an additional total of $n/2$ new multiplications and $n$
additions, this is reduced to the problem of computing four Fourier transforms
of functions on $G_{k-2}$. After $k$ iterations, we obtain $2^k$ functions on $G_0 = \{1\}$,
which precisely give the Fourier coefficients of $f$. Doing this hence takes $kn = (\log_2 n)n$
additions and $kn/2 = (\log_2 n)n/2$ multiplications of complex numbers,
plus a comparable number of integer operations and fetching from memory values
of given or previously computed functions.
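
The reduction in Proposition 12.1 translates directly into a recursive procedure. The following is a minimal sketch rather than an optimized routine: it forms the two half-length functions of (12.8) and (12.9), transforms each recursively, and interleaves the results according to (12.10) and (12.11). Functions on $\Gamma_n$ are stored as lists of values at $\omega^0, \dots, \omega^{n-1}$; the helper `naive_dft` and the test data are illustrative additions used only for comparison.

```python
import cmath

def naive_dft(f_vals):
    """Direct evaluation of (12.1); O(n^2) operations, used only as a reference."""
    n = len(f_vals)
    w = cmath.exp(2j * cmath.pi / n)
    return [sum(f_vals[j] * w ** (-j * l) for j in range(n)) / n for l in range(n)]

def fft_recursive(f_vals):
    """Return [f#(0), ..., f#(n-1)], normalized as in (12.1); n must be a power of 2."""
    n = len(f_vals)
    if n == 1:
        return [complex(f_vals[0])]
    half = n // 2
    w_bar = cmath.exp(-2j * cmath.pi / n)  # complex conjugate of w = e^{2 pi i / n}
    f0 = [f_vals[j] + f_vals[j + half] for j in range(half)]                 # (12.8)
    f1 = [w_bar ** j * (f_vals[j] - f_vals[j + half]) for j in range(half)]  # (12.9)
    even = fft_recursive(f0)  # transform of 0f on Z_{n/2}
    odd = fft_recursive(f1)   # transform of 1f on Z_{n/2}
    out = [0j] * n
    for l in range(half):
        out[2 * l] = even[l] / 2      # (12.10)
        out[2 * l + 1] = odd[l] / 2   # (12.11)
    return out

data = [complex(j * j - 3, (-1) ** j * j) for j in range(16)]
max_diff = max(abs(a - b) for a, b in zip(fft_recursive(data), naive_dft(data)))
```

Each level of the recursion performs $n/2$ complex multiplications and $n$ additions in total, and there are $\log_2 n$ levels, which reproduces the operation count stated above.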

To describe an explicit implementation of Proposition 12.1 for a computation
of $f^\#$, let us identify an element $\ell \in \mathbb{Z}_n$ ($n = 2^k$) with a $k$-tuple $L = (L_{k-1}, \dots, L_1, L_0)$
of elements of $\{0, 1\}$ giving the binary expansion of the integer
in $\{0, \dots, n - 1\}$ representing $\ell$ (i.e., $L_0 + L_1 \cdot 2 + \cdots + L_{k-1} \cdot 2^{k-1} = \ell$
mod $n$). To be a little fussy, we use the notation

(12.14) $f^\#(\ell) = f^{\#\#}(L).$

Then the formulas (12.10) and (12.11) state that

(12.15) $2f^{\#\#}(L_{k-1}, \dots, L_1, 0) = {}^0\!f^{\#\#}(L_{k-1}, \dots, L_1)$

and

(12.16) $2f^{\#\#}(L_{k-1}, \dots, L_1, 1) = {}^1\!f^{\#\#}(L_{k-1}, \dots, L_1).$

The inductive procedure described above gives, from ${}^0\!f$ and ${}^1\!f$ defined on $G_{k-1}$,
the functions

(12.17) ${}^{00}\!f = {}^0({}^0\!f), \quad {}^{10}\!f = {}^1({}^0\!f), \quad {}^{01}\!f = {}^0({}^1\!f), \quad {}^{11}\!f = {}^1({}^1\!f)$

defined on $G_{k-2}$, and so forth, and we see from (12.15) and (12.16) that

(12.18) $f^\#(\ell) = n^{-1}\; {}^{L}\!f,$

where ${}^{L}\!f = {}^{L}\!f(1)$ is defined on $G_0 = \{1\}$. From (12.8) and (12.9) we have the
following inductive formula for ${}^{L_{m+1}L_m \cdots L_1}\!f$ on $G_{k-m-1}$:

(12.19) ${}^{0L_m \cdots L_1}\!f(\omega_{m+1}^j) = {}^{L_m \cdots L_1}\!f(\omega_m^j) + {}^{L_m \cdots L_1}\!f(\omega_m^{j + 2^{k-m-1}}),$
$\qquad {}^{1L_m \cdots L_1}\!f(\omega_{m+1}^j) = \bar{\omega}_m^j\, \big[{}^{L_m \cdots L_1}\!f(\omega_m^j) - {}^{L_m \cdots L_1}\!f(\omega_m^{j + 2^{k-m-1}})\big],$

where $\omega_m$ is the generator of $G_{k-m}$, defined by $\omega_0 = \omega = e^{2\pi i/n}$ ($n = 2^k$),
$\omega_{m+1} = \omega_m^2$, that is, $\omega_m = \omega^{2^m}$.

When doing computations, particularly in a higher-level language, it may be
easier to work with integers $\ell$ than with $m$-tuples $(L_m, \dots, L_1)$. Therefore, let
us set

(12.20) ${}^{L_m \cdots L_1}\!f(\omega_m^j) = F_m(2^m j + \ell),$

where

$\ell = L_1 + L_2 \cdot 2 + \cdots + L_m \cdot 2^{m-1} \in \{0, 1, \dots, 2^m - 1\}$

and

$j \in \{0, 1, \dots, 2^{k-m} - 1\}.$

Note that this precisely defines $F_m$ on $\{0, 1, \dots, 2^k - 1\}$. For $m = 0$, we have

(12.21) $F_0(j) = f(\omega^j), \quad 0 \le j \le 2^k - 1.$

The iterative formulas (12.19) give

(12.22) $F_{m+1}(2^{m+1}j + \ell) = F_m(2^m j + \ell) + F_m(2^m j + 2^{k-1} + \ell),$
$\qquad F_{m+1}(2^{m+1}j + 2^m + \ell) = \bar{\omega}_m^j\, \big[F_m(2^m j + \ell) - F_m(2^m j + 2^{k-1} + \ell)\big],$

for $0 \le j \le 2^{k-m-1} - 1$, $0 \le \ell \le 2^m - 1$. It is easy to write a computer program
to implement such an iteration. The formula (12.18) for the Fourier transform of
$f$ becomes

(12.23) $f^\#(\ell) = n^{-1} F_k(\ell), \quad 0 \le \ell \le 2^k - 1.$
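
The iteration described by (12.21)-(12.23) can be sketched as below. The array `F` plays the role of $F_m$ and `G` that of $F_{m+1}$, with update rule $F_{m+1}(2^{m+1}j + \ell) = F_m(2^m j + \ell) + F_m(2^m j + 2^{k-1} + \ell)$ and $F_{m+1}(2^{m+1}j + 2^m + \ell) = \bar{\omega}_m^j[F_m(2^m j + \ell) - F_m(2^m j + 2^{k-1} + \ell)]$, where $\omega_m = \omega^{2^m}$. This is a straightforward, unoptimized sketch: the twiddle factors are recomputed rather than tabulated, and two arrays of length $n$ are used. The helper `naive_dft` and the test data are illustrative additions.

```python
import cmath

def naive_dft(f_vals):
    """Direct evaluation of (12.1); O(n^2) operations, used only as a reference."""
    n = len(f_vals)
    w = cmath.exp(2j * cmath.pi / n)
    return [sum(f_vals[j] * w ** (-j * l) for j in range(n)) / n for l in range(n)]

def fft_iterative(f_vals):
    """FFT following (12.21)-(12.23); len(f_vals) must be a power of 2."""
    n = len(f_vals)
    k = n.bit_length() - 1
    if n != 1 << k:
        raise ValueError("length must be a power of 2")
    w = cmath.exp(2j * cmath.pi / n)
    F = [complex(x) for x in f_vals]            # (12.21): F_0(j) = f(w^j)
    for m in range(k):
        w_m_bar = (w ** (1 << m)).conjugate()   # conjugate of w_m = w^(2^m)
        G = [0j] * n                            # second array, holding F_{m+1}
        for j in range(1 << (k - m - 1)):
            twiddle = w_m_bar ** j
            for l in range(1 << m):
                a = F[(j << m) + l]
                b = F[(j << m) + (1 << (k - 1)) + l]
                G[(j << (m + 1)) + l] = a + b                          # first line of (12.22)
                G[(j << (m + 1)) + (1 << m) + l] = twiddle * (a - b)   # second line of (12.22)
        F = G
    return [x / n for x in F]                   # (12.23)

data = [complex(j * j - 3, (-1) ** j * j) for j in range(16)]
max_diff = max(abs(a - b) for a, b in zip(fft_iterative(data), naive_dft(data)))
```

With this indexing the output is already in natural order, so no bit reversal is needed; the price is the second work array.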
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While (12.21)-(12.23) provide an easily implementable FFT algorithm, it is 
not necessarily the best. One drawback is the following. In passing from F;,, to 
Fy41 Via (12.22), you need two different arrays of n complex numbers. A variant 
of (12.19), where ’m+14"/1 Ff is replaced by f41'"4™/m+1, leads to an iterative 
procedure where a transformation of the type (12.19) is performed “in place,” 
and only one such array needs to be used. If memory is expensive and one needs 
to make the best use of it, this savings can be important. At the end of such an 
iteration, one needs to perform a “bit reversal” to produce f*. Details, including 
sample programs, can be found in [PFTV]. 
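
As an illustration, here is a minimal Python sketch of the two-array iteration (12.21)-(12.23). It is one possible transcription, not a tuned implementation. The names `fft_iterative` and `dft_direct` are ours, and we assume the transform convention $f^{\#}(\ell) = n^{-1}\sum_j f(\omega^j)\,\overline{\omega}^{\,j\ell}$ with $\omega = e^{2\pi i/n}$; the direct $O(n^2)$ sum is included only as a cross-check.

```python
import cmath


def fft_iterative(samples):
    """Return [f#(0), ..., f#(n-1)] for samples[j] = f(omega**j), n = 2**k.

    Follows (12.21)-(12.23): F_0 = samples, F_{m+1} is built from F_m by
    (12.22), and f#(l) = F_k(l) / n.  Assumed convention:
    f#(l) = (1/n) * sum_j f(omega**j) * conj(omega)**(j*l).
    """
    n = len(samples)
    k = n.bit_length() - 1
    if n < 1 or (1 << k) != n:
        raise ValueError("length must be a power of 2")

    current = [complex(v) for v in samples]          # F_0, as in (12.21)
    half = n // 2                                    # 2**(k-1)
    for m in range(k):
        block = 1 << m                               # 2**m
        nxt = [0j] * n                               # second array, F_{m+1}
        for j in range(1 << (k - m - 1)):
            # conj(omega_m)**j, with omega_m = exp(2*pi*i * 2**m / n)
            twiddle = cmath.exp(-2j * cmath.pi * block * j / n)
            for l in range(block):
                a = current[block * j + l]
                b = current[block * j + half + l]
                nxt[2 * block * j + l] = a + b
                nxt[2 * block * j + block + l] = twiddle * (a - b)
        current = nxt
    return [v / n for v in current]                  # (12.23)


def dft_direct(samples):
    """O(n**2) reference sum, using the same assumed convention."""
    n = len(samples)
    return [
        sum(samples[j] * cmath.exp(-2j * cmath.pi * j * l / n) for j in range(n)) / n
        for l in range(n)
    ]
```

Each pass allocates a fresh array `nxt`, which is exactly the memory cost discussed above; an in-place variant would trade this for a final bit-reversal step.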

On any given computer, a number of factors would influence the choice of the 
best FFT algorithm. These include such things as relative speed of memory access 
and floating-point performance, efficiency of computing trigonometric functions 
(e.g., whether this is implemented in hardware), degree of accuracy required, 
and other factors. Also, special features, such as computing the Fourier trans- 
form of a real-valued function or of a function whose Fourier transform is known 
to be real-valued, would affect specific computer programs designed for maxi- 
mum efficiency. Working out how best to implement FFTs on various computers 
presents many interesting problems. 


Exercises 


1. Write a computer program to implement the FFT via (12.21)-(12.23). Try to make it 
run as fast as possible. 

2. Using the FFT, write a computer program to solve numerically the initial-value problem
for the heat equation $\partial u/\partial t - u_{xx} = 0$ on $\mathbb{R}^+ \times S^1$.

3. Consider multidimensional generalizations of the discrete Fourier transform, and in 
particular the FFT. What size three-dimensional FFT could be handled by a computer 
with 4 megabytes of RAM? With 256 MB? 64 GB? 32 TB? 

4. Generalize the FFT algorithm to a cyclic group $\Gamma_n$, with $n = 3^k$. Also, generalize to
the case $n = p_1 \cdots p_k$ where $p_j$ are “small” primes.


A. The mighty Gaussian and the sublime gamma function 


The Gaussian function $e^{-|x|^2}$ on $\mathbb{R}^n$ is an object whose study yields many
wonderful identities. We will use the identity

(A.1) $\int_{\mathbb{R}^n} e^{-|x|^2}\,dx = \pi^{n/2},$

which was established in (3.18), to compute the area $A_{n-1}$ of the unit sphere
$S^{n-1}$ in $\mathbb{R}^n$. This computation will bring in Euler’s gamma function, and other
results will flow from this. Switching to polar coordinates in the integral in
(A.1), we have

(A.2) $\pi^{n/2} = A_{n-1}\int_0^\infty e^{-r^2} r^{n-1}\,dr = \tfrac{1}{2}A_{n-1}\int_0^\infty e^{-s} s^{n/2-1}\,ds = \tfrac{1}{2}A_{n-1}\Gamma\bigl(\tfrac{n}{2}\bigr),$

where the gamma function is defined by

(A.3) $\Gamma(z) = \int_0^\infty e^{-s} s^{z-1}\,ds,$

for Re $z > 0$. Thus we have the formula

(A.4) $A_{n-1} = \dfrac{2\pi^{n/2}}{\Gamma(\tfrac{1}{2}n)}.$


To be satisfied with this, we need an explicit evaluation of $\Gamma(n/2)$. This can be
obtained from $\Gamma(1/2)$ and $\Gamma(1)$ via the following identity:

(A.5) $\Gamma(z+1) = \int_0^\infty e^{-s} s^{z}\,ds = -\int_0^\infty \frac{d}{ds}\bigl(e^{-s}\bigr)\, s^{z}\,ds = z\,\Gamma(z),$

for Re $z > 0$, where we used integration by parts. The definition (A.3) clearly
gives

(A.6) $\Gamma(1) = 1.$

Thus, for any integer $k \ge 1$,

(A.7) $\Gamma(k) = (k-1)\Gamma(k-1) = \cdots = (k-1)!.$

Note that, for $n = 2$, we have $A_1 = 2\pi/\Gamma(1)$, so (A.6) agrees with the fact that
the circumference of the unit circle is $2\pi$ (which, of course, figured into the proof
of (3.18), via (3.20)). In case $n = 1$, we have $A_0 = 2$, which by (A.4) is equal to
$2\pi^{1/2}/\Gamma(1/2)$, so

(A.8) $\Gamma\bigl(\tfrac{1}{2}\bigr) = \pi^{1/2}.$

Again using (A.5), we see that, when $k \ge 1$ is an integer,

(A.9) $\Gamma\bigl(k+\tfrac12\bigr) = \bigl(k-\tfrac12\bigr)\Gamma\bigl(k-\tfrac12\bigr) = \bigl(k-\tfrac12\bigr)\bigl(k-\tfrac32\bigr)\cdots\bigl(\tfrac12\bigr)\Gamma\bigl(\tfrac12\bigr) = \pi^{1/2}\bigl(k-\tfrac12\bigr)\bigl(k-\tfrac32\bigr)\cdots\bigl(\tfrac12\bigr).$

In particular, $\Gamma(3/2) = (1/2)\Gamma(1/2) = \pi^{1/2}/2$, so $A_2 = 2\pi^{3/2}/(\pi^{1/2}/2) = 4\pi$,
which agrees with the well known formula for the area of the unit sphere
in $\mathbb{R}^3$.
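
The formula (A.4) is easy to evaluate numerically. The following short Python sketch (the function name `unit_sphere_area` is ours) uses the standard library's `math.gamma` and can be compared with the familiar values $A_0 = 2$, $A_1 = 2\pi$, $A_2 = 4\pi$.

```python
import math


def unit_sphere_area(n):
    """Area A_{n-1} of the unit sphere S^{n-1} in R^n, from (A.4)."""
    return 2.0 * math.pi ** (n / 2) / math.gamma(n / 2)


for n in (1, 2, 3, 4):
    print(n, unit_sphere_area(n))
```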

Note that while $\Gamma(z)$ defined by (A.3) is a priori holomorphic for Re $z$ positive,
the equation (A.5) shows that $\Gamma(z)$ has a meromorphic extension to the entire
complex plane, with simple poles at $z = 0, -1, -2, \dots$. It turns out that $\Gamma(z)$ has
no zeros, so $1/\Gamma(z)$ is an entire analytic function. This is a consequence of the
identity

(A.10) $\Gamma(z)\Gamma(1-z) = \dfrac{\pi}{\sin \pi z},$

which we now establish. From (A.3) we have (for $0 <$ Re $z < 1$)

(A.11) $\Gamma(z)\Gamma(1-z) = \int_0^\infty\!\!\int_0^\infty e^{-(s+t)} s^{-z} t^{z-1}\,ds\,dt = \int_0^\infty\!\!\int_0^\infty e^{-u} v^{z-1}(1+v)^{-1}\,du\,dv = \int_0^\infty (1+v)^{-1} v^{z-1}\,dv,$

where we have used the change of variables $u = s + t$, $v = t/s$. With $v = e^{x}$,
the last integral is

(A.12) $\int_{-\infty}^{\infty} (1+e^{x})^{-1} e^{xz}\,dx,$

which is holomorphic for $0 <$ Re $z < 1$, and we want to show that it is equal
to the right side of (A.10) on this strip. It suffices to prove the identity on the line
$z = 1/2 + i\xi$, $\xi \in \mathbb{R}$; then (A.12) is equal to the Fourier integral

(A.13) $\int_{-\infty}^{\infty} \Bigl(2\cosh\frac{x}{2}\Bigr)^{-1} e^{ix\xi}\,dx.$

To evaluate this, shift the contour of integration from the real line to the line
Im $x = -2\pi$. There is a pole of the integrand at $x = -\pi i$, and we have (A.13)
equal to

(A.14) $-\int_{-\infty}^{\infty} \Bigl(2\cosh\frac{x}{2}\Bigr)^{-1} e^{2\pi\xi} e^{ix\xi}\,dx - \text{Residue}\cdot(2\pi i).$

Consequently, (A.13) is equal to

(A.15) $-2\pi i\,\dfrac{\text{Residue}}{1+e^{2\pi\xi}} = \dfrac{\pi}{\cosh \pi\xi},$

and since $\pi/\sin\pi(1/2 + i\xi) = \pi/\cosh\pi\xi$, the demonstration of (A.10) is
complete.

The integral (A.3) and also the last integral in (A.11) are special cases of the
Mellin transform:

(A.16) $\mathcal{M}f(z) = \int_0^\infty f(r)\, r^{z-1}\,dr.$

If we evaluate this on the imaginary axis:

(A.17) $\mathcal{M}^{\#}f(s) = \int_0^\infty f(r)\, r^{is-1}\,dr,$

given appropriate growth restrictions on $f$, this is related to the Fourier transform
by a change of variable:

(A.18) $\mathcal{M}^{\#}f(s) = \int_{-\infty}^{\infty} f(e^{x})\, e^{isx}\,dx.$

The Fourier inversion formula and Plancherel formula imply

(A.19) $f(r) = (2\pi)^{-1}\int_{-\infty}^{\infty} \mathcal{M}^{\#}f(s)\, r^{-is}\,ds$

and

(A.20) $\int_{-\infty}^{\infty} |\mathcal{M}^{\#}f(s)|^2\,ds = (2\pi)\int_0^\infty |f(r)|^2\, r^{-1}\,dr.$

In some cases, as seen above, one evaluates $\mathcal{M}f(z)$ on a vertical line other than
the imaginary axis, which introduces only a slight wrinkle.

An important identity for the gamma function follows from taking the Mellin
transform with respect to $y$ of both sides of the subordination identity

(A.21) $e^{-yA} = \tfrac{1}{2}\, y\, \pi^{-1/2}\int_0^\infty e^{-y^2/4t}\, e^{-tA^2}\, t^{-3/2}\,dt$

(if $y > 0$, $A > 0$), established in §5; see (5.22). The Mellin transform of the
left side is clearly $\Gamma(z)A^{-z}$. The Mellin transform of the right side is a double
integral, which is readily converted to a product of two integrals, each defining
gamma functions. After a few changes of variables, there results the identity

(A.22) $\pi^{1/2}\,\Gamma(2z) = 2^{2z-1}\,\Gamma(z)\,\Gamma\bigl(z+\tfrac12\bigr),$

known as the duplication formula for the gamma function. In view of the
uniqueness of Mellin transforms, following from (A.18) and (A.19), the identity
(A.22) conversely implies (A.21). In fact, (A.22) was obtained first (by Legendre)
and this argument produces one of the standard proofs of the subordination
identity (A.21).
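
The subordination identity (A.21) can be checked numerically. The sketch below is only a sanity check under our own choices: the name `subordination_rhs`, the substitution $t = e^{u}$, and the step size and truncation range are all assumptions made for illustration.

```python
import math


def subordination_rhs(y, a, step=0.01, span=40.0):
    """Right side of (A.21), by the trapezoidal rule after substituting t = exp(u)."""
    steps = int(round(2 * span / step))
    total = 0.0
    for i in range(steps + 1):
        u = -span + i * step
        t = math.exp(u)
        # e^{-y^2/4t} e^{-t a^2} t^{-3/2} dt becomes (...) t^{-1/2} du, since dt = t du.
        value = math.exp(-y * y / (4 * t) - t * a * a) * t ** (-0.5)
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * value * step
    return 0.5 * y * total / math.sqrt(math.pi)


print(subordination_rhs(1.3, 0.7), math.exp(-1.3 * 0.7))
```

The logarithmic substitution makes the integrand decay rapidly in both directions, which is why a plain trapezoidal rule suffices here.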

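There is one further identity, which, together with (A.5), (A.10), and (A.22),
completes the list of the basic elementary identities for the gamma function.
Namely, if the beta function is defined by

(A.23) $B(x,y) = \int_0^1 s^{x-1}(1-s)^{y-1}\,ds = \int_0^\infty (1+u)^{-x-y} u^{x-1}\,du$

(with $u = s/(1-s)$), then

(A.24) $B(x,y) = \dfrac{\Gamma(x)\Gamma(y)}{\Gamma(x+y)}.$

To prove this, note that since

(A.25) $\Gamma(z)\,p^{-z} = \int_0^\infty e^{-pt}\, t^{z-1}\,dt,$

we have

$(1+u)^{-x-y} = \dfrac{1}{\Gamma(x+y)}\int_0^\infty e^{-(1+u)t}\, t^{x+y-1}\,dt,$

so

$B(x,y) = \dfrac{1}{\Gamma(x+y)}\int_0^\infty\!\!\int_0^\infty e^{-t}\, t^{x+y-1}\, e^{-ut}\, u^{x-1}\,du\,dt = \dfrac{\Gamma(x)}{\Gamma(x+y)}\int_0^\infty e^{-t}\, t^{y-1}\,dt = \dfrac{\Gamma(x)\Gamma(y)}{\Gamma(x+y)},$

as asserted.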
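
The four identities (A.5), (A.10), (A.22), and (A.24) lend themselves to a quick numerical spot check. The following Python sketch is illustrative only; the helper `beta_numeric`, the sample point $z = 0.3$, and the midpoint rule are our choices, and the quadrature assumes $x, y \ge 1$ so that the integrand in (A.23) is bounded.

```python
import math


def beta_numeric(x, y, steps=200_000):
    """Midpoint-rule approximation of B(x, y) from (A.23); assumes x, y >= 1."""
    h = 1.0 / steps
    return h * sum(
        ((i + 0.5) * h) ** (x - 1) * (1 - (i + 0.5) * h) ** (y - 1)
        for i in range(steps)
    )


z = 0.3
checks = {
    "(A.5)": (math.gamma(z + 1), z * math.gamma(z)),
    "(A.10)": (math.gamma(z) * math.gamma(1 - z), math.pi / math.sin(math.pi * z)),
    "(A.22)": (
        math.sqrt(math.pi) * math.gamma(2 * z),
        2 ** (2 * z - 1) * math.gamma(z) * math.gamma(z + 0.5),
    ),
    "(A.24)": (
        beta_numeric(2.5, 1.5),
        math.gamma(2.5) * math.gamma(1.5) / math.gamma(4.0),
    ),
}
for label, (lhs, rhs) in checks.items():
    print(label, lhs, rhs)
```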
The four basic identities proved above are the workhorses for most applications
involving gamma functions, but fundamental insight is provided by the identities
(A.27) and (A.31) below. First, since $0 \le e^{-t} - (1 - t/n)^n \le e^{-t}\cdot t^2/n$ for
$0 \le t \le n$, we have, for Re $z > 0$,

(A.26) $\Gamma(z) = \int_0^\infty e^{-t}\, t^{z-1}\,dt = \lim_{n\to\infty}\int_0^n \Bigl(1-\frac{t}{n}\Bigr)^{n} t^{z-1}\,dt = \lim_{n\to\infty} n^{z}\int_0^1 (1-s)^{n} s^{z-1}\,ds.$

Repeatedly integrating by parts gives

$\Gamma(z) = \lim_{n\to\infty} n^{z}\, \dfrac{n(n-1)\cdots 1}{z(z+1)\cdots(z+n-1)}\int_0^1 s^{z+n-1}\,ds,$

which yields the following result of Euler:

(A.27) $\Gamma(z) = \lim_{n\to\infty} n^{z}\, \dfrac{1\cdot 2\cdots n}{z(z+1)\cdots(z+n)}.$

Using the identity (A.5), analytically continuing $\Gamma(z)$, we have (A.27) for all $z$,
other than $0, -1, -2, \dots$. We can rewrite (A.27) as

(A.28) $\Gamma(z) = \lim_{n\to\infty} n^{z}\, z^{-1}(1+z)^{-1}\Bigl(1+\frac{z}{2}\Bigr)^{-1}\cdots\Bigl(1+\frac{z}{n}\Bigr)^{-1}.$

If we denote by $\gamma$ Euler’s constant:

(A.29) $\gamma = \lim_{n\to\infty}\Bigl(1+\frac12+\cdots+\frac1n-\log n\Bigr),$

then (A.28) is equivalent to

(A.30) $\Gamma(z) = \lim_{n\to\infty} e^{-\gamma z}\, e^{z(1+1/2+\cdots+1/n)}\, z^{-1}(1+z)^{-1}\Bigl(1+\frac{z}{2}\Bigr)^{-1}\cdots\Bigl(1+\frac{z}{n}\Bigr)^{-1},$

that is, to the Euler product expansion

(A.31) $\dfrac{1}{\Gamma(z)} = z\, e^{\gamma z}\prod_{n=1}^{\infty}\Bigl(1+\frac{z}{n}\Bigr)e^{-z/n}.$

It follows that the entire analytic function $1/\Gamma(z)\Gamma(-z)$ has the product expansion

(A.32) $\dfrac{1}{\Gamma(z)\Gamma(-z)} = -z^{2}\prod_{n=1}^{\infty}\Bigl(1-\frac{z^{2}}{n^{2}}\Bigr).$

Since $\Gamma(1-z) = -z\Gamma(-z)$, by virtue of (A.10) this last identity is equivalent to
the Euler product expansion

(A.33) $\sin \pi z = \pi z\prod_{n=1}^{\infty}\Bigl(1-\frac{z^{2}}{n^{2}}\Bigr).$

It is quite easy to deduce the formula (A.5) from the Euler product expansion
(A.31). Also, to deduce the duplication formula (A.22) from the Euler product
formula is a fairly straightforward exercise.
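
To get a feeling for how slowly the limits in (A.27) and (A.33) converge, one can evaluate the finite approximants directly. The sketch below is a rough experiment for real $z > 0$ only; the function names `gamma_euler` and `sine_product` are ours, and logarithms are used in the first routine purely to avoid overflow.

```python
import math


def gamma_euler(z, n):
    """n-th approximant n^z * n! / (z (z+1) ... (z+n)) from (A.27), for real z > 0."""
    log_value = z * math.log(n) + math.lgamma(n + 1)
    log_value -= sum(math.log(z + j) for j in range(n + 1))
    return math.exp(log_value)


def sine_product(z, n):
    """Partial product pi z prod_{m <= n} (1 - z^2/m^2) from (A.33)."""
    value = math.pi * z
    for m in range(1, n + 1):
        value *= 1.0 - (z * z) / (m * m)
    return value


for n in (10, 1_000, 100_000):
    print(n, gamma_euler(2.5, n) / math.gamma(2.5), sine_product(0.3, n))
```

In both cases the error appears to decay only like a constant times $1/n$, so these formulas are of structural rather than computational interest.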

Finally, we derive Stirling’s formula, for the asymptotic behavior of $\Gamma(z)$ as
$z \to +\infty$. The approach uses the Laplace asymptotic method, which has many
other applications. We begin by setting $t = sz$ and then $s = e^{y}$ in the integral
formula (A.3), obtaining

(A.34) $\Gamma(z) = z^{z}\int_0^\infty e^{-z(s-\log s)}\, s^{-1}\,ds = z^{z}\int_{-\infty}^{\infty} e^{-z(e^{y}-y)}\,dy.$

The last integral is of the form

(A.35) $\int_{-\infty}^{\infty} e^{-z\varphi(y)}\,dy,$

where $\varphi(y) = e^{y} - y$ has a nondegenerate minimum at $y = 0$; $\varphi(0) = 1$,
$\varphi'(0) = 0$, $\varphi''(0) = 1$. If we write $1 = A(y) + B(y)$, $A \in C_0^\infty((-2,2))$,
$A(y) = 1$ for $|y| \le 1$, then the integral (A.35) is readily seen to be

(A.36) $\int_{-\infty}^{\infty} A(y)\, e^{-z\varphi(y)}\,dy + O\bigl(e^{-(1+1/e)z}\bigr).$

We can make a smooth change of variable $x = \xi(y)$ such that $\xi(y) = y + O(y^2)$,
$\varphi(y) = 1 + x^2/2$, and the integral in (A.36) becomes

(A.37) $e^{-z}\int_{-\infty}^{\infty} A_1(x)\, e^{-zx^{2}/2}\,dx,$

where $A_1 \in C_0^\infty(\mathbb{R})$, $A_1(0) = 1$, and it is easy to see that, as $z \to +\infty$,

(A.38) $\int_{-\infty}^{\infty} A_1(x)\, e^{-zx^{2}/2}\,dx = \Bigl(\frac{2\pi}{z}\Bigr)^{1/2} + O(z^{-3/2}).$

In fact, if $z = 1/2t$, then (A.38) is equal to $(4\pi t)^{1/2}u(t,0)$, where $u(t,x)$ solves
the heat equation, $u_t - u_{xx} = 0$, $u(0,x) = A_1(x)$. Returning to (A.34), we have
Stirling’s formula:

(A.39) $\Gamma(z) = \Bigl(\frac{2\pi}{z}\Bigr)^{1/2} z^{z} e^{-z}\,\bigl[1 + O(z^{-1})\bigr].$

Since $n! = \Gamma(n+1)$, we have in particular that

(A.40) $n! = (2\pi n)^{1/2}\, n^{n} e^{-n}\,\bigl[1 + O(n^{-1})\bigr]$

as $n \to \infty$.
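
The size of the remainder in (A.40) is easy to observe experimentally. In the following sketch (the name `stirling` is ours) the quantity $n\,(n!/\text{approximation} - 1)$ is printed; if the remainder is genuinely $O(n^{-1})$, this should stay bounded as $n$ grows.

```python
import math


def stirling(n):
    """Leading term (2 pi n)^{1/2} n^n e^{-n} of (A.40)."""
    return math.sqrt(2 * math.pi * n) * n ** n * math.exp(-n)


for n in (5, 10, 50):
    ratio = math.factorial(n) / stirling(n)
    # n * (ratio - 1) should stay bounded if the remainder is O(1/n).
    print(n, ratio, n * (ratio - 1))
```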
Regarding the approach to the Laplace asymptotic method taken here, compare 


the derivation of the stationary phase method in Appendix B of Chap. 6. 


Binomial coefficient asymptotics 


For $n \in \mathbb{N}$, $k \in \{0, \dots, n\}$, we have the binomial coefficients

(A.41) $\binom{n}{k} = \dfrac{n!}{k!\,(n-k)!}.$

In fact, using $z! = \Gamma(z+1)$, we can define such coefficients for $n, k \in \mathbb{R}^+$,
$0 \le k \le n$. We will examine the behavior of

(A.42) $b_n(x) = \binom{n}{(n/2)(1+x)} = \dfrac{n!}{\bigl(\tfrac{n}{2}(1+x)\bigr)!\,\bigl(\tfrac{n}{2}(1-x)\bigr)!}$

for large $n$, with $-1 < x < 1$, using the Stirling formula, which we rewrite as

(A.43) $z! = e^{z\log z - z}\sqrt{2\pi z}\,\bigl(1 + O(z^{-1})\bigr), \quad z > 0.$

Straightforward substitution of $n$ and $(n/2)(1 \pm x)$ into (A.43) gives

(A.44) $b_n(x) = \sqrt{\dfrac{2}{\pi n}}\,(1-x^2)^{-1/2}\, 2^{n}\, e^{-(n/2)[(1+x)\log(1+x) + (1-x)\log(1-x)]} \times \bigl(1 + O((n(1-x^2))^{-1})\bigr).$

In particular,

(A.45) $\binom{n}{n/2} = b_n(0) = \sqrt{\dfrac{2}{\pi n}}\; 2^{n}\,\bigl(1 + O(n^{-1})\bigr).$

It is natural to normalize, obtaining

(A.46) $\beta_n(x) = \dfrac{b_n(x)}{b_n(0)} = (1-x^2)^{-1/2}\, e^{-(n/2)\psi(x)}\,\bigl(1 + O((n(1-x^2))^{-1})\bigr),$

where

(A.47) $\psi(x) = (1+x)\log(1+x) + (1-x)\log(1-x).$
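
The normalized asymptotics can be compared with the exact ratio $b_n(x)/b_n(0)$. The following Python sketch is a numerical illustration only; the function names are ours, and `math.lgamma` is used so that $z! = \Gamma(z+1)$ makes sense for non-integer arguments and large $n$ causes no overflow.

```python
import math


def psi(x):
    """psi(x) = (1+x) log(1+x) + (1-x) log(1-x), for -1 < x < 1."""
    return (1 + x) * math.log(1 + x) + (1 - x) * math.log(1 - x)


def log_binomial(n, k):
    """log of n! / (k! (n-k)!), with z! = Gamma(z+1), so k need not be an integer."""
    return math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)


def normalized_exact(n, x):
    """b_n(x) / b_n(0), computed from the definition of b_n."""
    return math.exp(log_binomial(n, 0.5 * n * (1 + x)) - log_binomial(n, 0.5 * n))


def normalized_asymptotic(n, x):
    """Leading term (1 - x^2)^{-1/2} exp(-(n/2) psi(x))."""
    return math.exp(-0.5 * n * psi(x)) / math.sqrt(1 - x * x)


for x in (0.1, 0.3, 0.6):
    print(x, normalized_exact(1000, x), normalized_asymptotic(1000, x))
```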


We have 


(A48) (xz) > 0, for -1<2<1, (0) =0, (41) =2log?2. 


The asymptotic relation (A.46) has a relatively small remainder as long as
$-1 < x < 1$ and

(A.49) $n(1-x^2) \gg 1.$


B. The central limit theorem 


In this appendix, we show how Fourier analysis applies to establish an important
result in probability theory known as the central limit theorem. To set things up,
suppose $(\Omega, \mathcal{F}, \mu)$ is a probability space ($\Omega$ a set, $\mathcal{F}$ a $\sigma$-algebra, $\mu$ a probability
measure) and that $\{f_j\}$ is a sequence of (real valued) independent, identically
distributed random variables on $\Omega$, with mean 0 and variance $\sigma$, so

(B.1) $f_j \in L^2(\Omega, \mu), \quad \int_\Omega f_j\,d\mu = 0, \quad \int_\Omega f_j^2\,d\mu = \sigma > 0.$

In such a case, the independence implies

(B.2) $(f_i, f_j)_{L^2} = 0, \quad \text{for } i \ne j.$

The weak law of large numbers says that, as $k \to \infty$,

(B.3) $S_k = \dfrac{1}{k}\sum_{j=1}^{k} f_j \longrightarrow 0, \quad \text{in } L^2\text{-norm}.$

The proof is simple:

(B.4) $\|S_k\|_{L^2}^2 = \dfrac{1}{k^2}\sum_{j=1}^{k} \|f_j\|_{L^2}^2 = \dfrac{\sigma}{k}.$

A standard presentation of the weak law says that $S_k \to 0$ in measure, which
follows from (B.3) (or better, from (B.4)), via Chebychev’s inequality.
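
For a concrete instance of (B.4), take the $f_j$ to be independent fair $\pm 1$ coin flips, so that $\sigma = 1$. This choice of example, and the name `second_moment_of_mean`, are ours. Exact rational arithmetic then gives $\|S_k\|_{L^2}^2 = 1/k$ on the nose, as the sketch below confirms.

```python
from fractions import Fraction
from math import comb


def second_moment_of_mean(k):
    """E[S_k^2] for S_k = (f_1 + ... + f_k)/k with independent fair +-1 steps."""
    total = Fraction(0)
    for heads in range(k + 1):
        position = 2 * heads - k                    # value of f_1 + ... + f_k
        probability = Fraction(comb(k, heads), 2 ** k)
        total += probability * Fraction(position, k) ** 2
    return total


for k in (1, 7, 40):
    print(k, second_moment_of_mean(k))
```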

Kolmogoroff’s strong law of large numbers produces pointwise a.e. convergence,
and relaxes the $L^2$ hypothesis, down to $L^1$ (and then yields $L^1$-norm
convergence), but we will not be concerned with that here. (Cf. Chapter 15 of
[Tay2] for a treatment, making a connection to Birkhoff’s ergodic theorem.)

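To proceed, each real-valued random variable $f$ on $\Omega$ induces a probability
measure $\nu_f$ on $\mathbb{R}$, given by

(B.5) $\nu_f(S) = \mu(f^{-1}(S)),$

when $S \subset \mathbb{R}$ is a Borel set. Note that

(B.6) $f \in L^1(\Omega, \mu) \Longleftrightarrow \int_{\mathbb{R}} |x|\,d\nu_f(x) < \infty, \qquad \int_\Omega f\,d\mu = \int_{\mathbb{R}} x\,d\nu_f(x).$

Similarly,

(B.7) $\int_\Omega f^2\,d\mu = \int_{\mathbb{R}} x^2\,d\nu_f(x),$

and, more generally, for $p \in [1, \infty)$,

(B.8) $\int_\Omega |f|^p\,d\mu = \int_{\mathbb{R}} |x|^p\,d\nu_f(x).$

Given $f$ as above, the function

(B.9) $\chi_f(\xi) = \int_\Omega e^{-i\xi f}\,d\mu = \int_{\mathbb{R}} e^{-ix\xi}\,d\nu_f(x) = \sqrt{2\pi}\,\hat{\nu}_f(\xi)$

is called the characteristic function of $f$. If $\{f_j\}$ are independent, then

(B.10) $G_k = \sum_{j=1}^{k} f_j \Longrightarrow \chi_{G_k}(\xi) = \chi_{f_1}(\xi)\cdots\chi_{f_k}(\xi).$

A special class of probability distributions on $\mathbb{R}$, called centered Gaussian
distributions, has the form

(B.11) $\gamma^{\sigma}(x) = \dfrac{1}{\sqrt{2\pi\sigma}}\, e^{-x^2/2\sigma}.$

One computes

(B.12) $\int x\,\gamma^{\sigma}(x)\,dx = 0, \quad \int x^2\,\gamma^{\sigma}(x)\,dx = \sigma.$

A random variable $f$ on $(\Omega, \mathcal{F}, \mu)$ is said to be Gaussian if $\nu_f$ is Gaussian. A
standard Fourier transform calculation gives

(B.13) $\sqrt{2\pi}\,\hat{\gamma}^{\sigma}(\xi) = e^{-\sigma\xi^2/2}.$

Hence $f : \Omega \to \mathbb{R}$ is Gaussian with mean 0 and variance $\sigma$ if and only if

(B.14) $\chi_f(\xi) = e^{-\sigma\xi^2/2}.$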

We note that 

(B.15) a ee ee EE 
and that if $f_j$ are independent, centered Gaussian random variables on $\Omega$, then the
sum $G_k = f_1 + \cdots + f_k$ is also Gaussian.

Gaussian distributions are often approximated by distributions of the sum of a 
large number of IID random variables, suitably rescaled. Theorems to this effect 
are called Central Limit Theorems. As stated in the opening paragraph of this 
appendix, our goal is to present such a result here. 


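Given that $\{f_j\}$ is IID and satisfies (B.1), the appropriate rescaling of
$f_1 + \cdots + f_k$ is suggested by the computation (B.4). We have

(B.16) $g_k = \dfrac{1}{\sqrt{k}}\sum_{j=1}^{k} f_j \Longrightarrow \|g_k\|_{L^2}^2 = \sigma.$

Note that if $\nu_1$ is the probability distribution of $f_1$ (hence of $f_j$ for all $j$), then for
any Borel set $B \subset \mathbb{R}$,

(B.17) $\nu_{g_k}(B) = \nu_k(\sqrt{k}\,B), \quad \nu_k = \nu_1 * \cdots * \nu_1 \ (k \text{ factors}).$

Note that

(B.18) $\int x^2\,d\nu_1 = \sigma, \quad \int x\,d\nu_1 = 0.$

Our goal is to prove the following version of CLT:

Theorem B.1. Assume $\{f_j : j \in \mathbb{N}\}$ is an IID sequence on $(\Omega, \mathcal{F}, \mu)$, with mean
zero, and satisfying $\|f_j\|_{L^2(\Omega,\mu)}^2 = \sigma$. Set

(B.19) $g_k = \dfrac{1}{\sqrt{k}}\sum_{j=1}^{k} f_j,$

and define $\gamma^{\sigma}$ as in (B.11). Then

(B.20) $\nu_{g_k} \longrightarrow \gamma^{\sigma}, \quad \text{weak}^* \text{ in } \mathcal{M}(\widehat{\mathbb{R}}).$

Here $\widehat{\mathbb{R}} = \mathbb{R} \cup \{\infty\}$, so

(B.21) $C(\widehat{\mathbb{R}}) = \{u \in C(\mathbb{R}) : u(x) \to u_\infty \text{ as } |x| \to \infty\},$

and $\mathcal{M}(\widehat{\mathbb{R}})$ is the space of finite signed measures on $\widehat{\mathbb{R}}$, so

(B.22) $\mathcal{M}(\widehat{\mathbb{R}}) = C(\widehat{\mathbb{R}})'.$

REMARK 1. The weak* convergence (B.20) means

(B.23) $\int f\,d\nu_{g_k} \longrightarrow \int f\,\gamma^{\sigma}\,dx,$

for each $f \in C(\widehat{\mathbb{R}})$. Since $\nu_{g_k}$ are finite positive measures, and $\gamma^{\sigma}$ is absolutely
continuous on $\widehat{\mathbb{R}}$, it is an automatic consequence that (B.23) holds whenever $f$ is
a bounded Borel function that is Riemann integrable on $\widehat{\mathbb{R}} \approx S^1$. See Appendix
C below for a brief discussion of this fact.

REMARK 2. In contrast to the law of large numbers, the central limit theorem does
not assert that $\{g_k\}$ converges to a random variable on $\Omega$ that is Gaussian with
variance $\sigma$. In fact, the set $\{\sigma^{-1/2}f_j\}$ forms an orthonormal basis of a Hilbert
space $H \subset L^2(\Omega, \mu)$, and each $g_k$ is an element of $H$, and so is any limit. But, for
each fixed $j$,

(B.24) $\lim_{k\to\infty} (f_j, g_k)_{L^2} = 0,$

so in fact, as $k \to \infty$,

(B.25) $g_k \longrightarrow 0, \quad \text{weakly in } L^2(\Omega, \mu).$

Proof of Theorem B.1. Applying the Fourier transform to the convolution identity
in (B.17) yields

(B.26) $\chi_{g_k}(\xi) = \chi(k^{-1/2}\xi)^{k},$

where $\chi(\xi) = \sqrt{2\pi}\,\hat{\nu}_1(\xi)$. By (B.6)-(B.7) applied to (B.1), and the fact that the
Fourier transform intertwines multiplication by $x$ and $i\,d/d\xi$, and that the Fourier
transform of a finite measure is a bounded, continuous function, we have

(B.27) $\chi \in C^2(\mathbb{R}), \quad \chi'(0) = 0, \quad \chi''(0) = -\sigma.$

Hence

(B.28) $\chi(\xi) = 1 - \dfrac{\sigma}{2}\xi^2 + r(\xi), \quad r(\xi) = o(\xi^2), \ \text{as } \xi \to 0.$

Equivalently, there exists $a > 0$ such that, for $|\xi| \le a$,

(B.29) $\chi(\xi) = e^{-\sigma\xi^2/2 + \xi^2\beta(\xi)}, \quad \beta(\xi) \to 0, \ \text{as } \xi \to 0.$

Hence

(B.30) $\chi_{g_k}(\xi) = e^{-\sigma\xi^2/2 + \xi^2\beta(k^{-1/2}\xi)}, \quad \text{for } |\xi| \le a k^{1/2},$

with

(B.31) $\beta(k^{-1/2}\xi) \to 0 \ \text{as } k \to \infty, \quad \forall\, \xi \in \mathbb{R}.$

Therefore,

(B.32) $\lim_{k\to\infty} \hat{\nu}_{g_k}(\xi) = \hat{\gamma}^{\sigma}(\xi), \quad \forall\, \xi \in \mathbb{R}.$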
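
The pointwise convergence of characteristic functions can be watched in a simple case. Suppose, purely for illustration, that each $f_j$ takes the values $\pm 1$ with probability $1/2$, so $\sigma = 1$ and the characteristic function of $f_1$ is $\cos\xi$. The sketch below (the name `chi_rescaled_sum` is ours) compares $\cos(\xi/\sqrt{k})^k$ with $e^{-\xi^2/2}$ at a few sample points.

```python
import math


def chi_rescaled_sum(xi, k):
    """chi(xi / sqrt(k))**k with chi(xi) = cos(xi), the fair +-1 case."""
    return math.cos(xi / math.sqrt(k)) ** k


def gap(k, points=(0.5, 1.0, 2.0, 3.0)):
    """Largest deviation from the Gaussian limit exp(-xi^2/2) over the sample points."""
    return max(abs(chi_rescaled_sum(xi, k) - math.exp(-xi * xi / 2)) for xi in points)


for k in (10, 100, 10_000):
    print(k, gap(k))
```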


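Now the functions $\hat{\nu}_{g_k}(\xi)$ are uniformly bounded by $1/\sqrt{2\pi}$. Making use of
(B.32), the Parseval identity for the Fourier transform, and the dominated
convergence theorem, we obtain for each $v \in \mathcal{S}(\mathbb{R})$ (the Schwartz space of rapidly
decreasing functions) that

(B.33) $\int v\,d\nu_{g_k} = \int \tilde{v}(\xi)\,\hat{\nu}_{g_k}(\xi)\,d\xi \longrightarrow \int \tilde{v}(\xi)\,\hat{\gamma}^{\sigma}(\xi)\,d\xi = \int v\,\gamma^{\sigma}\,dx,$

where $\tilde{v}$ denotes the inverse Fourier transform of $v$. An equivalent statement is that

(B.34) $\nu_{g_k} \longrightarrow \gamma^{\sigma} \quad \text{in } \mathcal{S}'(\mathbb{R}),$

where $\mathcal{S}'(\mathbb{R})$ denotes the Schwartz space of tempered distributions. However,
since $\{\nu_{g_k} : k \in \mathbb{N}\}$ is bounded in $\mathcal{M}(\widehat{\mathbb{R}})$ and $\mathcal{S}(\mathbb{R})$ is dense in

(B.35) $C_*(\mathbb{R}) = \{u \in C(\widehat{\mathbb{R}}) : u(\infty) = 0\},$

we also have

(B.36) $\int v\,d\nu_{g_k} \longrightarrow \int v\,\gamma^{\sigma}\,dx,$

for all $v \in C_*(\mathbb{R})$. Clearly (B.36) also holds for $v = 1$, so we have the conclusion
(B.20).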


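We can strengthen the conclusion of Theorem B.1, by using

(B.37) $\int x^2\,d\nu_{g_k}(x) = \|g_k\|_{L^2}^2 = \sigma.$

In particular,

(B.38) $\{(1+x^2)\nu_{g_k} : k \in \mathbb{N}\}$ is bounded in $\mathcal{M}(\widehat{\mathbb{R}})$,

and we have from (B.34) that

(B.39) $(1+x^2)\nu_{g_k} \longrightarrow (1+x^2)\gamma^{\sigma},$

in $\mathcal{S}'(\mathbb{R})$, hence in $C_*(\mathbb{R})'$, and then, by (B.37), in $C(\widehat{\mathbb{R}})'$. This gives:

Proposition B.2. In the setting of Theorem B.1, we have

(B.40) $(1+x^2)\nu_{g_k} \longrightarrow (1+x^2)\gamma^{\sigma}, \quad \text{weak}^* \text{ in } \mathcal{M}(\widehat{\mathbb{R}}).$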


Gaussian random variables will play a major role in Chapter 11, which deals 
with Brownian motion and other diffusion processes, related to the heat equation. 


Random walks and binomial coefficients 
The basic precursor to Brownian motion is the random walk, on $\mathbb{Z}$, starting at
0, which works as follows. At each time $n \in \mathbb{Z}^+$, the walker has an even chance
of moving one step to the left or one step to the right. This fits into the scenario
analyzed above, with

(B.41) $\nu_1 = \tfrac{1}{2}(\delta_1 + \delta_{-1}),$

and Theorem B.1 and Proposition B.2 apply. On the other hand, in this case there
is a formula for the probability distributions of the positions $x_n$ at time $n$, given
in terms of the binomial coefficients, via

(B.42) $(L + R)^n = \sum_{k=0}^{n} \binom{n}{k} L^k R^{n-k}.$

We have

(B.43) $\operatorname{Prob}(x_n = \ell) = 2^{-n}\binom{n}{k}$ if $\ell = n - 2k$ with $k \in \{0, \dots, n\}$, and $0$ otherwise.

We can use the asymptotic formula for

(B.44) $b_n(x) = \binom{n}{(n/2)(1+x)}, \quad |x| < 1,$

produced in the preceding appendix to analyze the large $n$ asymptotics of the
distribution of (B.43). Recall that

(B.45) $\beta_n(x) = \dfrac{b_n(x)}{b_n(0)} = \dfrac{1}{\sqrt{1-x^2}}\, e^{-(n/2)\psi(x)}\,\Bigl[1 + O\Bigl(\dfrac{1}{n(1-x^2)}\Bigr)\Bigr],$

with $\psi(x)$ given by (A.47)-(A.48). We also have $\psi''(x) = 2/(1-x^2)$, and we
can write

(B.46) $\psi(x) = x^2\bigl(1 + x^2 a(x^2)\bigr), \quad a > 0 \ \text{on } (0,1),$

with $a \in C^\infty([0,1))$. We look at the scaled function


(B.47) n(y) = Qn(): 


and see that, for y? <n, 


1 2 2 2 1 
B.48 n(Y) = ——— DE Mae" /2)114 4 O 
oe ee [i+ (=) 
C. Natural extension of weak* convergence of measures 


Let $X$ be a compact metric space, $\mu$ a finite positive Borel measure on $X$. If
$f : X \to \mathbb{R}$ is a bounded function, we say $f \in \mathcal{R}(X, \mu)$ provided that, for each
$\varepsilon > 0$, there exist

(C.1) $u, v \in C(X)$ such that $u \le f \le v$, and $\int_X (v - u)\,d\mu < \varepsilon.$

If $X = S^1$, the unit circle, and $\mu$ is Lebesgue measure, this class coincides with
the standard notion of Riemann integrable functions.

The Lebesgue criterion for Riemann integrability extends in a natural fashion
to this general setting:

Proposition C.1. Take $X$ and $\mu$ as above, and let $f : X \to \mathbb{R}$ be bounded. Set

(C.2) $D_f = \{x \in X : f \text{ is not continuous at } x\}.$

Then $D_f$ is a Borel set, and

(C.3) $f \in \mathcal{R}(X, \mu) \Longleftrightarrow \mu(D_f) = 0.$

The proof of the classical result (cf. [Tay2], Proposition 3.10) is readily adapted 
to establish Proposition C.1. We leave this as an exercise for the reader. 


It is of great interest that weak* convergence $\nu_k \to \mu$ has the following
extension property.

Proposition C.2. Take $X$, $\mu$ as above, and let $\nu_k$ be finite, positive Borel
measures on $X$. Assume

(C.4) $\nu_k \longrightarrow \mu, \quad \text{weak}^* \text{ in } \mathcal{M}(X) = C(X)'.$

Then, if $f : X \to \mathbb{R}$ is a bounded, Borel function,

(C.5) $f \in \mathcal{R}(X, \mu) \Longrightarrow \int f\,d\nu_k \to \int f\,d\mu.$

Proof. Given $f \in \mathcal{R}(X, \mu)$, take $\varepsilon > 0$ and pick $u$, $v$ such that (C.1) holds. Then

(C.6) $\int f\,d\nu_k \le \int v\,d\nu_k \to \int v\,d\mu \le \int f\,d\mu + \varepsilon,$

so

(C.7) $\limsup_{k\to\infty} \int f\,d\nu_k \le \int f\,d\mu.$

Similarly

(C.8) $\liminf_{k\to\infty} \int f\,d\nu_k \ge \int f\,d\mu,$

so we have (C.5).


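EXAMPLE. Let $X = \widehat{\mathbb{R}} = \mathbb{R} \cup \{\infty\}$, and let $\nu_k$ and $\mu$ be probability measures on
$\mathbb{R}$, naturally extended to $\widehat{\mathbb{R}}$, so that $\nu_k(\{\infty\}) = \mu(\{\infty\}) = 0$. Let

(C.9) $f : \mathbb{R} \to \mathbb{R}$ be a bounded, continuous function.

Then $f$ extends to a bounded function on $\widehat{\mathbb{R}}$, with only $\infty$ as a point of
discontinuity. Hence $f \in \mathcal{R}(\widehat{\mathbb{R}}, \mu)$, and (C.5) applies, so if (C.4) holds,

(C.10) $\int f\,d\nu_k \longrightarrow \int f\,d\mu,$

for all $f$ satisfying (C.9). Applying (C.10) to the (real and imaginary parts of)
$f_\xi(x) = e^{-ix\xi}$ yields the following.

Corollary C.3. Let $\nu_k$ and $\mu$ be probability measures on $\mathbb{R}$, and assume $\nu_k \to \mu$
in $\mathcal{S}'(\mathbb{R})$, hence weak* in $C(\widehat{\mathbb{R}})'$. Then

(C.11) $\hat{\nu}_k(\xi) \longrightarrow \hat{\mu}(\xi), \quad \forall\, \xi \in \mathbb{R}.$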
Weak* convergence of measures and uniform convergence of distribution 
functions 


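Let $\nu_k$ and $\mu$ be probability measures on $\mathbb{R}$. The conditions

(C.12)
$\nu_k \longrightarrow \mu$ in $\mathcal{D}'(\mathbb{R})$,
$\nu_k \longrightarrow \mu$ in $\mathcal{S}'(\mathbb{R})$,
$\nu_k \longrightarrow \mu$ weak* in $\mathcal{M}(\widehat{\mathbb{R}}) = C(\widehat{\mathbb{R}})'$

are all equivalent. They say

(C.13) $\int f\,d\nu_k \longrightarrow \int f\,d\mu,$

for $f \in C_0^\infty(\mathbb{R})$, $f \in \mathcal{S}(\mathbb{R})$, and $f \in C(\widehat{\mathbb{R}})$, respectively. Let us now assume

(C.14) $\mu$ has no atoms.

Then, by Proposition C.2, (C.13) holds for $f = \chi_{(-\infty,x]}$, for each $x \in \mathbb{R}$. In other
words, if we set

(C.15) $\Phi_k(x) = \nu_k((-\infty, x]), \quad G(x) = \mu((-\infty, x]),$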


we have

(C.16) $\Phi_k(x) \longrightarrow G(x), \quad \forall\, x \in \mathbb{R}.$

We note the following useful refinement. 


Proposition C.4. If $\nu_k$ and $\mu$ are probability measures on $\mathbb{R}$ satisfying (C.12) and
(C.14), then

(C.17) $\Phi_k \longrightarrow G$, uniformly on $\mathbb{R}$.

Proof. If not, there exist $\varepsilon > 0$, $k_n \to \infty$, and $x_{k_n} \in \mathbb{R}$ such that

(C.18) $|\Phi_{k_n}(x_{k_n}) - G(x_{k_n})| \ge \varepsilon.$

If $G(y_0) = \varepsilon/4$ and $G(y_1) = 1 - \varepsilon/4$, then only finitely many $x_{k_n}$ can lie outside
$[y_0, y_1]$. Hence there is a subsequence (which we merely denote $j$) of $(k_n)$ such
that

(C.19) $x_j \to y \in [y_0, y_1], \quad |\Phi_j(x_j) - G(x_j)| \ge \varepsilon.$

Then there is either a further subsequence satisfying $x_j \nearrow y$ or one satisfying
$x_j \searrow y$. Let’s deal with the first possibility; a similar argument will handle the
second.

To start, pick $N$ so large that

(C.20) $|\Phi_j(y) - G(y)| < \dfrac{\varepsilon}{4}$, and $|G(x_j) - G(y)| < \dfrac{\varepsilon}{4}, \quad \forall\, j \ge N.$

It follows that

(C.21) $|\Phi_j(y) - G(x_j)| < \dfrac{\varepsilon}{2}, \quad \forall\, j \ge N,$

hence, if (C.19) holds,

(C.22) $|\Phi_j(x_j) - \Phi_j(y)| > \dfrac{\varepsilon}{2}, \quad \forall\, j \ge N,$

hence $\nu_j([x_j, y]) > \varepsilon/2$ for $j \ge N$, and a fortiori

(C.23) $\nu_j([x_N, y]) > \dfrac{\varepsilon}{2}, \quad \forall\, j \ge N.$

Now we take $j \to \infty$ to conclude that

(C.24) $\mu([x_N, y]) \ge \dfrac{\varepsilon}{2},$

i.e., $G(y) - G(x_N) \ge \varepsilon/2$, contradicting (C.20). This finishes the proof.

REMARK. Coming full circle, we can apply $d/dx$ to (C.17) and obtain (C.12).
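
The uniform convergence in Proposition C.4 can be observed for the rescaled sums of fair $\pm 1$ steps, whose limit is the standard Gaussian (which has no atoms). The following sketch is an illustration under that assumed example; the names `gaussian_cdf` and `sup_cdf_gap` are ours.

```python
import math


def gaussian_cdf(x):
    """Distribution function G(x) of the standard Gaussian (variance 1)."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def sup_cdf_gap(k):
    """sup_x |Phi_k(x) - G(x)| for (f_1 + ... + f_k)/sqrt(k), fair +-1 steps.

    Phi_k is a step function and G is increasing, so the supremum is reached
    at a jump, using either the value just before the jump or the value at it.
    """
    gap, cumulative = 0.0, 0.0
    for heads in range(k + 1):
        x = (2 * heads - k) / math.sqrt(k)
        target = gaussian_cdf(x)
        gap = max(gap, abs(cumulative - target))     # left limit at the jump
        cumulative += math.comb(k, heads) / 2 ** k
        gap = max(gap, abs(cumulative - target))     # value at the jump
    return gap


for k in (25, 100, 400):
    print(k, sup_cdf_gap(k))
```

In this example the gap is dominated by half the size of the largest jump of the step function, which shrinks like a constant times $k^{-1/2}$.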
Convergence of Fourier transforms of probability measures 
Here we discuss some converse results to Corollary C.3. 


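To begin, let $\mu_k$ and $\mu$ be probability measures on $\mathbb{R}$. We have the Fourier
transform

(C.25) $\hat{\mu}(\xi) = \dfrac{1}{\sqrt{2\pi}}\int e^{-ix\xi}\,d\mu(x),$

with $\hat{\mu}_k(\xi)$ similarly defined. The functions $\hat{\mu}$ and $\hat{\mu}_k$ are bounded continuous
functions on $\mathbb{R}$, all equal to $(2\pi)^{-1/2}$ at $\xi = 0$. Let us assume that

(C.26) $\lim_{k\to\infty} \hat{\mu}_k(\xi) = \hat{\mu}(\xi), \quad \forall\, \xi \in \mathbb{R}.$

Now take $u \in \mathcal{S}(\mathbb{R})$. We have

(C.27) $\int \hat{u}\,d\mu_k = \int u(\xi)\,\hat{\mu}_k(\xi)\,d\xi \longrightarrow \int u(\xi)\,\hat{\mu}(\xi)\,d\xi = \int \hat{u}\,d\mu,$

the convergence holding by (C.26) and the dominated convergence theorem. Since
the Fourier transform maps $\mathcal{S}(\mathbb{R})$ isomorphically onto itself, we have

(C.28) $\int f\,d\mu_k \longrightarrow \int f\,d\mu$ as $k \to \infty$,

for all $f \in \mathcal{S}(\mathbb{R})$, hence, by denseness, for all $f \in C_*(\mathbb{R})$, and therefore for all

(C.29) $f \in C(\widehat{\mathbb{R}}).$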


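Here is a refinement of the result established above. Still assume $\mu_k$ are
probability measures on $\mathbb{R}$. Replace the assumption (C.26) by

(C.30) $\hat{\mu}_k(\xi) \longrightarrow \chi(\xi), \quad \forall\, \xi \in \mathbb{R}, \quad \chi \in C(\mathbb{R}).$

Then, in place of (C.27), we have

(C.31) $\int \hat{u}\,d\mu_k = \int u(\xi)\,\hat{\mu}_k(\xi)\,d\xi \longrightarrow \int u(\xi)\,\chi(\xi)\,d\xi,$

for each $u \in \mathcal{S}(\mathbb{R})$.
Meanwhile, passing to a subsequence, we have

(C.32) $\mu_k \longrightarrow \lambda, \quad \text{weak}^* \text{ in } \mathcal{M}(\widehat{\mathbb{R}}),$

for some probability measure $\lambda$ on $\widehat{\mathbb{R}}$. Denote the restriction of $\lambda$ to $\mathbb{R}$ by $\mu$, so
$\mu(\mathbb{R}) + \lambda(\{\infty\}) = 1$. We have

(C.33) $\mu_k \longrightarrow \mu \quad \text{in } \mathcal{S}'(\mathbb{R}),$

hence

(C.34) $\hat{\mu}_k \longrightarrow \hat{\mu} \quad \text{in } \mathcal{S}'(\mathbb{R}).$

The hypothesis (C.30) and the dominated convergence theorem then imply

(C.35) $\hat{\mu} = \chi,$

as elements of $\mathcal{S}'(\mathbb{R})$. The hypothesis that $\chi \in C(\mathbb{R})$ then yields $\mu(\mathbb{R}) = 1$,
hence $\lambda(\{\infty\}) = 0$, hence $\mu = \lambda$, and we have (C.26), hence the consequences
of that hypothesis, including (C.28)-(C.29).


Sobolev Spaces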


Introduction 


In this chapter we develop the elements of the theory of Sobolev spaces, a
tool that, together with methods of functional analysis, provides for numerous
successful attacks on the questions of existence and smoothness of solutions to
many of the basic partial differential equations. For a positive integer $k$, the
Sobolev space $H^k(\mathbb{R}^n)$ is the space of functions in $L^2(\mathbb{R}^n)$ such that, for $|\alpha| \le k$,
$D^\alpha u$, regarded a priori as a distribution, belongs to $L^2(\mathbb{R}^n)$. This space can be
characterized in terms of the Fourier transform, and such a characterization leads
to a notion of $H^s(\mathbb{R}^n)$ for all $s \in \mathbb{R}$. For $s < 0$, $H^s(\mathbb{R}^n)$ is a space of distributions.
There is an invariance under coordinate transformations, permitting an invariant
notion of $H^s(M)$ whenever $M$ is a compact manifold. We also define and study
$H^s(\Omega)$ when $\Omega$ is a compact manifold with boundary.

The tools from Sobolev space theory discussed in this chapter are of great use
in the study of linear PDE. This will be illustrated in the next two chapters, and
throughout Volume 2. Chapter 13 will develop further results in Sobolev space
theory, including $L^p$-Sobolev spaces, which will be seen to be of use in the study
of nonlinear PDE.


1. Sobolev spaces on R” 
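When $k \ge 0$ is an integer, the Sobolev space $H^k(\mathbb{R}^n)$ is defined as follows:

(1.1) $H^k(\mathbb{R}^n) = \{u \in L^2(\mathbb{R}^n) : D^\alpha u \in L^2(\mathbb{R}^n) \text{ for } |\alpha| \le k\},$

where $D^\alpha u$ is interpreted a priori as a tempered distribution. Results from Chap. 3
on Fourier analysis show that, for such $k$, if $u \in L^2(\mathbb{R}^n)$, then

(1.2) $u \in H^k(\mathbb{R}^n) \Longleftrightarrow \langle\xi\rangle^k \hat{u} \in L^2(\mathbb{R}^n).$

Recall that

(1.3) $\langle\xi\rangle = \bigl(1 + |\xi|^2\bigr)^{1/2}.$

We can produce a definition of the Sobolev space $H^s(\mathbb{R}^n)$ for general $s \in \mathbb{R}$,
parallel to (1.2), namely

(1.4) $H^s(\mathbb{R}^n) = \{u \in \mathcal{S}'(\mathbb{R}^n) : \langle\xi\rangle^s \hat{u} \in L^2(\mathbb{R}^n)\}.$

We can define the operator $\Lambda^s$ on $\mathcal{S}'(\mathbb{R}^n)$ by

(1.5) $\Lambda^s u = \mathcal{F}^{-1}\bigl(\langle\xi\rangle^s \hat{u}\bigr).$

Then (1.4) is equivalent to

(1.6) $H^s(\mathbb{R}^n) = \{u \in \mathcal{S}'(\mathbb{R}^n) : \Lambda^s u \in L^2(\mathbb{R}^n)\},$

or $H^s(\mathbb{R}^n) = \Lambda^{-s}L^2(\mathbb{R}^n)$. Each space $H^s(\mathbb{R}^n)$ is a Hilbert space, with inner
product

(1.7) $(u, v)_{H^s(\mathbb{R}^n)} = (\Lambda^s u, \Lambda^s v)_{L^2(\mathbb{R}^n)}.$

We note that the dual of $H^s(\mathbb{R}^n)$ is $H^{-s}(\mathbb{R}^n)$.
Clearly, we have

(1.8) $D_j : H^s(\mathbb{R}^n) \longrightarrow H^{s-1}(\mathbb{R}^n),$

and hence

(1.9) $D^\alpha : H^s(\mathbb{R}^n) \longrightarrow H^{s-|\alpha|}(\mathbb{R}^n).$

Furthermore, it is easy to see that, given $u \in H^s(\mathbb{R}^n)$,

(1.10) $u \in H^{s+1}(\mathbb{R}^n) \Longleftrightarrow D_j u \in H^s(\mathbb{R}^n), \quad \forall\, j.$

We can relate difference quotients to derivatives of elements of Sobolev spaces.
Define $\tau_y$, for $y \in \mathbb{R}^n$, by

(1.11) $\tau_y u(x) = u(x + y).$

By duality this extends to $\mathcal{S}'(\mathbb{R}^n)$:

$\langle \tau_{-y}u, v\rangle = \langle u, \tau_y v\rangle.$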

Note that

(1.12) $\tau_y v = \mathcal{F}^{-1}\bigl(e^{iy\cdot\xi}\,\hat{v}\bigr),$

so it is clear that $\tau_y : H^s(\mathbb{R}^n) \to H^s(\mathbb{R}^n)$ is norm-preserving for each $s \in \mathbb{R}$,
$y \in \mathbb{R}^n$. Also, for each $u \in H^s(\mathbb{R}^n)$, $\tau_y u$ is a continuous function of $y$ with
values in $H^s(\mathbb{R}^n)$. The following result is of frequent use, as we will see in the
next chapter.
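
A finite-dimensional analogue may help fix ideas. On the cyclic group $\mathbb{Z}/n$, translation multiplies each discrete Fourier coefficient by a phase of modulus one, so any weighted $\ell^2$ norm of the coefficients is unchanged. The Python sketch below illustrates this; it is only a discrete stand-in for (1.12), not a computation on $\mathbb{R}^n$, and the names `dft`, `shift`, and `weighted_norm_sq` are ours.

```python
import cmath


def dft(values):
    """Unnormalized discrete Fourier transform of a finite sequence."""
    n = len(values)
    return [
        sum(values[x] * cmath.exp(-2j * cmath.pi * x * xi / n) for x in range(n))
        for xi in range(n)
    ]


def shift(values, y):
    """Discrete translation (tau_y u)(x) = u(x + y), indices taken mod n."""
    n = len(values)
    return [values[(x + y) % n] for x in range(n)]


def weighted_norm_sq(coeffs, s):
    """Discrete stand-in for the H^s norm: sum of <xi>^{2s} |u_hat(xi)|^2."""
    n = len(coeffs)
    total = 0.0
    for xi, c in enumerate(coeffs):
        freq = xi if xi <= n // 2 else xi - n        # signed frequency
        total += (1 + freq * freq) ** s * abs(c) ** 2
    return total
```

For a sample sequence `u`, the coefficients of `shift(u, y)` equal those of `u` multiplied by $e^{2\pi i y\xi/n}$, and `weighted_norm_sq` returns the same value for both, for every real `s`.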


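Proposition 1.1. Let $(e_1, \dots, e_n)$ be the standard basis of $\mathbb{R}^n$; let $u \in H^s(\mathbb{R}^n)$.
Then

$\sigma^{-1}(\tau_{\sigma e_j}u - u)$ is bounded in $H^s(\mathbb{R}^n)$,

for $\sigma \in (0, 1]$, if and only if $D_j u \in H^s(\mathbb{R}^n)$.

Proof. We have $\sigma^{-1}(\tau_{\sigma e_j}u - u) \to iD_j u$ in $H^{s-1}(\mathbb{R}^n)$ as $\sigma \to 0$ if $u \in H^s(\mathbb{R}^n)$.
The hypothesis of boundedness implies that there is a sequence $\sigma_\nu \to 0$ such that
$\sigma_\nu^{-1}(\tau_{\sigma_\nu e_j}u - u)$ converges weakly to an element of $H^s(\mathbb{R}^n)$; call it $w$. Since
the natural inclusion $H^s(\mathbb{R}^n) \hookrightarrow H^{s-1}(\mathbb{R}^n)$ is easily seen to be continuous, it
follows that $w = iD_j u$. Since $w \in H^s(\mathbb{R}^n)$, this gives the desired conclusion.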


Corollary 1.2. Given $u \in H^s(\mathbb{R}^n)$, then $u$ belongs to $H^{s+1}(\mathbb{R}^n)$ if and only if
$\tau_y u$ is a Lipschitz-continuous function of $y$ with values in $H^s(\mathbb{R}^n)$.


Proof. This follows easily, given the observation (1.10). 


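We now show that elements of $H^s(\mathbb{R}^n)$ are smooth in the classical sense for
sufficiently large positive $s$. This is a Sobolev imbedding theorem.

Proposition 1.3. If $s > n/2$, then each $u \in H^s(\mathbb{R}^n)$ is bounded and continuous.

Proof. By the Fourier inversion formula, it suffices to prove that $\hat{u}(\xi)$ belongs to
$L^1(\mathbb{R}^n)$. Indeed, using Cauchy’s inequality, we get

(1.13) $\int |\hat{u}(\xi)|\,d\xi \le \Bigl(\int |\hat{u}(\xi)|^2 \langle\xi\rangle^{2s}\,d\xi\Bigr)^{1/2}\Bigl(\int \langle\xi\rangle^{-2s}\,d\xi\Bigr)^{1/2}.$

Since the last integral on the right is finite precisely for $s > n/2$, this completes
the proof.

Corollary 1.4. If $s > n/2 + k$, then $H^s(\mathbb{R}^n) \subset C^k(\mathbb{R}^n)$.

If $s = n/2 + \alpha$, $0 < \alpha < 1$, we can establish Hölder continuity. For $\alpha \in (0,1)$,
we say

(1.14) $u \in C^\alpha(\mathbb{R}^n) \Longleftrightarrow u$ bounded and $|u(x+y) - u(x)| \le C|y|^\alpha.$

An alternative notation is $\mathrm{Lip}^\alpha(\mathbb{R}^n)$; then the definition above is effective for
$\alpha \in (0,1]$.

Proposition 1.5. If $s = n/2 + \alpha$, $0 < \alpha < 1$, then $H^s(\mathbb{R}^n) \subset C^\alpha(\mathbb{R}^n)$.

Proof. For $u \in H^s(\mathbb{R}^n)$, use the Fourier inversion formula to write

(1.15) $|u(x+y) - u(x)| = (2\pi)^{-n/2}\Bigl|\int \hat{u}(\xi)\, e^{ix\cdot\xi}\bigl(e^{iy\cdot\xi} - 1\bigr)\,d\xi\Bigr| \le C\Bigl(\int |\hat{u}(\xi)|^2\langle\xi\rangle^{n+2\alpha}\,d\xi\Bigr)^{1/2}\Bigl(\int |e^{iy\cdot\xi} - 1|^2\langle\xi\rangle^{-n-2\alpha}\,d\xi\Bigr)^{1/2}.$

Now, if $|y| \le 1/2$, write

(1.16) $\int |e^{iy\cdot\xi} - 1|^2\langle\xi\rangle^{-n-2\alpha}\,d\xi \le C\int_{|\xi| \le 1/|y|} |y|^2|\xi|^2\langle\xi\rangle^{-n-2\alpha}\,d\xi + 4\int_{|\xi| \ge 1/|y|} \langle\xi\rangle^{-n-2\alpha}\,d\xi.$

If we use polar coordinates, the right side is readily dominated by

(1.17) $C|y|^2 + C|y|^2\,\dfrac{|y|^{2\alpha-2} - 1}{2 - 2\alpha} + C|y|^{2\alpha},$

provided $0 < \alpha < 1$. This implies that, for $|y| \le 1/2$,

(1.18) $|u(x+y) - u(x)| \le C_\alpha |y|^\alpha,$

given $u \in H^s(\mathbb{R}^n)$, $s = n/2 + \alpha$, and the proof is complete.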


We remark that if one took $\alpha = 1$, the middle term in (1.17) would be modified
to $C|y|^2\log(1/|y|)$, so when $u \in H^{n/2+1}(\mathbb{R}^n)$, one gets the estimate

$|u(x+y) - u(x)| \le C|y|\,\Bigl(\log\dfrac{1}{|y|}\Bigr)^{1/2}.$

Elements of $H^{n/2+1}(\mathbb{R}^n)$ need not be Lipschitz, and elements of $H^{n/2}(\mathbb{R}^n)$ need
not be bounded.
We indicate an example of the last phenomenon. Let us define $u$ by

(1.19) $\hat{u}(\xi) = \dfrac{\langle\xi\rangle^{-n}}{1 + \log\langle\xi\rangle}.$

It is easy to show that $u \in H^{n/2}(\mathbb{R}^n)$. But $\hat{u} \notin L^1(\mathbb{R}^n)$. Now one can show that if
$\hat{u} \in L^1_{\mathrm{loc}}(\mathbb{R}^n)$ is positive and belongs to $\mathcal{S}'(\mathbb{R}^n)$, but does not belong to $L^1(\mathbb{R}^n)$,
then $u \notin L^\infty(\mathbb{R}^n)$; and this is what happens in the case of (1.19). For more on
this, see Exercises 2 and 3 below.

A result dual to Proposition 1.3 is

(1.20) $\delta \in H^{-n/2-\varepsilon}(\mathbb{R}^n)$, for all $\varepsilon > 0$,

which follows directly from the definition (1.4) together with the fact that
$\mathcal{F}\delta = (2\pi)^{-n/2}$, by the same sort of estimate on $\int \langle\xi\rangle^{-2s}\,d\xi$ used to prove
Proposition 1.3. Consequently,

(1.21) $D^\alpha\delta \in H^{-n/2-|\alpha|-\varepsilon}(\mathbb{R}^n)$, for all $\varepsilon > 0$.

Next we consider the trace map $\tau$, defined initially from $\mathcal{S}(\mathbb{R}^n)$ to $\mathcal{S}(\mathbb{R}^{n-1})$
by $\tau u = f$, where $f(x') = u(0, x')$ if $x = (x_1, \dots, x_n)$, $x' = (x_2, \dots, x_n)$.


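Proposition 1.6. The map $\tau$ extends uniquely to a continuous linear map

(1.22) $\tau : H^s(\mathbb{R}^n) \longrightarrow H^{s-1/2}(\mathbb{R}^{n-1})$, for $s > \dfrac{1}{2}$.

Proof. If $f = \tau u$, we have

(1.23) $\hat{f}(\xi') = \dfrac{1}{\sqrt{2\pi}}\int \hat{u}(\xi)\,d\xi_1,$

as a consequence of the identity $\iint g(x_1)\, e^{-ix_1\xi_1}\,dx_1\,d\xi_1 = 2\pi g(0)$. Thus

$|\hat{f}(\xi')|^2 \le \dfrac{1}{2\pi}\Bigl(\int |\hat{u}(\xi)|^2\langle\xi\rangle^{2s}\,d\xi_1\Bigr)\Bigl(\int \langle\xi\rangle^{-2s}\,d\xi_1\Bigr),$

where the last integral is finite if $s > 1/2$. In such a case, we have

(1.24) $\int \langle\xi\rangle^{-2s}\,d\xi_1 = \int \bigl(1 + |\xi'|^2 + \xi_1^2\bigr)^{-s}\,d\xi_1 = C\bigl(1 + |\xi'|^2\bigr)^{-s+1/2} = C\langle\xi'\rangle^{-2(s-1/2)}.$

Thus

(1.25) $\langle\xi'\rangle^{2(s-1/2)}|\hat{f}(\xi')|^2 \le C\int |\hat{u}(\xi)|^2\langle\xi\rangle^{2s}\,d\xi_1,$

and integrating with respect to $\xi'$ gives

(1.26) $\|f\|_{H^{s-1/2}(\mathbb{R}^{n-1})}^2 \le C\|u\|_{H^s(\mathbb{R}^n)}^2.$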
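
The scaling identity (1.24), which drives the loss of exactly half a derivative, can be checked numerically. In the sketch below the names `xi1_integral` and `scaling_constant` are ours, the closed form $C = \sqrt{\pi}\,\Gamma(s - 1/2)/\Gamma(s)$ is the standard value of $\int (1+\xi_1^2)^{-s}\,d\xi_1$, and we restrict to $s \ge 1$ only to keep the substituted integrand bounded.

```python
import math


def xi1_integral(a, s, steps=20_000):
    """Integral over xi_1 in R of (1 + a^2 + xi_1^2)^{-s}, for s >= 1.

    Uses xi_1 = tan(theta) and the midpoint rule on (-pi/2, pi/2); the
    argument a plays the role of |xi'|.
    """
    big = 1.0 + a * a
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        theta = -math.pi / 2 + (i + 0.5) * h
        c, sn = math.cos(theta), math.sin(theta)
        total += c ** (2 * s - 2) / (big * c * c + sn * sn) ** s
    return total * h


def scaling_constant(s):
    """Value of the integral at xi' = 0: sqrt(pi) Gamma(s - 1/2) / Gamma(s)."""
    return math.sqrt(math.pi) * math.gamma(s - 0.5) / math.gamma(s)


for a in (0.0, 2.0, 5.0):
    print(a, xi1_integral(a, 1.5), scaling_constant(1.5) * (1 + a * a) ** (0.5 - 1.5))
```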
Proposition 1.6 has a converse: 


Proposition 1.7. The map (1.22) is surjective, for each $s > 1/2$.

Proof. If $g \in H^{s-1/2}(\mathbb{R}^{n-1})$, we can let

(1.27) $\hat{u}(\xi) = \hat{g}(\xi')\,\dfrac{\langle\xi'\rangle^{2(s-1/2)}}{\langle\xi\rangle^{2s}}.$

It is easy to verify that this defines an element $u \in H^s(\mathbb{R}^n)$ and $u(0, x') = cg(x')$
for a nonzero constant $c$, using (1.24) and (1.23); this provides the proof.


In the next section we will develop a tool that establishes the continuity of a 
number of natural transformations on H*(R”), as an automatic consequence of 
the (often more easily checked) continuity for integer s. This will be useful for 


the study of Sobolev spaces on compact manifolds, in §§3 and 4. 


Exercises 

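1. Show that $\mathcal{S}(\mathbb{R}^n)$ is dense in $H^s(\mathbb{R}^n)$ for each $s$.

2. Assume $v \in \mathcal{S}'(\mathbb{R}^n) \cap L^1_{\mathrm{loc}}(\mathbb{R}^n)$ and $v(\xi) \ge 0$. Show that if $\hat v \in L^\infty(\mathbb{R}^n)$, then
$v \in L^1(\mathbb{R}^n)$ and

$(2\pi)^{n/2} \|\hat v\|_{L^\infty} = \|v\|_{L^1}$.

(Hint: Consider $v_k(\xi) = \chi(\xi/k) v(\xi)$, with $\chi \in C_0^\infty(\mathbb{R}^n)$, $\chi(0) = 1$.)

3. Verify that (1.19) defines $u \in H^{n/2}(\mathbb{R}^n)$, $u \notin L^\infty(\mathbb{R}^n)$.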

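4. Show that the pairing

$\langle u, v \rangle = \displaystyle\int \hat u(\xi)\, \hat v(\xi)\, d\xi = \int \hat u(\xi) \langle\xi\rangle^{s}\, \hat v(\xi) \langle\xi\rangle^{-s}\, d\xi$

gives an isomorphism of $H^{-s}(\mathbb{R}^n)$ and the space $H^s(\mathbb{R}^n)'$, dual to $H^s(\mathbb{R}^n)$.

5. Show that the trace map (1.22) satisfies the estimate

$\|\tau u\|^2_{L^2(\mathbb{R}^{n-1})} \le C \|u\|_{L^2} \cdot \|\nabla u\|_{L^2}$,

given $u \in H^1(\mathbb{R}^n)$, where on the right $L^2$ means $L^2(\mathbb{R}^n)$.

6. Show that $H^k(\mathbb{R}^n)$ is an algebra for $k > n/2$, that is,

$u, v \in H^k(\mathbb{R}^n) \Longrightarrow uv \in H^k(\mathbb{R}^n)$.

Reconsider this problem after doing Exercise 5 in §2.

7. Let $f : \mathbb{R} \to \mathbb{R}$ be $C^\infty$, and assume $f(0) = 0$. Show that $u \mapsto f(u)$ defines a
continuous map $F : H^k(\mathbb{R}^n) \to H^k(\mathbb{R}^n)$, for $k > n/2$. Show that $F$ is a $C^1$-map,
with $DF(u)v = f'(u)v$. Show that $F$ is a $C^\infty$-map.

8. Show that a continuous map $F : H^{k+m}(\mathbb{R}^n) \to H^k(\mathbb{R}^n)$ is defined by
$F(u) = f(D^m u)$, where $D^m u = \{D^\alpha u : |\alpha| \le m\}$, assuming $f$ is smooth in
its arguments, $f = 0$ at $u = 0$, and $k > n/2$. Show that $F$ is $C^1$, and compute
$DF(u)$. Show $F$ is a $C^\infty$-map from $H^{k+m}(\mathbb{R}^n)$ to $H^k(\mathbb{R}^n)$.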

9. Suppose $P(D)$ is an elliptic differential operator of order $m$, as in Chap. 3. If $\sigma < s + m$, show that

$u \in H^\sigma(\mathbb{R}^n),\ P(D)u = f \in H^s(\mathbb{R}^n) \Longrightarrow u \in H^{s+m}(\mathbb{R}^n)$.

(Hint: Estimate $\langle\xi\rangle^{s+m} \hat u$ in terms of $\langle\xi\rangle^{\sigma} \hat u$ and $\langle\xi\rangle^{s} P(\xi) \hat u$.)
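10. Given $0 < s < 1$ and $u \in L^2(\mathbb{R}^n)$, show that

(1.28) $u \in H^s(\mathbb{R}^n) \Longleftrightarrow \displaystyle\int_0^\infty t^{-(2s+1)} \|\tau_{te_j} u - u\|^2_{L^2}\, dt < \infty$, $1 \le j \le n$,

where $\tau_y$ is as in (1.12).
(Hint: Show that the right side of (1.28) is equal to

(1.29) $\displaystyle\int_{\mathbb{R}^n} \psi_s(\xi_j) |\hat u(\xi)|^2\, d\xi$,

where, for $0 < s < 1$,

(1.30) $\psi_s(\xi_j) = 2 \displaystyle\int_0^\infty t^{-(2s+1)} (1 - \cos t\xi_j)\, dt = C_s |\xi_j|^{2s}$.)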


11. The fact that $u \in H^s(\mathbb{R}^n)$ implies that $\sigma^{-1}(\tau_{\sigma e_j} u - u) \to i D_j u$ in $H^{s-1}(\mathbb{R}^n)$ was
used in the proof of Proposition 1.1. Give a detailed proof of this. Use it to provide
details for a proof of Corollary 1.4.

12. Establish the following, as another approach to justifying Corollary 1.4.

Lemma. If $u \in C(\mathbb{R}^n)$ and $D_j u \in C(\mathbb{R}^n)$ for each $j$ ($D_j u$ regarded a priori as a
distribution), then $u \in C^1(\mathbb{R}^n)$.

(Hint: Consider $\varphi_\varepsilon * u$ for $\varphi_\varepsilon(x) = \varepsilon^{-n} \varphi(x/\varepsilon)$, $\varphi \in C_0^\infty(\mathbb{R}^n)$, $\int \varphi\, dx = 1$, and let
$\varepsilon \to 0$.)


2. The complex interpolation method 

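It is easy to see from the product rule that if $M_\varphi$ is defined by

(2.1) $M_\varphi u = \varphi(x) u(x)$,

then, for any integer $k \ge 0$,

(2.2) $M_\varphi : H^k(\mathbb{R}^n) \to H^k(\mathbb{R}^n)$,

provided $\varphi$ is $C^\infty$ and

(2.3) $D^\alpha \varphi \in L^\infty(\mathbb{R}^n)$, for all $\alpha$.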

By duality, (2.2) also holds for negative integers. We claim it holds when $k$ is
replaced by any real $s$, but it is not so simple to deduce this directly from the
definition (1.4) of $H^s(\mathbb{R}^n)$. Similarly, suppose

(2.4) $\chi : \mathbb{R}^n \to \mathbb{R}^n$

is a diffeomorphism, which is linear outside some compact set, and define $\chi^*$ on
functions by

(2.5) $\chi^* u(x) = u(\chi(x))$.

The chain rule easily gives

(2.6) $\chi^* : H^k(\mathbb{R}^n) \to H^k(\mathbb{R}^n)$,

for any integer $k \ge 0$. Since the adjoint of $\chi^*$ is $\psi^*$ composed with the operation
of multiplication by $|\det D\psi(x)|$, where $\psi = \chi^{-1}$, we see that (2.6) also holds
for negative integers $k$. Again, it is not so straightforward to deduce (2.6) when $k$
is replaced by any real number $s$. A convenient tool for proving appropriate
generalizations of (2.2) and (2.6) is provided by the complex interpolation method,
introduced by A. P. Calderón, which we now discuss.

Let $E$ and $F$ be Banach spaces. We suppose that $F$ is included in $E$, and the
inclusion $F \hookrightarrow E$ is continuous. If $\Omega$ is the vertical strip in the complex plane,

(2.7) $\Omega = \{z \in \mathbb{C} : 0 < \operatorname{Re} z < 1\}$,

we define

(2.8) $\mathcal{H}_{E,F}(\Omega) = \{u(z)$ bounded and continuous on $\bar\Omega$ with values in $E$;
holomorphic on $\Omega$ : $\|u(1 + iy)\|_F$ is bounded, for $y \in \mathbb{R}\}$.

We define the interpolation spaces $[E, F]_\theta$ by

(2.9) $[E, F]_\theta = \{u(\theta) : u \in \mathcal{H}_{E,F}(\Omega)\}$, $\theta \in [0, 1]$.

We give $[E, F]_\theta$ the Banach space topology, making it isomorphic to the quotient

(2.10) $\mathcal{H}_{E,F}(\Omega) / \{u : u(\theta) = 0\}$.

We will also use the convention

(2.11) $[F, E]_\theta = [E, F]_{1-\theta}$.


The following result is of basic importance. 


Proposition 2.1. Let $E$, $F$ be as above; suppose $\tilde E$, $\tilde F$ are Banach spaces with $\tilde F$
continuously injected in $\tilde E$. Suppose $T : E \to \tilde E$ is a continuous linear map, and
suppose $T : F \to \tilde F$. Then, for all $\theta \in [0, 1]$,

(2.12) $T : [E, F]_\theta \to [\tilde E, \tilde F]_\theta$.

Proof. Given $v \in [E, F]_\theta$, let $u \in \mathcal{H}_{E,F}(\Omega)$, $u(\theta) = v$. It follows that $Tu(z) \in
\mathcal{H}_{\tilde E, \tilde F}(\Omega)$, so $Tv = Tu(\theta) \in [\tilde E, \tilde F]_\theta$, as asserted.


We next identify $[H, D(A)]_\theta$ when $H$ is a Hilbert space and $D(A)$ is the
domain of a positive, self-adjoint operator on $H$. By the spectral theorem, this
means the following. There is a unitary map $U : H \to L^2(X, \mu)$ such that
$B = UAU^{-1}$ is a multiplication operator on $L^2(X, \mu)$:

(2.13) $Bu(x) = M_b u(x) = b(x) u(x)$.

Then $D(A) = U^{-1} D(B)$, where

$D(B) = \{u \in L^2(X, \mu) : bu \in L^2(X, \mu)\}$.

We will assume $b(x) \ge 1$, though perhaps $b$ is unbounded. (Of course, if $b$
is bounded, then $D(B) = L^2(X, \mu)$ and $D(A) = H$.) This is equivalent to
assuming $(Au, u) \ge \|u\|^2$. In such a case, we define $A^\theta$ to be $U^{-1} B^\theta U$,
where $B^\theta u(x) = b(x)^\theta u(x)$, if $\theta \ge 0$, and $D(A^\theta) = U^{-1} D(B^\theta)$, where
$D(B^\theta) = \{u \in L^2(X, \mu) : b^\theta u \in L^2(X, \mu)\}$. We will give a proof of the spectral
theorem in Chap. 8. In this chapter we will apply this notion only to operators $A$
for which such a representation is explicitly implemented by a Fourier transform.
Our characterization of interpolation spaces $[H, D(A)]_\theta$ is given as follows.


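Proposition 2.2. For $\theta \in [0, 1]$,

(2.14) $[H, D(A)]_\theta = D(A^\theta)$.

Proof. First suppose $v \in D(A^\theta)$. We want to write $v = u(\theta)$, for some
$u \in \mathcal{H}_{H, D(A)}(\Omega)$. Let

$u(z) = A^{-z+\theta} v$.

Then $u(\theta) = v$, $u$ is bounded with values in $H$, and furthermore $u(1 + iy) =
A^{-1} A^{-iy} (A^\theta v)$ is bounded in $D(A)$.

Conversely, suppose $u(z) \in \mathcal{H}_{H, D(A)}(\Omega)$. We need to prove that $u(\theta) \in D(A^\theta)$.
Let $\varepsilon > 0$, and note that, by the maximum principle,

(2.15) $\|A^z (I + i\varepsilon A)^{-1} u(z)\|_H \le \sup_{y \in \mathbb{R}} \max \bigl\{ \|(I + i\varepsilon A)^{-1} A^{iy} u(iy)\|_H,\ \|A^{1+iy} (I + i\varepsilon A)^{-1} u(1 + iy)\|_H \bigr\} \le C$,

with $C$ independent of $\varepsilon$. This implies $u(\theta) \in D(A^\theta)$, as desired.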


Now the definition of the Sobolev spaces $H^s(\mathbb{R}^n)$ given in §1 makes it clear
that, for $s \ge 0$, $H^s(\mathbb{R}^n) = D(\Lambda^s)$, where $\Lambda^s$ is the self-adjoint operator on
$L^2(\mathbb{R}^n)$ defined by

(2.16) $\Lambda^s = \mathcal{F}^{-1} M_{\langle\xi\rangle^s} \mathcal{F}$,

where $\mathcal{F}$ is the Fourier transform. Thus it follows that, for $k \ge 0$,

(2.17) $[L^2(\mathbb{R}^n), H^k(\mathbb{R}^n)]_\theta = H^{k\theta}(\mathbb{R}^n)$, $\theta \in [0, 1]$.

In fact, the same sort of reasoning applies more generally. For any $\sigma, s \in \mathbb{R}$,

(2.18) $[H^\sigma(\mathbb{R}^n), H^s(\mathbb{R}^n)]_\theta = H^{\theta s + (1-\theta)\sigma}(\mathbb{R}^n)$, $\theta \in [0, 1]$.

Consequently Proposition 2.1 is applicable to (2.2) and (2.6), to give

(2.19) $M_\varphi : H^s(\mathbb{R}^n) \to H^s(\mathbb{R}^n)$

and

(2.20) $\chi^* : H^s(\mathbb{R}^n) \to H^s(\mathbb{R}^n)$,

for all $s \in \mathbb{R}$.


It is often convenient to have a definition of $[E, F]_\theta$ when neither Banach space
$E$ nor $F$ is contained in the other. Suppose they are both continuously injected into
a locally convex topological vector space $V$. Then $G = \{e + f : e \in E, f \in F\}$
has a natural structure of a Banach space, with norm

$\|a\|_G = \inf \{\|e\|_E + \|f\|_F : a = e + f \text{ in } V,\ e \in E,\ f \in F\}$.

In fact, $G$ is naturally isomorphic to the quotient $(E \oplus F)/L$ of the Banach space
$E \oplus F$, with the product norm, by the closed linear subspace $L = \{(e, -e) : e \in
E \cap F \subset V\}$. Generalizing (2.8), we set

(2.21) $\mathcal{H}_{E,F}(\Omega) = \{u(z)$ bounded and continuous in $\bar\Omega$ with values in $G$;
holomorphic in $\Omega$ : $\|u(iy)\|_E$ and $\|u(1 + iy)\|_F$ bounded, $y \in \mathbb{R}\}$,

where $\Omega$ is the vertical strip (2.7). Then we define the interpolation space $[E, F]_\theta$
by (2.9), as before. In this context, the identity (2.11) is a (simple) proposition
rather than a definition.

Typical cases where it is of interest to apply such a construction include $E =
L^{p_1}(X, \mu)$, $F = L^{p_2}(X, \mu)$. If $(X, \mu)$ is a measure space that is neither finite nor
atomic (e.g., $\mathbb{R}^n$ with Lebesgue measure), typically neither of these $L^p$-spaces is
contained in the other. We have the following useful result.


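Proposition 2.3. Take $\theta \in (0, 1)$, $p_1 \in [1, \infty)$, $p_2 \ge 1$. Assume either $\mu(X) <
\infty$ or $p_2 < \infty$. Then

(2.22) $[L^{p_1}(X, \mu), L^{p_2}(X, \mu)]_\theta = L^q(X, \mu)$,

where $p_1$, $p_2$, and $q$ are related by

(2.23) $\dfrac{1}{q} = \dfrac{1-\theta}{p_1} + \dfrac{\theta}{p_2}$.

Proof. Given $f \in L^q$, one can take $c = (q - p_1)/p_1\theta = (p_2 - q)/p_2(1 - \theta)$ and
define

(2.24) $u(z) = |f(x)|^{c(\theta - z)} f(x)$,

by convention zero when $f(x) = 0$. Then $u$ belongs to $\mathcal{H}_{L^{p_1}, L^{p_2}}$, which gives
$L^q \subset [L^{p_1}, L^{p_2}]_\theta$.

Conversely, suppose that one is given $f \in [L^{p_1}, L^{p_2}]_\theta$; say $f = u(\theta)$ with
$u \in \mathcal{H}_{L^{p_1}, L^{p_2}}(\Omega)$. For $g \in L^{q'}$, one can define $v(z) = |g(x)|^{b(\theta - z)} g(x)$ with
$b = (q' - p_1')/p_1'\theta = (p_2' - q')/p_2'(1 - \theta)$, chosen so that $v \in \mathcal{H}_{L^{p_1'}, L^{p_2'}}(\Omega)$.
Then the Hadamard three-lines lemma, applied to $\langle u(z), v(z) \rangle$, implies

(2.25) $|\langle f, g \rangle| \le \Bigl( \sup_{y \in \mathbb{R}} |\langle u(iy), v(iy) \rangle| \Bigr)^{1-\theta} \Bigl( \sup_{y \in \mathbb{R}} |\langle u(1 + iy), v(1 + iy) \rangle| \Bigr)^{\theta}$,

for each simple function $g$. This implies

(2.26) $\Bigl| \displaystyle\int f(x) g(x)\, d\mu(x) \Bigr| \le C\, \bigl\| |g|^{b\theta + 1} \bigr\|^{1-\theta}_{L^{p_1'}} \cdot \bigl\| |g|^{b(\theta - 1) + 1} \bigr\|^{\theta}_{L^{p_2'}} = C \|g\|_{L^{q'}}$,

the last identity holding by (2.23) and the identities $b\theta + 1 = q'/p_1'$ and
$b(\theta - 1) + 1 = q'/p_2'$. This implies $f \in L^q$.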
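The exponent bookkeeping in this proof is easy to get wrong, so here is a small exact-arithmetic sketch of it. The helper names `interpolated_exponent` and `exponent_c` are illustrative only; the sample values $p_1 = 2$, $p_2 = 6$, $\theta = 1/4$ are an arbitrary choice. The check confirms that the two expressions for $c$ agree and that $|u(iy)| = |f|^{q/p_1}$, $|u(1+iy)| = |f|^{q/p_2}$, which is what places $u$ in $\mathcal{H}_{L^{p_1}, L^{p_2}}(\Omega)$.

```python
from fractions import Fraction


def interpolated_exponent(p1, p2, theta):
    """Return q with 1/q = (1 - theta)/p1 + theta/p2, as in (2.23)."""
    return 1 / ((1 - theta) / p1 + theta / p2)


def exponent_c(p1, q, theta):
    """The constant c = (q - p1)/(p1 * theta) used in (2.24)."""
    return (q - p1) / (p1 * theta)


# Arbitrary sample data (an assumption for illustration).
p1, p2, theta = Fraction(2), Fraction(6), Fraction(1, 4)
q = interpolated_exponent(p1, p2, theta)
c = exponent_c(p1, q, theta)

# Second expression for c given in the proof.
c_alt = (p2 - q) / (p2 * (1 - theta))

# Powers of |f| appearing in |u(iy)| and |u(1 + iy)|.
power_on_left_edge = c * theta + 1
power_on_right_edge = c * (theta - 1) + 1

if __name__ == "__main__":
    print("q =", q, " c =", c)
    print(power_on_left_edge == q / p1, power_on_right_edge == q / p2)
```

The same computation with $p_j'$ and $q'$ in place of $p_j$ and $q$ gives the identities $b\theta + 1 = q'/p_1'$ and $b(\theta - 1) + 1 = q'/p_2'$ used for (2.26).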


If $\mu(X) = \infty$ and $p_2 = \infty$, then (2.24) need not yield an element of $\mathcal{H}_{L^{p_1}, L^{p_2}}$,
but the argument involving (2.25)–(2.26) still works, to give

$[L^{p_1}(X, \mu), L^\infty(X, \mu)]_\theta \subset L^q(X, \mu)$, $q = \dfrac{p_1}{1 - \theta}$.


We record a couple of consequences of Proposition 2.3 and the remark following
it, together with Proposition 2.1. Recall that the Fourier transform has the
following mapping properties:

$\mathcal{F} : L^1(\mathbb{R}^n) \to L^\infty(\mathbb{R}^n)$; $\mathcal{F} : L^2(\mathbb{R}^n) \to L^2(\mathbb{R}^n)$.

Thus interpolation yields

(2.27) $\mathcal{F} : L^p(\mathbb{R}^n) \to L^{p'}(\mathbb{R}^n)$, for $p \in [1, 2]$,

where $p'$ is defined by $1/p + 1/p' = 1$. Also, for the convolution product $f * g$,
we clearly have

$L^p * L^1 \subset L^p$, $L^p * L^{p'} \subset L^\infty$.

Fixing $f \in L^p$ and interpolating between $L^1$ and $L^{p'}$ give

(2.28) $L^p * L^q \subset L^r$, for $q \in [1, p']$, $\dfrac{1}{r} = \dfrac{1}{p} + \dfrac{1}{q} - 1$.

We return to Hilbert spaces, and an interpolation result that is more general
than Proposition 2.2, in that it involves $D(A)$ for not necessarily self-adjoint $A$.

Proposition 2.4. Let $P^t$ be a uniformly bounded, strongly continuous semigroup
on a Hilbert space $H_0$, whose generator $A$ has domain $D(A) = H_1$. Let $f \in H_0$,
$0 < \theta < 1$. Then the following are equivalent:

(2.29) $f \in [H_0, H_1]_\theta$;

for some $u$,

(2.30) $f = u(0)$, $t^{1/2-\theta} u \in L^2(\mathbb{R}^+, H_1)$, $t^{1/2-\theta} \dfrac{du}{dt} \in L^2(\mathbb{R}^+, H_0)$;

(2.31) $\displaystyle\int_0^\infty t^{-(2\theta+1)} \|P^t f - f\|^2_{H_0}\, dt < \infty$.

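Proof. First suppose (2.30) holds; then $u'(t) - Au(t) = g(t)$ satisfies $t^{1/2-\theta} g \in
L^2(\mathbb{R}^+, H_0)$. Now, $u(t) = P^t f + \int_0^t P^{t-s} g(s)\, ds$, by Duhamel's principle, so

(2.32) $P^t f - f = \bigl( u(t) - f \bigr) - \displaystyle\int_0^t P^{t-s} g(s)\, ds$,

and hence

(2.33) $\|t^{-1}(P^t f - f)\|_{H_0} \le \dfrac{1}{t} \displaystyle\int_0^t \|u'(s)\|_{H_0}\, ds + \dfrac{C}{t} \int_0^t \|g(s)\|_{H_0}\, ds$.

This implies (2.31), via the elementary inequality (see Exercise 4 below)

(2.34) $\|\Phi h\|_{L^2(\mathbb{R}^+, t^\beta dt)} \le K \|h\|_{L^2(\mathbb{R}^+, t^\beta dt)}$, $\beta < 1$, with $\Phi h(t) = \dfrac{1}{t} \displaystyle\int_0^t h(s)\, ds$,

where we set $\beta = 1 - 2\theta$ and take $h(t) = \|u'(t)\|_{H_0}$, or $h(t) = \|g(t)\|_{H_0}$.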
Next we show that (2.31) $\Rightarrow$ (2.30). If $f$ satisfies (2.31), set

(2.35) $u(t) = \dfrac{\varphi(t)}{t} \displaystyle\int_0^t P^s f\, ds$,

where $\varphi \in C_0^\infty(\mathbb{R})$ and $\varphi(0) = 1$. Then $u(0) = f$. We need to show that

(2.36) $t^{1/2-\theta} Au \in L^2(\mathbb{R}^+, H_0)$ and $t^{1/2-\theta} u' \in L^2(\mathbb{R}^+, H_0)$.

Now, $t^{1/2-\theta} Au = \varphi(t)\, t^{-1/2-\theta} (P^t f - f)$, so the first part of (2.36) follows
directly from (2.31). The second part of (2.36) will be proved once we show that
$t^{1/2-\theta} v' \in L^2(\mathbb{R}^+, H_0)$, where

(2.37) $v(t) = \dfrac{1}{t} \displaystyle\int_0^t P^s f\, ds$.

Now

(2.38) $v'(t) = \dfrac{1}{t} (P^t f - f) - \dfrac{1}{t^2} \displaystyle\int_0^t (P^s f - f)\, ds$,

and since the first term on the right has been controlled, it suffices to show that

(2.39) $w(t) = t^{1/2-\theta-2} \displaystyle\int_0^t (P^s f - f)\, ds \in L^2(\mathbb{R}^+, H_0)$.

Indeed, since $s \le t$ in the integrand,

(2.40) $\|w(t)\|_{H_0} \le \dfrac{t^{1/2-\theta}}{t} \displaystyle\int_0^t h(s)\, ds$, $\quad h(t) = t^{-1} \|P^t f - f\|_{H_0} \in L^2(\mathbb{R}^+, t^\beta dt)$,

so (2.39) follows from (2.34).

We now tackle the equivalence (2.29) $\Leftrightarrow$ (2.31). Since we have (2.30) $\Leftrightarrow$ (2.31)
and (2.30) is independent of the choice of $P^t$, it suffices to show that (2.29) $\Leftrightarrow$
(2.31) for a single choice of $P^t$ such that $D(A) = H_1$. Now, we can pick a
positive self-adjoint operator $B$ such that $D(B) = H_1$ (see Exercise 2 below),
and take $A = iB$, so $P^t = e^{itB}$ is a unitary group. In such a case, the spectral
decomposition yields the identity

(2.41) $\|B^\theta f\|^2_{H_0} = C_\theta \displaystyle\int_0^\infty t^{-(2\theta+1)} \|e^{itB} f - f\|^2_{H_0}\, dt$;

compare (1.28)–(1.30); and the proof is easily completed.

Exercises


1. Show that the class of interpolation spaces $[E, F]_\theta$ defined in (2.9) and (2.15) is
unchanged if one replaces various norm bounds $\|u(x + iy)\|$ by bounds on
$e^{-K|y|} \|u(x + iy)\|$.

In Exercises 2 and 3, let $H_0 = E$ and $H_1 = F$ be two Hilbert spaces satisfying the
hypotheses of Proposition 2.1. Assume $H_1$ is dense in $H_0$.

2. Show that there is a positive self-adjoint operator $A$ on $H_0$ such that $D(A) = H_1$.
(Hint: Use the Friedrichs method.)

3. Let $H_\theta = [H_0, H_1]_\theta$, $0 < \theta < 1$. Show that if $0 < r < s < 1$, then

$[H_r, H_s]_\theta = H_{(1-\theta)r + \theta s}$, $0 < \theta < 1$.

Relate this to (2.18).

4. Prove the estimate (2.34). (Hint: Make the change of variable $e^{(\beta+1)\tau/2} h(e^\tau) = \tilde h(\tau)$,
and convert $\Phi$ into a convolution operator on $L^2(\mathbb{R})$.)

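5. Show that, for $0 \le s < n/2$,

(2.42) $H^s(\mathbb{R}^n) \subset L^p(\mathbb{R}^n)$, $\forall\, p \in \Bigl[ 2, \dfrac{2n}{n - 2s} \Bigr)$.

(Hint: Use interpolation.)
Use (2.42) to estimate $(D^\alpha u)(D^\beta v)$, given $u, v \in H^k(\mathbb{R}^n)$, $k > n/2$, $|\alpha| + |\beta| \le k$.
Sharper and more general results will be obtained in Chap. 13.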


3. Sobolev spaces on compact manifolds 


Let $M$ be a compact manifold. If $u \in \mathcal{D}'(M)$, we say $u \in H^s(M)$ provided that,
on any coordinate patch $U \subset M$, any $\psi \in C_0^\infty(U)$, the element $\psi u \in \mathcal{E}'(U)$
belongs to $H^s(U)$, if $U$ is identified with its image in $\mathbb{R}^n$. By the invariance under
coordinate changes derived in §2, it suffices to work with any single coordinate
cover of $M$. If $s = k$, a nonnegative integer, then $H^k(M)$ is equal to the set of
$u \in L^2(M)$ such that, for any $\ell$ smooth vector fields $X_1, \dots, X_\ell$ on $M$, $\ell \le k$,
$X_1 \cdots X_\ell u \in L^2(M)$. Parallel to (2.17), we have the following result.


Proposition 3.1. For $k \ge 0$ an integer, $\theta \in [0, 1]$,

(3.1) $[L^2(M), H^k(M)]_\theta = H^{k\theta}(M)$.

More generally, for any $\sigma, s \in \mathbb{R}$,

(3.2) $[H^\sigma(M), H^s(M)]_\theta = H^{\theta s + (1-\theta)\sigma}(M)$.

Proof. These results follow directly from (2.17) and (2.18), with the aid of a
partition of unity on $M$ subordinate to a coordinate cover. We leave the details as
an exercise.


Similarly, the duality of $H^s(\mathbb{R}^n)$ and $H^{-s}(\mathbb{R}^n)$ can easily be used to establish:

Proposition 3.2. If $M$ is a compact Riemannian manifold, $s \in \mathbb{R}$, there is a
natural isomorphism

(3.3) $H^s(M)^* \approx H^{-s}(M)$.

Furthermore, Propositions 1.3–1.5 easily yield:

Proposition 3.3. If $M$ is a smooth compact manifold of dimension $n$, and
$u \in H^s(M)$, then

(3.4) $u \in C(M)$ provided $s > \dfrac{n}{2}$;
(3.5) $u \in C^k(M)$ provided $s > \dfrac{n}{2} + k$;
(3.6) $u \in C^\alpha(M)$ provided $s = \dfrac{n}{2} + \alpha$, $\alpha \in (0, 1)$.

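In the case $M = \mathbb{T}^n$, the torus, we know from results on Fourier series given
in Chap. 3 that, for $k \ge 0$ an integer,

(3.7) $u \in H^k(\mathbb{T}^n) \Longleftrightarrow \displaystyle\sum_{m \in \mathbb{Z}^n} |\hat u(m)|^2 \langle m \rangle^{2k} < \infty$.

By duality, this also holds for $k$ a negative integer. Now interpolation, via
Proposition 2.2, implies that, for any $s \in \mathbb{R}$,

(3.8) $u \in H^s(\mathbb{T}^n) \Longleftrightarrow \displaystyle\sum_{m \in \mathbb{Z}^n} |\hat u(m)|^2 \langle m \rangle^{2s} < \infty$.

Alternatively, if we define $\Lambda^s$ on $\mathcal{D}'(\mathbb{T}^n)$ by

(3.9) $\Lambda^s u = \displaystyle\sum_{m \in \mathbb{Z}^n} \langle m \rangle^s\, \hat u(m)\, e^{i m \cdot \theta}$,

then, for $s \in \mathbb{R}$,

(3.10) $H^s(\mathbb{T}^n) = \Lambda^{-s} L^2(\mathbb{T}^n)$.

Thus, for any $s, \sigma \in \mathbb{R}$,

(3.11) $\Lambda^s : H^\sigma(\mathbb{T}^n) \to H^{\sigma - s}(\mathbb{T}^n)$

is an isomorphism.
It is clear from (3.9) that, for any $\sigma > 0$,

$\Lambda^{-\sigma} : H^s(\mathbb{T}^n) \to H^s(\mathbb{T}^n)$

is a norm limit of finite rank operators, hence compact. Consequently, if $j$ denotes
the natural injection, we have, for any $s \in \mathbb{R}$,

(3.12) $j : H^{s+\sigma}(\mathbb{T}^n) \to H^s(\mathbb{T}^n)$ compact, $\forall\, \sigma > 0$.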
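The claim that $\Lambda^{-\sigma}$ is a norm limit of finite rank operators can be made quantitative. A minimal sketch for $n = 1$, with illustrative names: since $\Lambda^{-\sigma}$ acts diagonally on the exponentials $e^{im\theta}$, which are mutually orthogonal in every $H^s(\mathbb{T}^1)$, the operator norm of the difference between $\Lambda^{-\sigma}$ and its truncation to frequencies $|m| \le N$ is the supremum of $\langle m \rangle^{-\sigma}$ over $|m| > N$. The code evaluates this supremum over a finite window of frequencies (the window width is an assumption of the sketch; the supremum is attained at $|m| = N + 1$ because $\langle m \rangle^{-\sigma}$ decreases in $|m|$).

```python
import math


def bracket(m):
    """Japanese bracket <m> = (1 + m^2)^(1/2) for an integer frequency m (case n = 1)."""
    return math.sqrt(1.0 + m * m)


def tail_norm(sigma, cutoff, search_width=1000):
    """Largest multiplier <m>^(-sigma) over cutoff < |m| <= cutoff + search_width.

    This equals the H^s -> H^s operator norm of Lambda^{-sigma} minus its
    truncation to the frequencies |m| <= cutoff.
    """
    return max(
        bracket(m) ** (-sigma)
        for m in range(cutoff + 1, cutoff + search_width + 1)
    )


if __name__ == "__main__":
    for cutoff in (1, 10, 100, 1000):
        print(cutoff, tail_norm(0.5, cutoff))
```

The printed values decrease to zero as the cutoff grows, for each fixed $\sigma > 0$, which is the norm convergence used above; for $\sigma = 0$ the tail norm stays equal to 1, consistent with the identity map not being compact.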


This is a special case of the following result. 


Proposition 3.4. For any compact $M$, $s \in \mathbb{R}$,

(3.13) $j : H^{s+\sigma}(M) \to H^s(M)$ is compact, $\forall\, \sigma > 0$.

Proof. This follows easily from (3.12), by using a partition of unity to break up
an element of $H^{s+\sigma}(M)$ and transfer it to a finite set of elements of $H^{s+\sigma}(\mathbb{T}^n)$,
if $n = \dim M$.

This result is a special case of a theorem of Rellich, which also deals with 
manifolds with boundary, and will be treated in the next section. Rellich’s theorem 
will play a fundamental role in Chap. 5. 

We next mention the following observation, an immediate consequence of 
(3.8) and Cauchy’s inequality, which provides a refinement of Proposition 1.3 
of Chap. 3. 


Proposition 3.5. If $u \in H^s(\mathbb{T}^n)$, then the Fourier series of $u$ is absolutely
convergent, provided $s > n/2$.


Exercises 


1. Fill in the details in the proofs of Propositions 3.1–3.4.
2. Show that $C^\infty(M)$ is dense in each $H^s(M)$, when $M$ is a compact manifold.
3. Consider the projection $P$ defined by

$Pf(\theta) = \displaystyle\sum_{n=0}^\infty \hat f(n)\, e^{in\theta}$.

Show that $P : H^s(S^1) \to H^s(S^1)$, for all $s \in \mathbb{R}$.
4. Let $a \in C^\infty(S^1)$, and define $M_a$ by $M_a f(\theta) = a(\theta) f(\theta)$. Thus $M_a : H^s(S^1) \to
H^s(S^1)$. Consider the commutator $[P, M_a] = P M_a - M_a P$. Show that

$[P, M_a] f = \displaystyle\sum_{k \ge 0,\, m > 0} \hat a(k + m) \hat f(-m)\, e^{ik\theta} - \sum_{k > 0,\, m \ge 0} \hat a(-k - m) \hat f(m)\, e^{-ik\theta}$,

and deduce that, for all $s \in \mathbb{R}$,

$[P, M_a] : H^s(S^1) \to C^\infty(S^1)$.

(Hint: The Fourier coefficients $(\hat a(n))$ form a rapidly decreasing sequence.)

5. Let $a_j, b_j \in C^\infty(S^1)$, and consider $T_j = M_{a_j} P + M_{b_j} (I - P)$. Show that

$T_1 T_2 = M_{a_1 a_2} P + M_{b_1 b_2} (I - P) + R$,

where, for each $s \in \mathbb{R}$, $R : H^s(S^1) \to C^\infty(S^1)$.
6. Suppose $a, b \in C^\infty(S^1)$ are both nowhere vanishing. Let

$T = M_a P + M_b (I - P)$, $\quad S = M_{a^{-1}} P + M_{b^{-1}} (I - P)$.

Show that $ST = I + R_1$, and $TS = I + R_2$, where $R_j : H^s(S^1) \to C^\infty(S^1)$, for all
$s \in \mathbb{R}$. Deduce that, for each $s \in \mathbb{R}$,

$T : H^s(S^1) \to H^s(S^1)$ is Fredholm.

Remark: The theory of Fredholm operators is discussed in §7 of Appendix A, Functional
Analysis.

7. Let $e_k(\theta) = e^{ik\theta}$. Describe explicitly the kernel and range of

$T_{k\ell} = M_{e_k} P + M_{e_\ell} (I - P)$.

Hence compute the index of $T_{k\ell}$. Using this, if $a$ and $b$ are nowhere-vanishing,
complex-valued smooth functions on $S^1$, compute the index of $T = M_a P +
M_b (I - P)$, in terms of the winding numbers of $a$ and $b$. (Hint: If $a$ and $b$ are
homotopic to $e_k$ and $e_\ell$, respectively, as maps from $S^1$ to $\mathbb{C} \setminus 0$, then $T$ and $T_{k\ell}$ have
the same index.)


4. Sobolev spaces on bounded domains

Let $\bar\Omega$ be a smooth, compact manifold with boundary $\partial\Omega$ and interior $\Omega$. Our goal
is to describe Sobolev spaces $H^s(\Omega)$. In preparation for this, we will consider
Sobolev spaces $H^s(\mathbb{R}^n_+)$, where $\mathbb{R}^n_+$ is the half-space

$\mathbb{R}^n_+ = \{x \in \mathbb{R}^n : x_1 > 0\}$,

with closure $\overline{\mathbb{R}^n_+}$. For $k \ge 0$ an integer, we want

(4.1) $H^k(\mathbb{R}^n_+) = \{u \in L^2(\mathbb{R}^n_+) : D^\alpha u \in L^2(\mathbb{R}^n_+) \text{ for } |\alpha| \le k\}$.

Here, $D^\alpha u$ is regarded a priori as a distribution on the interior $\mathbb{R}^n_+$. The space
$H^k(\mathbb{R}^n_+)$ defined above has a natural Hilbert space structure. It is not hard to show
that the space $\mathcal{S}(\overline{\mathbb{R}^n_+})$ of restrictions to $\mathbb{R}^n_+$ of elements of $\mathcal{S}(\mathbb{R}^n)$ is dense in
$H^k(\mathbb{R}^n_+)$, from the fact that, if $\tau_s u(x) = u(x_1 + s, x_2, \dots, x_n)$, then $\tau_s u \to u$ in
$H^k(\mathbb{R}^n_+)$ as $s \searrow 0$, if $u \in H^k(\mathbb{R}^n_+)$. Now, we claim that each $u \in H^k(\mathbb{R}^n_+)$ is the
restriction to $\mathbb{R}^n_+$ of an element of $H^k(\mathbb{R}^n)$. To see this, fix an integer $N$, and let

(4.2) $Eu(x) = \begin{cases} u(x), & \text{for } x_1 \ge 0, \\ \displaystyle\sum_{j=1}^N a_j\, u(-j x_1, x'), & \text{for } x_1 < 0, \end{cases}$

defined a priori for $u \in \mathcal{S}(\overline{\mathbb{R}^n_+})$. We have the following.


Lemma 4.1. One can pick $\{a_1, \dots, a_N\}$ such that the map $E$ has a unique
continuous extension to

(4.3) $E : H^k(\mathbb{R}^n_+) \to H^k(\mathbb{R}^n)$, for $k \le N - 1$.

Proof. Given $u \in \mathcal{S}(\overline{\mathbb{R}^n_+})$, we get an $H^k$-estimate on $Eu$ provided all the derivatives
of $Eu$ of order $\le N - 1$ match up at $x_1 = 0$, that is, provided

(4.4) $\displaystyle\sum_{j=1}^N (-j)^\ell a_j = 1$, for $\ell = 0, 1, \dots, N - 1$.

The system (4.4) is a linear system of $N$ equations for the $N$ quantities $a_j$; its
determinant is a Vandermonde determinant that is seen to be nonzero, so appropriate
$a_j$ can be found.
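For small $N$ the coefficients $a_j$ can be computed explicitly. The following sketch solves the system (4.4) in exact rational arithmetic by Gauss-Jordan elimination; the function name `extension_coefficients` is illustrative, and the elimination routine is a generic one chosen for self-containment, not a method taken from the text.

```python
from fractions import Fraction


def extension_coefficients(N):
    """Solve sum_{j=1}^N (-j)^l a_j = 1 for l = 0, ..., N-1, exactly."""
    # Augmented matrix: row l holds (-1)^l, (-2)^l, ..., (-N)^l | 1.
    rows = [
        [Fraction((-j) ** l) for j in range(1, N + 1)] + [Fraction(1)]
        for l in range(N)
    ]
    for col in range(N):
        # A nonzero pivot exists because the Vandermonde determinant is nonzero.
        pivot = next(r for r in range(col, N) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        pivot_value = rows[col][col]
        rows[col] = [entry / pivot_value for entry in rows[col]]
        for r in range(N):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    return [rows[j][N] for j in range(N)]


def matching_conditions(coefficients):
    """Left sides of (4.4) for l = 0, ..., N-1; each should equal 1."""
    N = len(coefficients)
    return [
        sum((-j) ** l * a for j, a in zip(range(1, N + 1), coefficients))
        for l in range(N)
    ]


if __name__ == "__main__":
    for N in (1, 2, 3, 4):
        print(N, extension_coefficients(N))
```

For $N = 2$ this gives $a_1 = 3$, $a_2 = -2$, so $Eu(x) = 3u(-x_1, x') - 2u(-2x_1, x')$ for $x_1 < 0$, which matches $u$ and its first $x_1$-derivative at $x_1 = 0$.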
Corollary 4.2. The restriction map

(4.5) $\rho : H^k(\mathbb{R}^n) \to H^k(\mathbb{R}^n_+)$

is surjective.

Indeed, this follows from

(4.6) $\rho E = I$ on $H^k(\mathbb{R}^n_+)$.

Suppose $s \ge 0$. We can define $H^s(\mathbb{R}^n_+)$ by interpolation:

(4.7) $H^s(\mathbb{R}^n_+) = [L^2(\mathbb{R}^n_+), H^k(\mathbb{R}^n_+)]_\theta$, $k \ge s$, $s = \theta k$.

We can show that (4.7) is independent of the choice of an integer $k \ge s$. Indeed,
interpolation from (4.3) gives

(4.8) $E : H^s(\mathbb{R}^n_+) \to H^s(\mathbb{R}^n)$;

interpolation of (4.5) gives

(4.9) $\rho : H^s(\mathbb{R}^n) \to H^s(\mathbb{R}^n_+)$;

and we have

(4.10) $\rho E = I$ on $H^s(\mathbb{R}^n_+)$.

This gives

(4.11) $H^s(\mathbb{R}^n_+) \approx H^s(\mathbb{R}^n) / \{u \in H^s(\mathbb{R}^n) : u|_{\mathbb{R}^n_+} = 0\}$,

for $s \ge 0$, a characterization that is manifestly independent of the choice of $k \ge s$
in (4.7).

Now let $\bar\Omega$ be a smooth, compact manifold with smooth boundary. We can suppose
that $\bar\Omega$ is imbedded as a submanifold of a compact (boundaryless) manifold
$M$ of the same dimension. If $\bar\Omega \subset \mathbb{R}^n$, $n = \dim \Omega$, one can arrange this by putting
$\bar\Omega$ in a large box and identifying opposite sides to get $\bar\Omega \subset \mathbb{T}^n$. In the general case,
one can construct the "double" of $\bar\Omega$, as follows. Using a vector field $X$ on $\partial\Omega$
that points into $\Omega$ at each point, that is, $X$ is nowhere vanishing on $\partial\Omega$ and in fact
nowhere tangent to $\partial\Omega$, we can extend $X$ to a vector field on a neighborhood of
$\partial\Omega$ in $\bar\Omega$, and using its integral curves construct a neighborhood of $\partial\Omega$ in $\bar\Omega$
diffeomorphic to $[0, 1) \times \partial\Omega$, a so-called "collar neighborhood" of $\partial\Omega$. Using this,
one can glue together two copies of $\bar\Omega$ along $\partial\Omega$ in such a fashion as to produce a
smooth, compact $M$ as desired.

If $k \ge 0$ is an integer, we define $H^k(\Omega)$ to consist of all $u \in L^2(\Omega)$ such
that $Pu \in L^2(\Omega)$ for all differential operators $P$ of order $\le k$ with coefficients
in $C^\infty(\bar\Omega)$. We use $\Omega$ to denote $\bar\Omega \setminus \partial\Omega$. Similar to the case of $\mathbb{R}^n_+$, one shows
that $C^\infty(\bar\Omega)$ is dense in $H^k(\Omega)$. By covering a neighborhood of $\partial\Omega \subset M$ with
coordinate patches and locally using the extension operator $E$ from above, we get,
for each finite $N$, an extension operator

(4.12) $E : H^k(\Omega) \to H^k(M)$, $0 \le k \le N - 1$.

If, for real $s \ge 0$, we define $H^s(\Omega)$ by

(4.13) $H^s(\Omega) = [L^2(\Omega), H^k(\Omega)]_\theta$, $k \ge s$, $s = \theta k$,

we see that

(4.14) $E : H^s(\Omega) \to H^s(M)$,

so the restriction $\rho : H^s(M) \to H^s(\Omega)$ is onto, and

(4.15) $H^s(\Omega) \approx H^s(M) / \{u \in H^s(M) : u|_\Omega = 0\}$,

which shows that (4.13) is independent of the choice of $k \ge s$.


The characterization (4.15) can be used to define $H^s(\Omega)$ when $s$ is a negative
real number. In that case, one wants to show that the space $H^s(\Omega)$ so defined is
independent of the inclusion $\Omega \subset M$. We will take care of this point in the next
section.

The existence of the extension map (4.14) allows us to draw the following
immediate consequence from Proposition 3.3.

Proposition 4.3. If $\dim \Omega = n$ and $u \in H^s(\Omega)$, then

$u \in C(\bar\Omega)$ provided $s > \dfrac{n}{2}$;

$u \in C^k(\bar\Omega)$ provided $s > \dfrac{n}{2} + k$;

$u \in C^\alpha(\bar\Omega)$ provided $s = \dfrac{n}{2} + \alpha$, $\alpha \in (0, 1)$.


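We now extend Proposition 3.4, obtaining the full version of Rellich's theorem.

Proposition 4.4. For any $s \ge 0$, $\sigma > 0$, the natural inclusion

(4.16) $j : H^{s+\sigma}(\Omega) \to H^s(\Omega)$ is compact.

Proof. Using $E$ and $\rho$, we can factor the map (4.16) through the map (3.13):

$\begin{array}{ccc} H^{s+\sigma}(\Omega) & \xrightarrow{\ j\ } & H^s(\Omega) \\ {\scriptstyle E} \downarrow & & \uparrow {\scriptstyle \rho} \\ H^{s+\sigma}(M) & \xrightarrow{\ j\ } & H^s(M) \end{array}$

which immediately gives (4.16) as a consequence of Proposition 3.4.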


The boundary $\partial\Omega$ of $\Omega$ is a smooth, compact manifold, on which Sobolev
spaces have been defined. By using local coordinate systems flattening out $\partial\Omega$,
together with the extension map (4.14) and the trace theorem, Proposition 1.6, we
have the following result on the trace map:

(4.17) $\tau u = u|_{\partial\Omega}$.

Proposition 4.5. For $s > 1/2$, $\tau$ extends uniquely to a continuous map

(4.18) $\tau : H^s(\Omega) \to H^{s-1/2}(\partial\Omega)$.


We close this section with a consideration of mapping properties on Sobolev
spaces of the Poisson integral considered in §2 of Chap. 3:

(4.19) $\mathrm{PI} : C(S^1) \to C(\bar D)$,

where

(4.20) $D = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 1\}$,

given explicitly by

(4.21) $\mathrm{PI} f(z) = \displaystyle\sum_{k=0}^\infty \hat f(k) z^k + \sum_{k=1}^\infty \hat f(-k) \bar z^k$,

as in (2.4) of Chap. 3, and satisfying the property that

(4.22) $u = \mathrm{PI} f \Longrightarrow \Delta u = 0$ in $D$ and $u|_{S^1} = f$.

The following result can be compared with Proposition 2.2 in Chap. 3.

Proposition 4.6. The Poisson integral gives a continuous map

(4.23) $\mathrm{PI} : H^s(S^1) \to H^{s+1/2}(D)$, for $s \ge -\dfrac{1}{2}$.

Proof. It suffices to prove this for $s = k - 1/2$, $k = 0, 1, 2, \dots$; this result
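for general $s \ge -1/2$ will then follow by interpolation. Recall that to say $f \in
H^{k-1/2}(S^1)$ means

(4.24) $\displaystyle\sum_{n=-\infty}^\infty |\hat f(n)|^2 \langle n \rangle^{2k-1} < \infty$.

Now the functions $\{r^{|n|} e^{in\theta} : n \in \mathbb{Z}\}$ are mutually orthogonal in $L^2(D)$, and

(4.25) $\displaystyle\iint_D \bigl| r^{|n|} e^{in\theta} \bigr|^2\, dx\, dy = 2\pi \int_0^1 r^{2|n|+1}\, dr = \dfrac{\pi}{|n| + 1}$.

In particular, $f \in H^{-1/2}(S^1)$ implies

$\displaystyle\sum_{n=-\infty}^\infty |\hat f(n)|^2 \langle n \rangle^{-1} < \infty$,

which implies $\mathrm{PI} f \in L^2(D)$, by (4.25).

Next, if $f \in H^{k-1/2}(S^1)$, then $(\partial/\partial\theta)^\nu f \in H^{-1/2}(S^1)$, for $0 \le \nu \le k$, so
$(\partial/\partial\theta)^\nu \mathrm{PI} f = \mathrm{PI} (\partial/\partial\theta)^\nu f \in L^2(D)$. We need to show that

$\Bigl( r \dfrac{\partial}{\partial r} \Bigr)^\mu \Bigl( \dfrac{\partial}{\partial\theta} \Bigr)^\nu \mathrm{PI} f \in L^2(D)$,

for $0 \le \mu + \nu \le k$. Indeed, set

(4.26) $N f = \displaystyle\sum_{n=-\infty}^\infty |n|\, \hat f(n)\, e^{in\theta}$.

It follows from Plancherel's theorem that $(\partial/\partial\theta)^\nu N^\mu f \in H^{-1/2}(S^1)$, for $0 \le
\mu + \nu \le k$, if $f \in H^{k-1/2}(S^1)$, while, as in (2.18) of Chap. 2, we have

(4.27) $\Bigl( r \dfrac{\partial}{\partial r} \Bigr)^\mu \Bigl( \dfrac{\partial}{\partial\theta} \Bigr)^\nu \mathrm{PI} f = \mathrm{PI} \Bigl( \dfrac{\partial}{\partial\theta} \Bigr)^\nu N^\mu f$,

which hence belongs to $L^2(D)$. Since $\mathrm{PI} f$ is smooth in a neighborhood of the
origin $r = 0$, this finishes the proof.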
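The computation (4.25) and its use in the first step of this proof can be sketched numerically. In the code below, `disk_norm_squared` approximates the integral in (4.25) by a midpoint rule in the radial variable, and `poisson_l2_norm_squared` evaluates $\|\mathrm{PI} f\|^2_{L^2(D)} = \sum_n |\hat f(n)|^2\, \pi/(|n| + 1)$ for a trigonometric polynomial $f$. Both names, and the dictionary format used for the Fourier coefficients, are assumptions made for illustration.

```python
import math


def disk_norm_squared(n, radial_steps=20_000):
    """Midpoint-rule value of the integral of |r^{|n|} e^{i n theta}|^2 over the unit disk.

    In polar coordinates the integrand is r^{2|n|} and dx dy = r dr dtheta, so the
    theta-integral contributes a factor 2*pi.
    """
    h = 1.0 / radial_steps
    radial = sum(
        ((k + 0.5) * h) ** (2 * abs(n) + 1) for k in range(radial_steps)
    ) * h
    return 2.0 * math.pi * radial


def poisson_l2_norm_squared(coefficients):
    """Squared L^2(D) norm of PI f, using orthogonality and (4.25).

    `coefficients` maps each integer n to the Fourier coefficient f_hat(n);
    this input format is a hypothetical convention for the sketch.
    """
    return sum(
        abs(c) ** 2 * math.pi / (abs(n) + 1) for n, c in coefficients.items()
    )


if __name__ == "__main__":
    for n in (0, 1, -3, 5):
        print(n, disk_norm_squared(n), math.pi / (abs(n) + 1))
    print(poisson_l2_norm_squared({0: 1.0, 1: 1j, -1: 2.0}))
```

Since $\pi/(|n| + 1)$ is comparable to $\langle n \rangle^{-1}$, the sum is finite exactly when $f \in H^{-1/2}(S^1)$, which is the case $k = 0$ of the proposition.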


The Poisson integral taking functions on the sphere $S^{n-1}$ to harmonic functions
on the ball in $\mathbb{R}^n$, and more generally the map taking functions on the
boundary $\partial\Omega$ of a compact Riemannian manifold $\bar\Omega$ (with boundary), to harmonic
functions on $\Omega$, will be studied in Chap. 5.


Exercises 


1. Let $D$ be the unit disk in $\mathbb{R}^2$, with boundary $\partial D = S^1$. Consider the solution to the
Neumann problem

(4.28) $\Delta u = 0$ on $D$, $\quad \dfrac{\partial u}{\partial r} = g$ on $S^1$,

studied in Chap. 3, §2, Exercises 1–4. Show that, for $s > 1/2$,

(4.29) $g \in H^s(S^1) \Longrightarrow u \in H^{s+3/2}(D)$.

(Hint: Write $u = \mathrm{PI} f$, with $N f = g$, where $N$ is given by (4.26).)

2. Let $\bar\Omega$ be a smooth, compact manifold with boundary. Show that the following versions
of the divergence theorem and Green's formula hold:

(4.30) $\displaystyle\int_\Omega \bigl[ (\operatorname{div} X) uv + (Xu) v + u (Xv) \bigr]\, dV = \int_{\partial\Omega} \langle X, \nu \rangle\, uv\, dS$,

when, among $X$, $u$, and $v$, one is smooth and two belong to $H^1(\Omega)$. Also show that

(4.31) $-(u, \Delta v)_{L^2(\Omega)} = (du, dv)_{L^2(\Omega)} - \displaystyle\int_{\partial\Omega} u\, \dfrac{\partial \bar v}{\partial \nu}\, dS$,

for $u \in H^1(\Omega)$, $v \in H^2(\Omega)$. (Hint: Approximate.)

3. Show that if $u \in H^2(\Omega)$ satisfies $\Delta u = 0$ on $\Omega$ and $\partial u/\partial\nu = 0$ on $\partial\Omega$, then $u$ must
be constant, if $\Omega$ is connected. (Hint: Use (4.31) with $v = u$.)

Exercises 4–9 deal with the "oblique derivative problem" for the Laplace operator
on the disk $D \subset \mathbb{R}^2$. The oblique derivative problem on higher-dimensional regions is
discussed in exercises in §12 of Chap. 5.

4. Consider the oblique derivative problem

(4.32) $\Delta u = 0$ on $D$, $\quad a \dfrac{\partial u}{\partial r} + b \dfrac{\partial u}{\partial\theta} + cu = g$ on $S^1$,

where $a, b, c \in C^\infty(S^1)$ are given. If $u = \mathrm{PI} f$, show that $u$ is a solution if and only
if $Qf = g$, where

(4.33) $Q = M_a N + M_b \dfrac{\partial}{\partial\theta} + M_c : H^{s+1}(S^1) \to H^s(S^1)$.

5. Recall $\Lambda : H^{s+1}(S^1) \to H^s(S^1)$, defined by

(4.34) $\Lambda f(\theta) = \displaystyle\sum_k \langle k \rangle \hat f(k)\, e^{ik\theta}$,

as in (3.9). Show that $\Lambda$ is an isomorphism and that

(4.35) $\Lambda - N : H^s(S^1) \to H^s(S^1)$.

6. With $Q$ as in (4.33), show that $Q = T\Lambda$ with

(4.36) $T = M_{a+ib} P + M_{a-ib} (I - P) + R : H^s(S^1) \to H^s(S^1)$,

where

$R : H^s(S^1) \to H^{s+1}(S^1)$.

Here $P$ is as in Exercise 3 of §3. (Hint: Note that $\partial/\partial\theta = iPN - i(I - P)N$.)

7. Deduce that the operator $Q$ in (4.33) is Fredholm provided $a + ib$ and $a - ib$ are
nowhere vanishing on $S^1$. In particular, if $a$ and $b$ are real-valued, $Q$ is Fredholm
provided $a$ and $b$ have no common zeros on $S^1$. (Hint: Recall Exercises 4–6 of §3.)

8. Let $\mathcal{H} = \{u \in C^2(D) : \Delta u = 0 \text{ in } D\}$. Take $s > 0$. Using the commutative diagram

(4.37) $\begin{array}{ccc} H^{s+1}(S^1) & \xrightarrow{\ \mathrm{PI}\ } & H^{s+3/2}(D) \cap \mathcal{H} \\ {\scriptstyle Q} \downarrow & & \downarrow {\scriptstyle B} \\ H^s(S^1) & \xrightarrow{\ I\ } & H^s(S^1) \end{array}$

where $Q$ is as in (4.33) and

(4.38) $Bu = \Bigl( a \dfrac{\partial u}{\partial r} + b \dfrac{\partial u}{\partial\theta} + cu \Bigr) \Big|_{S^1}$,

deduce that $B$ is Fredholm provided $a, b \in C^\infty(S^1)$ are real-valued and have no
common zeros on $S^1$. In such a case, compute the index of $B$. (Hint: Recall Exercise
7 from §3. Also note that the two horizontal arrows in (4.37) are isomorphisms.)

9. Let $B$ be as above; assume $a, b, c \in C^\infty(S^1)$ are all real-valued. Also assume that $a$
is nowhere vanishing on $S^1$. If $c/a \ge 0$ on $S^1$, show that $\operatorname{Ker} B$ consists at most of
constant functions. (Hint: See Zaremba's principle, in §2 of Chap. 5.)
If, in addition, $c$ is not identically zero, show that $\operatorname{Ker} B = 0$. Using Exercise 8,
show that $B$ has index zero in this case. Draw conclusions about the solvability of the
oblique derivative problem (4.32).

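10. Prove that $C^\infty(\bar\Omega)$ is dense in $H^s(\Omega)$ for all $s \ge 0$.
(Hint: With $E$ as in (4.14), approximate $Eu$ by elements of $C^\infty(M)$.)

11. Consider the Vandermonde determinant

$\Delta_{n+1}(x_0, \dots, x_n) = \det \begin{pmatrix} 1 & 1 & \cdots & 1 \\ x_0 & x_1 & \cdots & x_n \\ \vdots & \vdots & & \vdots \\ x_0^n & x_1^n & \cdots & x_n^n \end{pmatrix}$.

Show that $\Delta_{n+1}(x_0, \dots, x_{n-1}, t)$ is a polynomial of degree $n$ in $t$, with roots
$x_0, \dots, x_{n-1}$, hence equal to $K(t - x_0) \cdots (t - x_{n-1})$; the coefficient $K$ of $t^n$ is
equal to $\Delta_n(x_0, \dots, x_{n-1})$. Deduce by induction that

$\Delta_{n+1}(x_0, \dots, x_n) = \displaystyle\prod_{0 \le j < k \le n} (x_k - x_j)$.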


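12. Given $0 < s < 1$ and $f \in L^2(\mathbb{R}^+)$, show that

(4.39) $f \in H^s(\mathbb{R}^+) \Longleftrightarrow \displaystyle\int_0^\infty t^{-(2s+1)} \|\tau_t f - f\|^2_{L^2(\mathbb{R}^+)}\, dt < \infty$,

where $\tau_t f(x) = f(x + t)$. (Hint: Use Proposition 2.4, with $P^t f(x) = f(x + t)$,
whose infinitesimal generator is $d/dx$, with domain $H^1(\mathbb{R}^+)$. Note that "$\Rightarrow$" also
follows from (4.14) plus (1.28).)

More generally, given $0 < s < 1$ and $f \in L^2(\mathbb{R}^n_+)$, show that

(4.40) $f \in H^s(\mathbb{R}^n_+) \Longleftrightarrow \displaystyle\int_0^\infty t^{-(2s+1)} \|\tau_{te_j} f - f\|^2_{L^2(\mathbb{R}^n_+)}\, dt < \infty$, $1 \le j \le n$,

where $\tau_y$ is as in (1.12).


5. The Sobolev spaces $H_0^s(\Omega)$

Let $\bar\Omega$ be a smooth, compact manifold with boundary; we denote the interior by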
$\Omega$, as before. As before, we can suppose $\bar\Omega$ is contained in a compact, smooth
manifold $M$, with $\partial\Omega$ a smooth hypersurface. For $s \ge 0$, we define $H_0^s(\Omega)$ to
consist of the closure of $C_0^\infty(\Omega)$ in $H^s(\Omega)$. For $s = k$ a nonnegative integer, it is
not hard to show that

(5.1) $H_0^k(\Omega) = \{u \in H^k(M) : \operatorname{supp} u \subset \bar\Omega\}$.

This is because a norm giving the topology of $H^k(\Omega)$ can be taken to be the square
root of

(5.2) $\displaystyle\sum_{j=1}^K \|P_j u\|^2_{L^2(\Omega)}$,


for a certain finite number of differential operators $P_j$ of order $\le k$, which implies
that the closure of $C_0^\infty(\Omega)$ in $H^k(\Omega)$ can be identified with its closure in $H^k(M)$.
Since the topology of $H^s(M)$ for $s \notin \mathbb{Z}^+$ is not defined in such a localizable
fashion, such an argument does not work for general real $s$. For a general closed
set $B$ in $M$, set

(5.3) $H^s_B(M) = \{u \in H^s(M) : \operatorname{supp} u \subset B\}$.

It has been proved in [Fu] that, for $s \ge 0$,

(5.4) $H_0^s(\Omega) = H^s_{\bar\Omega}(M)$ if $s + \dfrac{1}{2} \notin \mathbb{Z}$.

See the exercises below for some related results.
Recall our characterization of the space $H^s(\Omega)$ given in (4.15), which we
rewrite as

(5.5) $H^s(\Omega) \approx H^s(M) / H^s_K(M)$, $K = M \setminus \Omega$.

This characterization makes sense for any $s \in \mathbb{R}$, not just for $s \ge 0$, and we use
it as a definition of $H^s(\Omega)$ for $s < 0$. For $k \in \mathbb{Z}^+$, we can redefine $H^{-k}(\Omega)$ in a
fashion intrinsic to $\Omega$, making use of the following functional analytic argument.

In general, if $E$ is a Banach space, with dual $E^*$, and $F$ a closed linear subspace
of $E$, we have a natural isomorphism of dual spaces:

(5.6) $F^* \approx E^* / F^\perp$,

where

(5.7) $F^\perp = \{u \in E^* : \langle v, u \rangle = 0 \text{ for all } v \in F\}$.

If $E = H^k(M)$, we take $F = H_0^k(\Omega)$, which, as discussed above, we can regard
as the closure of $C_0^\infty(\Omega)$ in $H^k(M) = E$. Then it is clear that $F^\perp = H^{-k}_K(M)$,
with $K = M \setminus \Omega$, so we have proved:


Proposition 5.1. For $\Omega$ open in $M$ with smooth boundary, $k \ge 0$ an integer, we
have a natural isomorphism

(5.8) $H_0^k(\Omega)^* \approx H^{-k}(\Omega)$.


Let $P$ be a differential operator of order $2k$, with smooth coefficients on $\bar\Omega$.
Suppose

(5.9) $P = \displaystyle\sum_{j=1}^L A_j B_j$,

where $A_j$ and $B_j$ are differential operators of order $k$, with coefficients smooth on
$\bar\Omega$. Then we have a well-defined continuous linear map

(5.10) $P : H_0^k(\Omega) \to H^{-k}(\Omega)$,

and, if $A_j^t$ denotes the formal adjoint of $A_j$ on $\Omega$, endowed with a smooth
Riemannian metric, then, for $u, v \in H_0^k(\Omega)$, we have

(5.11) $\langle u, Pv \rangle = \displaystyle\sum_{j=1}^L (A_j^t u, B_j v)_{L^2(\Omega)}$;

the dual pairing on the left side being that of (5.8). In fact, the formula (5.5) gives

(5.12) $P : H^s(\Omega) \to H^{s-2k}(\Omega)$

for all real $s$, and in particular

(5.13) $P : H^k(\Omega) \to H^{-k}(\Omega)$,

and the identity (5.11) holds for $v \in H^k(\Omega)$, provided $u \in H_0^k(\Omega)$. In Chap. 5 we
will study in detail properties of the map (5.10) when $P$ is the Laplace operator
(so $k = 1$).

The following is an elementary but useful result. 


Proposition 5.2. Suppose $\bar\Omega$ is a smooth, connected, compact manifold with
boundary, endowed with a Riemannian metric. Suppose $\partial\Omega \neq \emptyset$. Then there exists
a constant $C = C(\Omega) < \infty$ such that

(5.14) $\|u\|^2_{L^2(\Omega)} \le C \|du\|^2_{L^2(\Omega)}$, for $u \in H_0^1(\Omega)$.

It suffices to establish (5.14) for $u \in C_0^\infty(\Omega)$. Given $u|_{\partial\Omega} = 0$, one can write

(5.15) $u(x) = -\displaystyle\int_{\gamma(x)} du$,

for any $x \in \Omega$, where $\gamma(x)$ is some path from $x$ to $\partial\Omega$. Upon making a reasonable
choice of $\gamma(x)$, obtaining (5.14) is an exercise, which we leave to the reader. (See
Exercises 4–5 below.)

Finding a sharp value of C’ such that (5.14) holds is a challenging problem, 
for which a number of interesting results have been obtained. As will follow from 
results in Chap. 5, this is equivalent to the problem of estimating the smallest 
eigenvalue of —A on Q, with Dirichlet boundary conditions. 
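As a rough numerical illustration of (5.14), and not part of the argument, the following Python sketch takes the one-dimensional case $\Omega = (0,1)$, where the smallest Dirichlet eigenvalue of $-d^2/dx^2$ is $\pi^2$, so the sharp constant is $1/\pi^2$. The helper name `poincare_ratio` and the choice of test functions are assumptions made here for illustration only.

```python
import math

def poincare_ratio(u, du, a=0.0, b=1.0, n=2000):
    """Approximate ||u||^2 / ||u'||^2 on (a, b) with the midpoint rule."""
    h = (b - a) / n
    mids = [a + (i + 0.5) * h for i in range(n)]
    num = sum(u(x) ** 2 for x in mids) * h
    den = sum(du(x) ** 2 for x in mids) * h
    return num / den

# Test functions vanishing at x = 0 and x = 1, paired with their derivatives.
samples = {
    "sin(pi x)": (lambda x: math.sin(math.pi * x),
                  lambda x: math.pi * math.cos(math.pi * x)),
    "x(1-x)": (lambda x: x * (1.0 - x),
               lambda x: 1.0 - 2.0 * x),
    "sin(2 pi x)": (lambda x: math.sin(2 * math.pi * x),
                    lambda x: 2 * math.pi * math.cos(2 * math.pi * x)),
}

sharp = 1.0 / math.pi ** 2  # reciprocal of the smallest Dirichlet eigenvalue on (0, 1)
ratios = {name: poincare_ratio(u, du) for name, (u, du) in samples.items()}

for name, r in ratios.items():
    print(f"{name:12s} ratio = {r:.6f}   sharp constant = {sharp:.6f}")
```

Each ratio stays below $1/\pi^2$, and the first eigenfunction $\sin(\pi x)$ attains it.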
Below, there is a sequence of exercises, one of whose implications is that

(5.16) $[L^2(\Omega), H_0^1(\Omega)]_s = H_0^s(\Omega) = H^s(\Omega), \quad 0 < s < \tfrac{1}{2}.$

Here we will establish a result that is useful for the proof.

Proposition 5.3. Let $\Omega \subset \mathbb{R}^n$ be a bounded region with smooth boundary. If
$0 \le s < 1/2$, and $Tu = \chi_\Omega u$, then

(5.17) $T : H^s(\mathbb{R}^n) \to H^s(\mathbb{R}^n).$


Proof. It is easy to reduce this to the case $\Omega = \mathbb{R}^n_+$, and then to the case $n = 1$,
which we will treat here. Also, the case $s = 0$ is trivial, so we take $0 < s < 1/2$.
By (1.28), it suffices to estimate

(5.18) $\int_0^\infty t^{-(2s+1)} \|\tau_t \tilde{u} - \tilde{u}\|^2_{L^2(\mathbb{R})}\, dt,$

where $\tilde{u}(x) = Tu(x)$, so, for $t > 0$,

(5.19) $\tau_t \tilde{u}(x) - \tilde{u}(x) = \begin{cases} u(t+x) - u(x), & x > 0, \\ u(t+x), & -t < x < 0, \\ 0, & x < -t. \end{cases}$

Hence (5.18) is

(5.20) $\int_0^\infty t^{-(2s+1)} \|\tau_t u - u\|^2_{L^2(\mathbb{R}^+)}\, dt + \int_0^\infty t^{-(2s+1)} \int_{-t}^0 |u(t+x)|^2\, dx\, dt.$

The first term in (5.20) is finite for $u \in H^s(\mathbb{R})$, $0 < s < 1$, by (1.28). The last
term in (5.20) is equal to

(5.21) $\int_0^\infty \int_0^t t^{-(2s+1)} |u(t-x)|^2\, dx\, dt = \int_0^\infty \int_0^t t^{-(2s+1)} |u(x)|^2\, dx\, dt = c_s \int_0^\infty |x|^{-2s} |u(x)|^2\, dx.$

The next lemma implies that this is finite for $u \in H^s(\mathbb{R})$, $0 < s < 1/2$.
Lemma 5.4. If $0 < s < 1/2$, then

(5.22) $u \in H^s(\mathbb{R}^n) \Longrightarrow |x_1|^{-s} u \in L^2(\mathbb{R}^n).$

Proof. The general case is easily deduced from the case $n = 1$, which we establish
here. Also, it suffices to show that, for $0 < s < 1/2$,

(5.23) $u \in H^s(\mathbb{R}) \Longrightarrow x^{-s} \tilde{u} \in L^2(\mathbb{R}^+),$

where $\tilde{u} = u|_{\mathbb{R}^+}$. Now, for $x > 0$, $u \in C_0^\infty(\mathbb{R})$, set

(5.24) $v(x) = \frac{1}{x} \int_0^x [u(x) - u(y)]\, dy, \quad w(x) = \int_x^\infty \frac{v(y)}{y}\, dy.$

We claim that

(5.25) $u(x) = v(x) - w(x), \quad x > 0.$

In fact, if $u \in C_0^\infty(\mathbb{R})$, then $v(x) \to 0$ and $w(x) \to 0$ as $x \to +\infty$, and one
verifies easily that $u'(x) = v'(x) - w'(x)$. Thus it suffices to show that, for
$0 < s < 1/2$,

(5.26) $\|x^{-s} v\|_{L^2(\mathbb{R}^+)} \le C \|u\|_{H^s(\mathbb{R})}, \quad \|x^{-s} w\|_{L^2(\mathbb{R}^+)} \le C \|u\|_{H^s(\mathbb{R})},$

for $u \in C_0^\infty(\mathbb{R})$.
To verify the first estimate in (5.26), we will use the simple fact that
$|v(x)|^2 \le (1/x) \int_0^x |u(x) - u(y)|^2\, dy$. Hence

(5.27) $\int_0^\infty x^{-2s} |v(x)|^2\, dx \le \int_0^\infty \int_0^x x^{-(2s+1)} |u(x) - u(y)|^2\, dy\, dx$
$= \int_0^\infty \int_0^\infty (y+t)^{-(2s+1)} |u(y+t) - u(y)|^2\, dt\, dy$
$\le \int_0^\infty t^{-(2s+1)} \|\tau_t u - u\|^2_{L^2(\mathbb{R}^+)}\, dt.$

Since the $L^2(\mathbb{R}^+)$-norm is less than the $L^2(\mathbb{R})$-norm, it follows from (1.28) that
the last integral in (5.27) is dominated by $C \|u\|^2_{H^s(\mathbb{R})}$, for $0 < s < 1$.
Thus, to prove the rest of (5.26), it suffices to show that

(5.28) $\|x^{-s} w\|_{L^2(\mathbb{R}^+)} \le C \|x^{-s} v\|_{L^2(\mathbb{R}^+)}, \quad 0 < s < \tfrac{1}{2},$

or equivalently, that $\|w\|_{L^2(\mathbb{R}^+, x^{-2s} dx)} \le C \|v\|_{L^2(\mathbb{R}^+, x^{-2s} dx)}$. In turn, this follows
from the estimate (2.34), with $\beta = 2s$, since we have $w = \Phi^* v$, where $\Phi^*$
acting on $L^2(\mathbb{R}^+, x^{-\beta} dx)$ is the adjoint of $\Phi$ in (2.34). This completes the proof
of the lemma, hence of Proposition 5.3.


Corollary 5.5. If $Sv(x) = v(x)$ for $x \in \Omega$, and $Sv(x) = 0$ for $x \in \mathbb{R}^n \setminus \Omega$, then

(5.29) $S : H^s(\Omega) \to H^s(\mathbb{R}^n), \quad 0 \le s < \tfrac{1}{2}.$

Proof. Apply Proposition 5.3 to $u = Ev$, where $E : H^s(\Omega) \to H^s(\mathbb{R}^n)$ is any
extension operator that works for $0 \le s \le 1$.


Exercises 


1. Give a detailed proof of (5.1).
2. With $\tau u = u|_{\partial\Omega}$, as in (4.17), prove that

(5.30) $H_0^1(\Omega) = \{u \in H^1(\Omega) : \tau u = 0\}.$

(Hint: Given $u \in H^1(\Omega)$ and $\tau u = 0$, define $\tilde{u}(x) = u(x)$ for $x \in \Omega$, $\tilde{u}(x) = 0$ for
$x \in M \setminus \Omega$. Use (4.30) to show that $\tilde{u} \in H^1(M)$.)

3. Let $u \in H^k(\Omega)$. Prove that $u \in H_0^k(\Omega)$ if and only if $\tau(Pu) = 0$ for all differential
operators $P$ (with smooth coefficients) of order $\le k - 1$ on $M$.

4. Give a detailed proof of Proposition 5.2 along the lines suggested, involving (5.15).

5. Give an alternative proof of Proposition 5.2, making use of the compactness of the
inclusion $H^1(\Omega) \hookrightarrow L^2(\Omega)$. (Hint: If (5.14) is false, take $u_j \in H_0^1(\Omega)$ such that
$\|du_j\|_{L^2} \to 0$, $\|u_j\|_{L^2} = 1$. The compactness yields a subsequence $u_j \to v$ in
$H^1(\Omega)$. Hence $\|v\|_{L^2} = 1$ while $\|dv\|_{L^2} = 0$.)

6. Suppose $\Omega \subset \mathbb{R}^n$ lies between two parallel hyperplanes, $x_1 = A$ and $x_1 = B$. Show
that the estimate (5.14) holds with $C = (B - A)^2/\pi^2$.

Reconsider this problem after reading §1 of Chap. 5.
7. Show that $C^\infty(\overline{\Omega})$ is dense in $H^{-s}(\Omega)$, for $s \ge 0$. Compare Exercise 10 of §4.
8. Give a detailed proof that (5.11) is true for $u \in H_0^k(\Omega)$, $v \in H^k(\Omega)$.

(Hint: Approximate $u$ by $u_j \in C_0^\infty(\Omega)$ and $v$ by $v_j \in C^\infty(\overline{\Omega})$.)

9. Show that if $P^t$ is the formal adjoint of $P$, then $\langle u, Pv\rangle = \langle P^t u, v\rangle$ for $u, v \in H_0^k(\Omega)$.


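In the following problems, let $\Omega$ be an open subset of a compact manifold $M$, with
smooth boundary $\partial\Omega$ and closure $\overline{\Omega}$. Let $\mathcal{O} = M \setminus \overline{\Omega}$.
10. Define $Z : L^2(\Omega) \to L^2(M)$ by $Zu(x) = u(x)$ for $x \in \Omega$, $0$ for $x \in \mathcal{O}$. Show that

(5.31) $Z : H_0^k(\Omega) \to H^k_{\overline{\Omega}}(M), \quad k = 0, 1, 2, \dots$

and that $Z$ is an isomorphism in these cases. Deduce that

(5.32) $Z : [L^2(\Omega), H_0^k(\Omega)]_\theta \to H^{k\theta}_{\overline{\Omega}}(M), \quad 0 < \theta < 1, \ k \in \mathbb{Z}^+.$

11. For fixed but large $N$, let $E : H^s(\mathcal{O}) \to H^s(M)$ be an extension operator, similar to
(4.14), for $0 \le s \le N$. Define $Tu = u - ERu$, where $Ru = u|_{\mathcal{O}}$. Show that

(5.33) $T : H^s(M) \to H^s_{\overline{\Omega}}(M), \quad 0 \le s \le N.$

Note that $Tu = u$ for $u \in H^s_{\overline{\Omega}}(M)$.
12. Set $T^b u = Tu|_\Omega$, so $T^b : H^k(M) \to H_0^k(\Omega)$, for $0 \le k \le N$, and hence

$T^b : H^{k\theta}(M) \to [L^2(\Omega), H_0^k(\Omega)]_\theta.$

Show that

$T^b j Z = \text{id. on } [L^2(\Omega), H_0^k(\Omega)]_\theta,$

where $j : H^s_{\overline{\Omega}}(M) \hookrightarrow H^s(M)$ is the natural inclusion. Deduce that (5.32) is an
isomorphism. Conclude that

(5.34) $[L^2(\Omega), H_0^k(\Omega)]_\theta \approx [H^0_{\overline{\Omega}}(M), H^k_{\overline{\Omega}}(M)]_\theta = H^{k\theta}_{\overline{\Omega}}(M), \quad 0 < \theta < 1.$

13. Show that $H^s_{\overline{\Omega}}(M)$ is equal to the closure of $C_0^\infty(\Omega)$ in $H^s(M)$. (This can fail when
$\partial\Omega$ is not smooth.) Conclude that there is a natural injective map

$\kappa : H^s_{\overline{\Omega}}(M) \to H_0^s(\Omega), \quad s \ge 0.$

(Hint: Recall that $H_0^s(\Omega)$ is the closure of $C_0^\infty(\Omega)$ in $H^s(\Omega) \approx H^s(M)/H^s_K(M)$.)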


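14. If $Z$ is defined as in Exercise 10, use Corollary 5.5 to show that

(5.35) $Z : H_0^s(\Omega) \to H^s(M), \quad 0 \le s < \tfrac{1}{2}.$

15. If $v \in C^\infty(\overline{\Omega})$, and $w = v$ on $\Omega$, $0$ on $\mathcal{O}$, show that $w \in H^s(M)$, for all $s \in [0, 1/2)$.
If $v = 1$, show that $w \notin H^{1/2}(M)$.
16. Show that

(5.36) $H_0^s(\Omega) = H^s(\Omega), \quad \text{for } 0 \le s \le \tfrac{1}{2}.$

(Hint: To show that $C_0^\infty(\Omega)$ is dense in $H^s(\Omega)$, show that $\{u \in C^\infty(M) : u = 0 \text{ near } \partial\Omega\}$
is dense in $H^s(M)$, for $0 \le s \le 1/2$.)

17. Using the results of Exercises 10–16, show that, for $k \in \mathbb{Z}^+$,

(5.37) $[L^2(\Omega), H_0^k(\Omega)]_\theta = H_0^s(\Omega) = H^s(\Omega) \quad \text{if } s = k\theta \in [0, \tfrac{1}{2}).$

See [LM], pp. 60–62, for a demonstration that, for $s \ge 0$,

$Z : H_0^s(\Omega) \to H^s(M) \iff s - \tfrac{1}{2} \notin \mathbb{Z},$

which, by Exercise 12, implies (5.4) and also, for $k \in \mathbb{Z}^+$,

$[L^2(\Omega), H_0^k(\Omega)]_\theta = H_0^s(\Omega) \quad \text{if } s = k\theta \notin \mathbb{Z} + \tfrac{1}{2}.$

18. If $F$ is a closed subspace of a Banach space $E$, there is a natural isomorphism $(E/F)^* \approx
F^\perp = \{\omega \in E^* : \langle f, \omega\rangle = 0, \ \forall f \in F\}$. Use this to show that

(5.38) $H^s(\Omega)^* \approx H^{-s}_{\overline{\Omega}}(M).$

19. Applying (5.6) with $E = H^k(\Omega)$, $F = H_0^k(\Omega)$, in conjunction with (5.8) and (5.38),
show that for $k \in \mathbb{N}$,

(5.39) $H^{-k}(\Omega) \approx H^{-k}_{\overline{\Omega}}(M)/H^{-k}_{\partial\Omega}(M).$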
6. The Schwartz kernel theorem

Let $M$ and $N$ be compact manifolds. Suppose

(6.1) $T : C^\infty(M) \to \mathcal{D}'(N)$

is a linear map that is continuous. We give $C^\infty(M)$ its usual Fréchet space topology
and $\mathcal{D}'(N)$ its weak* topology. Consequently, we have a bilinear map

(6.2) $B : C^\infty(M) \times C^\infty(N) \to \mathbb{C},$

separately continuous in each factor, given by

(6.3) $B(u, v) = \langle v, Tu\rangle, \quad u \in C^\infty(M), \ v \in C^\infty(N).$

For such $u$, $v$, define

(6.4) $u \otimes v \in C^\infty(M \times N)$

by

(6.5) $(u \otimes v)(x, y) = u(x) v(y), \quad x \in M, \ y \in N.$

We aim to prove the following result, known as the Schwartz kernel theorem.

Theorem 6.1. Given $B$ as in (6.2), there exists a distribution

(6.6) $\kappa \in \mathcal{D}'(M \times N)$

such that

(6.7) $B(u, v) = \langle u \otimes v, \kappa\rangle,$

for all $u \in C^\infty(M)$, $v \in C^\infty(N)$.


We note that the right side of (6.7) defines a bilinear map (6.2) that is con- 
tinuous in each factor, so Theorem 6.1 establishes an isomorphism between 
D'(M x N) and the space of maps of the form (6.2), or equivalently the space of 
continuous linear maps (6.1). 

The first step in the proof is to elevate the hypothesis of separate continuity
to an apparently stronger condition. Generally speaking, let $E$ and $F$ be Fréchet
spaces, and let

(6.8) $\beta : E \times F \to \mathbb{C}$

be a separately continuous bilinear map. Suppose the topology of $E$ is defined
by seminorms $p_1 \le p_2 \le p_3 \le \cdots$ and that of $F$ by seminorms $q_1 \le q_2 \le
q_3 \le \cdots$. We have the following result.

Proposition 6.2. If $\beta$ in (6.8) is separately continuous, then there exist seminorms
$p_K$ and $q_L$ and a constant $C'$ such that

(6.9) $|\beta(u, v)| \le C' p_K(u) q_L(v), \quad u \in E, \ v \in F.$

Proof. This will follow from the Baire category theorem, in analogy with the
proof of the uniform boundedness theorem. Let $S_{C,j} \subset E$ consist of $u \in E$ such
that

(6.10) $|\beta(u, v)| \le C q_j(v), \quad \text{for all } v \in F.$

The hypothesis that $\beta$ is continuous in $v$ for each $u$ implies

(6.11) $\bigcup_{C,j} S_{C,j} = E.$

The hypothesis that $\beta$ is continuous in $u$ implies that each $S_{C,j}$ is closed.
The Baire category theorem implies that some $S_{C,L}$ has nonempty interior.
Hence $S_{1/2,L} = (2C)^{-1} S_{C,L}$ has nonempty interior. Since $S_{c,L} = -S_{c,L}$ and
$S_{1/2,L} + S_{1/2,L} = S_{1,L}$, it follows that $S_{1,L}$ is a neighborhood of $0$ in $E$. Picking
$K$ so large that, for some $C_1$, the set of $u \in E$ with $p_K(u) \le C_1$ is contained
in this neighborhood, we have (6.9) with $C' = C/C_1$. This proves the
proposition.


Returning to the bilinear map $B$ of (6.2), we use Sobolev norms to define the
topology of $C^\infty(M)$ and of $C^\infty(N)$:

(6.12) $p_j(u) = \|u\|_{H^j(M)}, \quad q_j(v) = \|v\|_{H^j(N)}.$

In the case of $M = \mathbb{T}^m$, we can take

(6.13) $p_j(u) = \Bigl( \sum_{|\alpha| \le j} \|D^\alpha u\|^2_{L^2(\mathbb{T}^m)} \Bigr)^{1/2},$

and similarly for $q_j(v)$ if $N = \mathbb{T}^n$. Proposition 6.2 implies that there are $C, K, L$
such that

(6.14) $|B(u, v)| \le C \|u\|_{H^K(M)} \|v\|_{H^L(N)}.$

Recalling that the dual of $H^L(N)$ is $H^{-L}(N)$, we have the following result.

Proposition 6.3. Let $B$ be as in Theorem 6.1. Then for some $K, L$, there is a
continuous linear map

(6.15) $T : H^K(M) \to H^{-L}(N)$

such that

(6.16) $B(u, v) = \langle v, Tu\rangle, \quad \text{for } u \in C^\infty(M), \ v \in C^\infty(N).$

Thus, if a continuous linear map of the form (6.1) is given, it has a continuous
linear extension of the form (6.15).


In the next few steps of the proof of Theorem 6.1, it will be convenient to work
with the case $M = \mathbb{T}^m$, $N = \mathbb{T}^n$. Once Theorem 6.1 is established in this case, it
can readily be extended to the general case.

Recall from (3.7) the isomorphisms

(6.17) $\Lambda^s : H^\sigma(\mathbb{T}^n) \to H^{\sigma - s}(\mathbb{T}^n),$

for all real $s$, $\sigma$, where $\Lambda^2 = I - \Delta$. It follows from (6.15) that

(6.18) $T_{jk} = (I - \Delta)^{-j}\, T\, (I - \Delta)^{-k} : L^2(\mathbb{T}^m) \to H^s(\mathbb{T}^n)$

as long as $k \ge K/2$ and $j \ge L/2 + s$. Note that

(6.19) $T = (I - \Delta)^j\, T_{jk}\, (I - \Delta)^k.$

The next step in our analysis will exploit the fact that if $j$ is picked sufficiently
large in (6.18), then $T_{jk}$ is a Hilbert-Schmidt operator from $L^2(\mathbb{T}^m)$ to $L^2(\mathbb{T}^n)$.
We recall here the notion of a Hilbert-Schmidt operator, which is discussed in
detail in §6 of Appendix A. Let $H_1$ and $H_2$ be two separable infinite dimensional
Hilbert spaces, with orthonormal bases $\{u_j\}$ and $\{v_j\}$, respectively. Then
$A : H_1 \to H_2$ is Hilbert-Schmidt if and only if

(6.20) $\sum_j \|A u_j\|^2 = \sum_{j,k} |a_{jk}|^2 < \infty,$

where $a_{jk} = (A u_j, v_k)$. The quantity on the left is denoted $\|A\|^2_{HS}$. It is not hard
to show that this property is independent of choices of orthonormal bases. Also, if
there are bounded operators $V_1 : X_1 \to H_1$ and $V_2 : H_2 \to X_2$ between Hilbert
spaces, we have

(6.21) $\|V_2 A V_1\|_{HS} \le \|V_2\| \cdot \|A\|_{HS} \cdot \|V_1\|,$

where of course $\|V_j\|$ are operator norms. If $V_j$ are both unitary, there is identity
in (6.21). For short, we call a Hilbert-Schmidt operator an “HS operator.”


From the definition, and using the exponential functions for Fourier series as
an orthonormal basis, it easily follows that

(6.22) $\Lambda^{-\sigma}$ is HS on $L^2(\mathbb{T}^n) \iff \sigma > \tfrac{n}{2}.$

Consequently, we can say of the operator $T_{jk}$ given by (6.18) that

(6.23) $T_{jk} : L^2(\mathbb{T}^m) \to L^2(\mathbb{T}^n)$ is HS if $2k \ge K$ and $2j > L + n$.

Our next tool, which we call the Hilbert-Schmidt kernel theorem, is proved in
§6 of Appendix A.
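The criterion (6.22) can be checked numerically in the simplest case $n = 1$. On $\mathbb{T}^1$ the exponentials $e^{ikx}$ are eigenfunctions of $\Lambda^{-\sigma}$ with eigenvalues $(1 + k^2)^{-\sigma/2}$, so the squared Hilbert-Schmidt norm is $\sum_k (1 + k^2)^{-\sigma}$. The sketch below (the function name `hs_norm_sq_partial` is ours, introduced only for illustration) compares partial sums for $\sigma = 1 > n/2$ with those for the borderline value $\sigma = 1/2$.

```python
import math

def hs_norm_sq_partial(sigma, cutoff):
    """Partial sum over |k| <= cutoff of (1 + k^2)^(-sigma), the case n = 1."""
    return sum((1.0 + k * k) ** (-sigma) for k in range(-cutoff, cutoff + 1))

cutoffs = (10 ** 3, 10 ** 4, 10 ** 5)

# sigma = 1 > 1/2: the partial sums converge; the closed form is pi * coth(pi).
convergent = [hs_norm_sq_partial(1.0, N) for N in cutoffs]
limit = math.pi / math.tanh(math.pi)

# sigma = 1/2 (borderline): the partial sums grow like 2 * log(cutoff).
divergent = [hs_norm_sq_partial(0.5, N) for N in cutoffs]

for N, c, d in zip(cutoffs, convergent, divergent):
    print(f"cutoff {N:>6}: sigma=1 -> {c:.6f} (limit {limit:.6f}), sigma=1/2 -> {d:.3f}")
```

The first column settles near $\pi \coth \pi$, while the second keeps growing by roughly $2 \log 10$ per decade of the cutoff, consistent with (6.22).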


Theorem 6.4. Given a Hilbert-Schmidt operator

$T_1 : L^2(X_1, \mu_1) \to L^2(X_2, \mu_2),$

there exists $K \in L^2(X_1 \times X_2, \mu_1 \times \mu_2)$ such that

(6.24) $(T_1 u, v)_{L^2} = \iint K(x_1, x_2)\, u(x_1)\, \overline{v(x_2)}\, d\mu_1(x_1)\, d\mu_2(x_2).$


To proceed with the proof of the Schwartz kernel theorem, we can now estab- 
lish the following. 


Proposition 6.5. The conclusion of Theorem 6.1 holds when $M = \mathbb{T}^m$ and
$N = \mathbb{T}^n$.

Proof. By Theorem 6.4, there exists $K \in L^2(\mathbb{T}^m \times \mathbb{T}^n)$ such that

(6.25) $\langle v, T_{jk} u\rangle = \iint K(x, y)\, u(x)\, v(y)\, dx\, dy,$

for $u \in C^\infty(\mathbb{T}^m)$, $v \in C^\infty(\mathbb{T}^n)$, provided $T_{jk}$, given by (6.18), satisfies (6.23).
In view of (6.19), this implies

(6.26) $\langle v, Tu\rangle = \langle (I - \Delta)^j v, \ T_{jk} (I - \Delta)^k u\rangle = \iint K(x, y)\, (I - \Delta_y)^j v(y)\, (I - \Delta_x)^k u(x)\, dx\, dy,$

so (6.7) holds with

(6.27) $\kappa = (I - \Delta_x)^k (I - \Delta_y)^j K(x, y) \in \mathcal{D}'(\mathbb{T}^m \times \mathbb{T}^n).$

Now Theorem 6.1 for general compact $M$ and $N$ can be proved by writing

(6.28) $B(u, v) = \sum_{j,k} B(\varphi_j u, \psi_k v),$

for partitions of unity $\{\varphi_j\}$, $\{\psi_k\}$ subordinate to coordinate covers of $M$ and $N$,
and transferring the problem to the case of tori.


Exercises 


1. Extend Theorem 6.1 to treat the case of

$B : C_0^\infty(M) \times C_0^\infty(N) \to \mathbb{C},$

when $M$ and $N$ are smooth, paracompact manifolds. State carefully an appropriate
continuity hypothesis on $B$.
2. What is the Schwartz kernel of the identity map $I : C^\infty(\mathbb{T}^n) \to C^\infty(\mathbb{T}^n)$?


7. Sobolev spaces on rough domains 


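With $\Omega \subset M$ as in §§4–5, suppose $\mathcal{O} \subset \Omega$ is an open subset, perhaps with quite
rough boundary. As in our definitions of $H^k(\Omega)$ and $H_0^k(\Omega)$, we set, for $k \in \mathbb{Z}^+$,

(7.1) $H^k(\mathcal{O}) = \{u \in L^2(\mathcal{O}) : Pu \in L^2(\mathcal{O}), \ \forall P \in \text{Diff}^k(M)\},$

where $\text{Diff}^k(M)$ denotes the set of all differential operators of order $\le k$, with
$C^\infty$ coefficients, on $M$. Then we set

(7.2) $H_0^k(\mathcal{O}) = \text{closure of } C_0^\infty(\mathcal{O}) \text{ in } H^k(\mathcal{O}).$

There exist operators $P_{k1}, \dots, P_{kN} \in \text{Diff}^k(M)$ spanning $\text{Diff}^k(M)$ over
$C^\infty(M)$, $N = N(k)$, and we can take

(7.3) $\|u\|^2_{H^k(\mathcal{O})} = \sum_{j=1}^{N} \|P_{kj} u\|^2_{L^2(\mathcal{O})}.$

It readily follows that

(7.4) $H_0^k(\mathcal{O}) = \text{closure of } C_0^\infty(\mathcal{O}) \text{ in } H^k(M),$

with $u \in H_0^k(\mathcal{O})$ extended by $0$ off $\mathcal{O}$. We have

(7.5) $H_0^k(\mathcal{O}) \subset H^k_{\overline{\mathcal{O}}}(M),$

where

(7.6) $H^k_{\overline{\mathcal{O}}}(M) = \{u \in H^k(M) : \text{supp } u \subset \overline{\mathcal{O}}\}.$

Unlike in (5.1), the reverse inclusion can fail for rough $\partial\mathcal{O}$. Here is a condition
favorable for such a reverse inclusion.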


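Proposition 7.1. If at each point $\partial\mathcal{O}$ is locally the graph of a continuous function,
then

(7.7) $H_0^k(\mathcal{O}) = H^k_{\overline{\mathcal{O}}}(M).$

In such a case, given $u \in H^k_{\overline{\mathcal{O}}}(M)$, one can use a partition of unity, slight
shifts, and mollifiers to realize $u$ as a limit in $H^k(M)$ of functions in $C_0^\infty(\mathcal{O})$.

A simple example of a domain $\mathcal{O}$ for which (7.7) fails, for all $k \ge 1$, is the slit
disk:

(7.8) $\mathcal{O} = \{x \in \mathbb{R}^2 : |x| < 1\} \setminus \{(x_1, 0) : 0 \le x_1 < 1\}.$

Another easy consequence of (7.4), plus Proposition 4.4, is that for $k \ge 1$, the
natural injection

(7.9) $H_0^k(\mathcal{O}) \hookrightarrow L^2(\mathcal{O})$ is compact.

Also, the extension of $u \in H_0^k(\mathcal{O})$ by zero off $\mathcal{O}$ gives

(7.10) $H_0^k(\mathcal{O}) \hookrightarrow H_0^k(\Omega)$, closed subspace.

Specializing this to $k = 1$ and recalling Proposition 5.2, we have

(7.11) $\|u\|^2_{L^2(\mathcal{O})} \le \tilde{C} \|du\|^2_{L^2(\mathcal{O})}, \quad \forall u \in H_0^1(\mathcal{O}),$

with $\tilde{C} \le C$, where $C$ is as in (5.14).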

Recall the restriction map $\rho : H^k(M) \to H^k(\Omega)$, considered in §4. Similarly
we have $\rho : H^k(M) \to H^k(\mathcal{O})$, but for rough $\partial\mathcal{O}$ this map might not be onto.
There might not be an extension operator $E : H^k(\mathcal{O}) \to H^k(M)$, as in (4.12).
Here is one favorable case for the existence of an extension operator.

Proposition 7.2. If at each point $\partial\mathcal{O}$ is locally the graph of a Lipschitz function,
then there exists

(7.12) $E : H^k(\mathcal{O}) \to H^k(M), \quad \text{for } k = 0, 1, \quad \rho E = I \text{ on } H^k(\mathcal{O}).$

In such a case, given $u \in H^k(\mathcal{O})$, one can use a partition of unity to reduce
the construction to extending $u$ supported on a small neighborhood in $\overline{\mathcal{O}}$ of a
point $p_0 \in \partial\mathcal{O}$ and use a bi-Lipschitz map to flatten out $\partial\mathcal{O}$ on this support. Such
bi-Lipschitz maps preserve $H^k$ for $k = 0$ and $1$, and we can appeal to Lemma 4.1.

If (7.12) holds, then, as in Proposition 4.4, we have

(7.13) $H^1(\mathcal{O}) \hookrightarrow L^2(\mathcal{O})$

compact. However, for rough $\partial\mathcal{O}$, compactness in (7.13) can fail. A simple example
of such failure is given by

(7.14) $\mathcal{O} = \bigcup_{k=1}^{\infty} \mathcal{O}_k, \quad \mathcal{O}_k = \{x \in \mathbb{R}^2 : |x - (2^{-k}, 0)| < 8^{-k}\}.$

When (7.12) holds, results on

(7.15) $H^s(\mathcal{O}) = [L^2(\mathcal{O}), H^1(\mathcal{O})]_s, \quad 0 < s < 1,$

parallel to those presented in §4, hold, as the reader is invited to verify.


Exercises 


1. The example (7.8), for which (7.7) fails, is not equal to the interior of its closure.
Construct $\mathcal{O} \subset \mathbb{R}^n$, equal to the interior of its closure, for which (7.7) fails.

2. The example (7.14), for which (7.13) is not compact, has infinitely many connected
components. Construct a connected, open, bounded $\mathcal{O} \subset \mathbb{R}^n$, such that (7.13) is not
compact.
Linear Elliptic Equations


Introduction 


The first major topic of this chapter is the Dirichlet problem for the Laplace
operator on a compact domain with boundary:

(0.1) $\Delta u = 0 \text{ on } \Omega, \quad u|_{\partial\Omega} = f.$

We also consider the nonhomogeneous problem $\Delta u = g$ and allow for lower-order
terms. As in Chap. 2, $\Delta$ is the Laplace operator determined by a Riemannian
metric. In §1 we establish some basic results on existence and regularity of solutions,
using the theory of Sobolev spaces. In §2 we establish maximum principles,
which are useful for uniqueness theorems and for treating (0.1) for $f$ continuous,
among other things.

For general $\Omega$, one does not expect to write down an explicit integral formula
for solutions to (0.1), but when $\Omega$ is the unit ball in $\mathbb{R}^n$ this is possible. The
resulting formula, called the Poisson integral formula, is derived in §3, generalizing
the formula for the disk in $\mathbb{R}^2$ derived in §2 of Chap. 3. In §3 we also derive
some consequences of this Poisson integral formula, including Harnack inequalities,
Liouville theorems, a removable singularity theorem, and a variant known as
Bôcher’s theorem.

One of the most famous classical applications of the solvability of (0.1) is to
a proof of the Riemann mapping theorem. We prove this theorem for bounded,
simply connected domains, with smooth boundary, in §4. To prove the Riemann
mapping theorem for general simply connected planar domains, it is necessary to
extend the existence theory of §1 to compact domains whose boundaries are not
smooth. We provide results on this in §5, not giving an exhaustive treatment but
going far enough to accomplish the goal of proving the Riemann mapping theorem
in general in §6. The analysis in §5 makes strong use of the maximum principle
established in §2. Further results on irregular boundaries will be established in
Chap. 11, via the use of Brownian motion.

Sections 7–9 include material on other boundary conditions. Section 7 looks at
the Neumann boundary condition

(0.2) $\Delta u = g \text{ on } \Omega, \quad \frac{\partial u}{\partial \nu} = f \text{ on } \partial\Omega.$

It is shown that the methods of §1 extend to treat this when $\overline{\Omega}$ has smooth boundary.
Unlike the case of the Dirichlet problem, we do not discuss the Neumann
boundary condition on domains with nonsmooth boundary, though much has been
done on this; we refer to [Gri, DK, Wil, HMT] and works cited therein. In §8 we
consider the Laplace operator on $k$-forms, and derive the Hodge theorem. When
$\overline{\Omega}$ has a boundary, there arise natural boundary conditions, which we treat in §9.
The Hodge decomposition is extended to the case of manifolds with boundary.
These results have topological significance, providing useful tools in de Rham
cohomology. We develop some of these topological consequences, particularly in
exercise sets following §§8 and 9. The results of these sections also have physical
significance, as will be seen in the analysis of Maxwell’s equations for the electromagnetic
field, in Chap. 6. Further use of this material will be made in Chap. 10,
on index theory, and in Chap. 17, on fluid mechanics.

In §10 there is a brief return to the Dirichlet problem for the Laplace operator, 
in order to prove the existence of isothermal coordinates on any two-dimensional 
Riemannian manifold. We treat this topic so late in the chapter only to have the 
luxury of exploiting the Hodge star operator, introduced in §8. 

In §11 we discuss general elliptic boundary problems. The method of freezing
coefficients, introduced in §9, plays a major role here in producing Sobolev
space estimates for variable-coefficient equations out of estimates for constant-coefficient
equations (and flat boundaries). The latter estimates can be obtained
via Fourier analysis. We analyze which boundary-value problems lead to estimates
and regularity of the sort obtained in earlier sections for the Dirichlet and
Neumann problems. These are called regular elliptic boundary problems. Further
study of regular boundary problems is made in §12. We mention that Hölder space
estimates for solutions to regular elliptic boundary problems will be obtained in
§8 of Chap. 13.

At the end of this chapter are three appendices. One studies spaces of functions
and generalized functions on a compact manifold with boundary, arising from a
self-adjoint elliptic boundary problem. This material will be useful for the discussion
of fundamental solutions to parabolic and hyperbolic equations in the next
chapter. The second appendix, on the Mayer-Vietoris sequence, complements
some results on de Rham cohomology obtained in §§8 and 9. We illustrate the
use of this sequence with several applications to topology, including a proof of
a variant of the Jordan-Brouwer separation theorem, in the smooth case. In the
third appendix we discuss the topological invariance of de Rham cohomology on
the class of smooth compact manifolds, and relate this to de Rham’s theorem,
which gives an isomorphism of de Rham cohomology and another variety of
cohomology, called singular cohomology.
1. Existence and regularity of solutions to the Dirichlet problem

Let $\overline{\Omega}$ be a smooth, compact Riemannian manifold with boundary, $\Omega$ the interior
of $\overline{\Omega}$. Let $\Delta$ denote the Laplace operator. We have

(1.1) $(-\Delta u, u) = \|du\|^2_{L^2(\Omega)}, \quad \text{for } u \in C_0^\infty(\Omega).$

We will assume here that each connected component of $\overline{\Omega}$ has nonempty
boundary. Thus we have the estimate

(1.2) $\|u\|^2_{L^2(\Omega)} \le C \|du\|^2_{L^2(\Omega)}, \quad u \in C_0^\infty(\Omega),$

by Proposition 5.2 of Chap. 4. Hence

(1.3) $\|du\|^2_{L^2(\Omega)} \approx \|u\|^2_{H^1(\Omega)}, \quad \text{for } u \in H_0^1(\Omega).$

Recall from §5 of Chap. 4 that

(1.4) $\Delta : H_0^1(\Omega) \to H^{-1}(\Omega)$

is well defined. It follows that (1.1) continues to hold for $u \in H_0^1(\Omega)$. Consequently,
(1.3) implies

(1.5) $(-\Delta u, u) \ge C \|u\|^2_{H^1(\Omega)} \quad \text{if } u \in H_0^1(\Omega).$

Furthermore, we have

(1.6) $\|\Delta u\|_{H^{-1}(\Omega)} \ge C \|u\|_{H^1(\Omega)} \quad \text{if } u \in H_0^1(\Omega).$

We can now obtain our first existence theorem.

Proposition 1.1. In (1.4), $\Delta$ is one-to-one and onto.

Proof. Clearly, (1.6) implies $\Delta$ is injective, with closed range. If it is not surjective,
there must be an element of $(H^{-1}(\Omega))^* = H_0^1(\Omega)$ that is orthogonal to the
range, that is, an element $u_0 \in H_0^1(\Omega)$ that satisfies

$(-\Delta u, u_0) = 0, \quad \text{for all } u \in H_0^1(\Omega).$

Setting $u = u_0$, we deduce from (1.5) that $u_0 = 0$, so the proposition is proved.

Thus there is a uniquely determined inverse

(1.7) $T : H^{-1}(\Omega) \to H_0^1(\Omega).$

Note that if $\varphi = \Delta u$, $\psi = \Delta v$, with $u, v \in H_0^1(\Omega)$, then

(1.8) $(T\varphi, \psi) = (T\Delta u, \Delta v) = (u, \Delta v) = -(du, dv) = (\Delta u, v) = (\varphi, T\psi),$

where we have used the fact that (1.1) extends to

(1.9) $(-\Delta u, v) = (du, dv)_{L^2}, \quad \text{for } u, v \in H_0^1(\Omega).$

If we consider the restriction of $T$ to $L^2(\Omega)$, we have

(1.10) $T = T^*.$

Since $T : L^2(\Omega) \to H_0^1(\Omega)$, we have by Rellich’s theorem that $T$ is compact on
$L^2(\Omega)$. We record this useful fact:

Proposition 1.2. The inverse $T$ to $\Delta$ in (1.4) is a compact (negative) self-adjoint
operator on $L^2(\Omega)$.


Hence there is an orthonormal basis $\{u_j\}$ of $L^2(\Omega)$ consisting of eigenfunctions
of $T$:

(1.11) $T u_j = -\mu_j u_j, \quad \mu_j \searrow 0.$

In view of (1.7), we have

(1.12) $u_j \in H_0^1(\Omega), \quad \text{for each } j.$

Furthermore, it is clear that

(1.13) $\Delta u_j = -\lambda_j u_j, \quad \lambda_j = \frac{1}{\mu_j} \nearrow +\infty.$
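To make (1.11)–(1.13) concrete, consider the model case $\Omega = (0, \pi)$, where $u_j(x) = \sin jx$ and $\lambda_j = j^2$. The Python sketch below is a finite-difference illustration under our own assumptions (the grid size and the helper `apply_neg_laplacian` are not from the text). It checks that the sampled functions $\sin jx$ are exact eigenvectors of the standard second-difference approximation of $-d^2/dx^2$ with zero boundary values, and that the discrete eigenvalues are close to $j^2$.

```python
import math

def apply_neg_laplacian(u, h):
    """Second-difference approximation of -u'' with zero values outside the grid."""
    n = len(u)
    out = []
    for i in range(n):
        left = u[i - 1] if i > 0 else 0.0
        right = u[i + 1] if i < n - 1 else 0.0
        out.append((2.0 * u[i] - left - right) / h ** 2)
    return out

n = 200                      # number of interior grid points (illustrative choice)
h = math.pi / (n + 1)        # mesh size on (0, pi)
xs = [(i + 1) * h for i in range(n)]

eigs, residuals = [], []
for j in (1, 2, 3):
    u = [math.sin(j * x) for x in xs]
    lam = (4.0 / h ** 2) * math.sin(j * h / 2.0) ** 2   # discrete eigenvalue
    Au = apply_neg_laplacian(u, h)
    residuals.append(max(abs(a - lam * b) for a, b in zip(Au, u)))
    eigs.append(lam)
    print(f"j = {j}: discrete eigenvalue {lam:.6f}, continuum value {j * j}")
```

The corresponding eigenvalues of the discrete inverse are $1/\lambda_j$, which decrease toward $0$ as $j$ grows, mirroring $\mu_j \searrow 0$.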


We next investigate higher-order regularity of solutions to $\Delta u = f$, and more
generally to

(1.14) $Lu = f, \quad u \in H_0^1(\Omega).$

We consider operators $L$ of the form

(1.15) $Lu = -\Delta u + Xu,$

where $X$ is a first-order differential operator, with smooth coefficients on $\overline{\Omega}$.
Theorem 1.3. Given $f \in H^{k-1}(\Omega)$, for $k = 0, 1, 2, \dots$, a solution $u \in H_0^1(\Omega)$
to (1.14) belongs to $H^{k+1}(\Omega)$, and we have the estimate

(1.16) $\|u\|^2_{H^{k+1}} \le C \|Lu\|^2_{H^{k-1}} + C \|u\|^2_{H^k},$

for all $u \in H^{k+1}(\Omega) \cap H_0^1(\Omega)$.

Proof. First we establish the estimate (1.16) for $k = 0$. By (1.5), together with
the estimate

$|(Xu, u)| \le C \|u\|_{H^1} \|u\|_{L^2} \le \frac{C}{2} \Bigl[ \varepsilon \|u\|^2_{H^1} + \frac{1}{\varepsilon} \|u\|^2_{L^2} \Bigr],$

we have

(1.17) $\text{Re}\, (Lu, u) \ge C \|u\|^2_{H^1} - C' \|u\|^2_{L^2},$

for $u \in H_0^1(\Omega)$. Hence

(1.18) $\|u\|^2_{H^1} \le C\, \text{Re}\, (Lu, u) + C' \|u\|^2_{L^2}.$

Cauchy’s inequality gives

(1.19) $\text{Re}\, (Lu, u) \le C \|Lu\|_{H^{-1}} \|u\|_{H^1} \le C\varepsilon \|u\|^2_{H^1} + \frac{C}{\varepsilon} \|Lu\|^2_{H^{-1}},$

and taking $\varepsilon$ small enough, we can absorb the $\|u\|^2_{H^1}$-term into the left side of
(1.18), obtaining

(1.20) $\|u\|^2_{H^1} \le C \|Lu\|^2_{H^{-1}} + C \|u\|^2_{L^2}, \quad u \in H_0^1(\Omega).$
We now proceed to prove Theorem 1.3 by induction on $k$. Given that

$u \in H_0^1(\Omega), \ Lu = f \in H^{k-1}(\Omega) \Longrightarrow u \in H^{k+1}(\Omega)$

and that (1.16) is true, suppose now that

(1.21) $u \in H_0^1(\Omega), \quad Lu \in H^k(\Omega).$

So, we know that $u \in H^{k+1}(\Omega)$, and we want to establish that $u \in H^{k+2}(\Omega)$ and
also show that $u$ satisfies the estimate (1.16), with $k$ replaced by $k + 1$.
First, note that, for any $\chi \in C^\infty(\overline{\Omega})$,

(1.22) $L(\chi u) = \chi (Lu) + [L, \chi] u,$

and since the commutator $[L, \chi]$ is a first-order differential operator, the hypothesis
(1.21), together with the observation that $u \in H^{k+1}(\Omega)$, gives $L(\chi u) \in
H^k(\Omega)$, so our analysis of $u$ on $\overline{\Omega}$ can be localized.

So suppose $u$, belonging to $H^{k+1}(\Omega)$ and satisfying (1.21), is supported on a
coordinate neighborhood $\mathcal{O}$, either one with no boundary or one in which $\partial\Omega$ is
given by $\{x_n = 0\}$. In either case, we now apply (1.16), with $u$ replaced by

(1.23) $D_{j,h} u(x) = \frac{1}{h} [\tau_{j,h} u(x) - u(x)] = \frac{1}{h} [u(x + h e_j) - u(x)],$

where $e_1, \dots, e_n$ are the standard coordinate vectors in $\mathbb{R}^n$. In case $\mathcal{O}$ has no
boundary, we can take $1 \le j \le n$; otherwise $1 \le j \le n - 1$. By (1.16), we have

(1.24) $\|D_{j,h} u\|^2_{H^{k+1}} \le C \|L D_{j,h} u\|^2_{H^{k-1}} + C \|u\|^2_{H^{k+1}}$
$\le C \|D_{j,h} L u\|^2_{H^{k-1}} + C \|[L, D_{j,h}] u\|^2_{H^{k-1}} + C \|u\|^2_{H^{k+1}}.$

As in (1.22), we have a commutator to estimate. This time, there is the following
result.

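Lemma 1.4. As $h \searrow 0$, $[L, D_{j,h}]$ is a bounded family of operators of order two:

(1.25) $\|[L, D_{j,h}] u\|_{H^{k-1}} \le C \|u\|_{H^{k+1}}, \quad k \ge 0,$

given $u \in H_0^1(\Omega) \cap H^{k+1}(\Omega)$, supported in $\mathcal{O}$.

Proof. The estimate (1.25) follows directly from

(1.26) $\|[M_\varphi, D_{j,h}] v\|_{H^k} \le C \|v\|_{H^k}, \quad k \ge -1, \ \varphi \in C^\infty(\overline{\Omega}),$

which in turn is easy to demonstrate, as

$[M_\varphi, D_{j,h}] v = -M_{(D_{j,h}\varphi)} \circ \tau_{j,h} v.$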
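The commutator identity just stated is a pointwise algebraic fact about difference quotients, so it can be checked directly. The following Python sketch does this in one variable (so the index $j$ is dropped); the particular functions `phi` and `v` and the step `h` are arbitrary choices made here for illustration.

```python
import math

def tau(f, h):
    """Translation: (tau_h f)(x) = f(x + h)."""
    return lambda x: f(x + h)

def D(f, h):
    """Difference quotient: (D_h f)(x) = [f(x + h) - f(x)] / h."""
    return lambda x: (f(x + h) - f(x)) / h

phi = lambda x: math.cos(x) + 2.0     # smooth multiplier (illustrative)
v = lambda x: math.exp(-x * x)        # smooth test function (illustrative)
h = 0.1

def commutator(x):
    # [M_phi, D_h] v = phi * (D_h v) - D_h(phi * v)
    phi_v = lambda y: phi(y) * v(y)
    return phi(x) * D(v, h)(x) - D(phi_v, h)(x)

def right_side(x):
    # -(D_h phi) * (tau_h v)
    return -D(phi, h)(x) * tau(v, h)(x)

points = [-1.0 + 0.25 * i for i in range(9)]
max_gap = max(abs(commutator(x) - right_side(x)) for x in points)
print("largest discrepancy:", max_gap)
```

The discrepancy is at the level of rounding error. Since $D_h \varphi$ stays bounded (with its derivatives) as $h \to 0$ for smooth $\varphi$, this is the mechanism behind the uniform bound (1.26).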
Using (1.25), we can deduce from (1.24) that

(1.27) $\|D_{j,h} u\|^2_{H^{k+1}} \le C \|Lu\|^2_{H^k} + C \|u\|^2_{H^{k+1}},$

and passing to the limit $h \searrow 0$ gives

(1.28) $D_j u \in H^{k+1}(\Omega).$

If $\mathcal{O}$ has no boundary, then (1.28) is valid for $1 \le j \le n$, and we have $u \in
H^{k+2}(\Omega)$. Otherwise, we have (1.28) for $1 \le j \le n - 1$, and it remains to
establish

(1.29) $D_n u \in H^{k+1}(\Omega).$

Recall that $k \ge 0$. Thus we need to know

(1.30) $D_j D_n u \in H^k(\Omega), \quad 1 \le j \le n.$

But $D_j D_n u = D_n D_j u \in H^k(\Omega)$ if $1 \le j \le n - 1$, since we have (1.28) for
$1 \le j \le n - 1$. It remains only to establish

(1.31) $D_n^2 u \in H^k(\Omega).$

To see this, write

(1.32) $g^{nn}(x) D_n^2 u = Lu - \sum_{(j,k) \ne (n,n)} g^{jk}(x) D_j D_k u - Yu,$

where $Y$ is a first-order differential operator. All the terms on the right side of
(1.32) have been shown to be in $H^k(\Omega)$. This establishes (1.31) and completes
the proof of Theorem 1.3.

From Theorem 1.3, we can draw an immediate corollary about the eigenfunctions
$u_j$ of $\Delta$, satisfying (1.11)–(1.13). We have

$L_j u_j = (-\Delta - \lambda_j) u_j = 0,$

which gives the following.

Corollary 1.5. The eigenfunctions $u_j$ of $\Delta$ belong to $C^\infty(\overline{\Omega})$.


We note that the localization argument from (1.22) gives the following local 
regularity result. 


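Proposition 1.6. Let $\mathcal{O} \subset\subset \Omega$. Say $u \in H^1(\Omega)$ and $Lu = f \in H^{k-1}(\Omega)$, $k \ge 0$.
Then $u \in H^{k+1}(\mathcal{O})$. Thus if $f \in C^\infty(\Omega)$, then $u \in C^\infty(\mathcal{O})$ for all $\mathcal{O} \subset\subset \Omega$, so
$u \in C^\infty(\Omega)$. Furthermore, if $\Omega = M$ is a compact manifold without boundary,
then, for $k \ge 0$,

(1.33) $u \in H^1(M), \ Lu = f \in H^{k-1}(M) \Longrightarrow u \in H^{k+1}(M).$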


We also remark that the first order operator X in (1.15) could have matrix 
coefficients. The regularity result being localizable, we could suppose L operates 
on sections of a vector bundle, as long as the principal part of L has scalar coef- 
ficients. For example, Proposition 1.6 holds when L is the Laplace operator on 
p-forms. We will pursue this further in §8. 

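We now turn to a consideration of the following boundary problem for $u$:

(1.34) $\Delta u = 0 \text{ on } \Omega, \quad u|_{\partial\Omega} = f,$

where

(1.35) $f \in C^\infty(\partial\Omega)$

is given. Let $F \in C^\infty(\overline{\Omega})$ be constructed so that $F|_{\partial\Omega} = f$. Then (1.34) is
equivalent to

(1.36) $u = F + v,$

where

(1.37) $\Delta v = g = -\Delta F, \quad v|_{\partial\Omega} = 0.$

Since $g \in C^\infty(\overline{\Omega})$, we see that

(1.38) $v = Tg \in H_0^1(\Omega)$

satisfies (1.37), and by virtue of Theorem 1.3, $v \in C^\infty(\overline{\Omega})$. Thus, for any $f \in
C^\infty(\partial\Omega)$, we have a unique $u \in C^\infty(\overline{\Omega})$ solving (1.34), assuming each connected
component of $\overline{\Omega}$ has nonempty boundary. We denote the solution to (1.34) by

(1.39) $u = \text{PI}\, f.$

In analogy with Proposition 4.6 of Chap. 4, we have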
In analogy with Proposition 4.6 of Chap. 4, we have 


Proposition 1.7. The map (1.39) has a unique continuous extension

(1.40) $\text{PI} : H^s(\partial\Omega) \to H^{s+1/2}(\Omega), \quad s \ge \tfrac{1}{2}.$

Proof. It suffices to prove this for $s = k + 1/2$, $k = 0, 1, 2, \dots$, by interpolation.
Given $f \in H^{k+1/2}(\partial\Omega)$, there exists $F \in H^{k+1}(\Omega)$ such that $F|_{\partial\Omega} = f$, by
Proposition 1.7 of Chap. 4. Then $\text{PI}\, f = F + v$, where $v$ is defined by

$\Delta v = -\Delta F \in H^{k-1}(\Omega), \quad v \in H_0^1(\Omega).$

The regularity result of Theorem 1.3 gives $v \in H^{k+1}(\Omega)$, which establishes (1.40)
for $s = k + 1/2$.


We note that Proposition 4.6 of Chap. 4 was established for a slightly greater
range of $s$ than in (1.40), namely for $s \ge -1/2$. This was done by analyzing
the explicit formula for $\text{PI}\, f$, given by Fourier series. In Chap. 7, §12, we will
obtain an accurate approximation (called a parametrix) for PI, as well as solution
operators for other elliptic boundary problems. We advertise here the following
consequence of Proposition 12.4–Corollary 12.8 of Chap. 7.

Proposition 1.8. In the setting of Proposition 1.7, we have

$\text{PI} : H^s(\partial\Omega) \to H^{s+1/2}(\Omega), \quad s \ge -\tfrac{1}{2}.$

Furthermore, if $\mathcal{O} \subset \partial\Omega$ is open and $f \in H^s(\partial\Omega)$ is $C^\infty$ on $\mathcal{O}$, then $\text{PI}\, f$ is $C^\infty$
on a neighborhood in $\overline{\Omega}$ of $\mathcal{O}$.


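Amalgamating the equations $\Delta u = f$, $u|_{\partial\Omega} = 0$ and $\Delta u = 0$, $u|_{\partial\Omega} = g$, we
can solve the nonhomogeneous Dirichlet boundary problem

(1.41) $\Delta u = f, \quad u|_{\partial\Omega} = g.$

Given $g \in H^{k+1/2}(\partial\Omega)$ and $f \in H^{k-1}(\Omega)$, $k = 0, 1, 2, \dots$, there exists a unique
solution $u \in H^{k+1}(\Omega)$. Generalizing (1.16), we have the estimate

(1.42) $\|u\|^2_{H^{k+1}(\Omega)} \le C \|\Delta u\|^2_{H^{k-1}(\Omega)} + C \|u\|^2_{H^{k+1/2}(\partial\Omega)} + C \|u\|^2_{H^k(\Omega)},$

for all $u \in H^{k+1}(\Omega)$.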
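The reduction used for (1.41), namely writing $u = F + v$ with $F$ matching the boundary data and $v$ solving a problem with zero boundary values, can be tried out in one dimension. The Python sketch below is only an illustration under our own assumptions: $\Omega = (0,1)$, a second-difference discretization, a quadratic lift `F` chosen arbitrarily, and a tridiagonal solver `solve_dirichlet_zero` written for this example. It solves $u'' = f$, $u(0) = a$, $u(1) = b$, and compares with the exact solution.

```python
import math

def solve_dirichlet_zero(rhs, h):
    """Solve (v[i-1] - 2 v[i] + v[i+1]) / h^2 = rhs[i], with v = 0 at both ends.

    Uses the Thomas algorithm for the tridiagonal system (diagonal -2, off-diagonals 1).
    """
    n = len(rhs)
    c = [0.0] * n   # modified upper-diagonal entries
    d = [0.0] * n   # modified right-hand side
    c[0] = 1.0 / -2.0
    d[0] = rhs[0] * h * h / -2.0
    for i in range(1, n):
        denom = -2.0 - c[i - 1]
        c[i] = 1.0 / denom
        d[i] = (rhs[i] * h * h - d[i - 1]) / denom
    v = [0.0] * n
    v[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        v[i] = d[i] - c[i] * v[i + 1]
    return v

n = 200
h = 1.0 / (n + 1)
xs = [(i + 1) * h for i in range(n)]

a, b = 1.0, 3.0                                        # boundary values u(0), u(1)
f = lambda x: -math.pi ** 2 * math.sin(math.pi * x)    # right-hand side of u'' = f

# Lift of the boundary data (one of many possible choices): F(0) = a, F(1) = b.
F = lambda x: a + (b - a) * x * x
F_second = 2.0 * (b - a)                               # F'' is constant for this lift

# v solves v'' = f - F'' with zero boundary values; then u = F + v.
v = solve_dirichlet_zero([f(x) - F_second for x in xs], h)
u = [F(x) + vi for x, vi in zip(xs, v)]

exact = [math.sin(math.pi * x) + a + (b - a) * x for x in xs]
max_err = max(abs(p - q) for p, q in zip(u, exact))
print("max error against the exact solution:", max_err)
```

The computed $u$ does not depend on which lift $F$ is used, only on $f$ and the boundary values, in line with the uniqueness statement above.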
Next, we briefly consider existence of solutions to the more general equation

(1.43) $Lu = f, \quad u \in H_0^1(\Omega),$

where, as in (1.14), $L = -\Delta + X$, $X$ being a first-order differential operator on
$\overline{\Omega}$. With $T$ denoting the inverse of $\Delta$, as in (1.7), we look for a solution of the
form $u = Tv$, for some $v \in H^{-1}(\Omega)$. The equation (1.43) becomes

(1.44) $(I - XT) v = -f, \quad v \in H^{-1}(\Omega).$

Note that

(1.45) $XT : H^{-1}(\Omega) \to L^2(\Omega).$

By Rellich’s theorem, $XT$ is a compact operator on $H^{-1}(\Omega)$. Thus the Fredholm
alternative applies to the map $I - XT : H^{-1}(\Omega) \to H^{-1}(\Omega)$; this map is surjective
if and only if it is injective. Note that $v$ is in the kernel of this map if and
only if $u = Tv \in H_0^1(\Omega)$ is annihilated by $-\Delta + X$. We have established the
following.

Proposition 1.9. Given a first-order differential operator $X$ on $\overline{\Omega}$, the map

(1.46) $-\Delta + X : H_0^1(\Omega) \to H^{-1}(\Omega)$

is surjective if and only if it is injective.

Of course, given a solution to (1.43), the regularity results of Theorem 1.3
apply. In particular, any element of the kernel of $-\Delta + X$ belongs to $C^\infty(\overline{\Omega})$. We
will see in the next section that the map (1.46) is injective when $X$ is a real vector
field, so one has solvability in that case.

To close this section, we mention a few situations other than the Dirichlet
problem on a connected manifold with nonempty, smooth boundary. For example,
given a smooth, compact, connected manifold $M$ without boundary, we can
consider an open subset $\Omega$, whose closure has nonempty complement, making no
smoothness assumptions on $\partial\Omega$. Then one can still define $H_0^1(\Omega)$ as the
completion of $C_0^\infty(\Omega)$ with respect to either of the equivalent norms

$\bigl(\|du\|^2_{L^2(\Omega)} + \|u\|^2_{L^2(\Omega)}\bigr)^{1/2} \quad \text{or} \quad \|du\|_{L^2(\Omega)}.$

The estimate (1.5) continues to hold. See §7 of Chap. 4 for more details. One can
no longer identify $H_0^1(\Omega)^*$ with $H^{-1}(\Omega)$, but we still have

(1.47)  $\Delta : H_0^1(\Omega) \longrightarrow H_0^1(\Omega)^*,$

and the proof of Proposition 1.1 extends to show that the map (1.47) is bijective.
We have a natural injection $L^2(\Omega) \hookrightarrow H_0^1(\Omega)^*$, and the inverse operator

(1.48)  $T : H_0^1(\Omega)^* \longrightarrow H_0^1(\Omega)$

to $\Delta$ in (1.47) still restricts to a compact, self-adjoint operator on $L^2(\Omega)$. The
global regularity result of Theorem 1.3 does not extend, although of course, by
(1.22), one has such a regularity result on the interior. We will take a further look
at the Dirichlet problem on domains with nonsmooth boundaries in §5.

Another variation is the case where $\Omega$ is compact, without boundary. Then the
map $\Delta : H^1(\Omega) \to H^{-1}(\Omega)$ is not injective, since $1 \in \operatorname{Ker} \Delta$. But we have

(1.49)  $((-\Delta + 1)u, u) = \|du\|^2_{L^2(\Omega)} + \|u\|^2_{L^2(\Omega)},$

which gives

(1.50)  $-\Delta + 1 : H^1(\Omega) \longrightarrow H^{-1}(\Omega)$ bijective.

Its inverse,

(1.51)  $T_1 : H^{-1}(\Omega) \longrightarrow H^1(\Omega),$

is again seen to define a compact, self-adjoint operator on $L^2(\Omega)$, which is positive
by (1.49), so we again have an orthonormal basis $\{u_j\}$ of $L^2(\Omega)$ satisfying
$T_1 u_j = \mu_j u_j$, with $\mu_j \searrow 0$ and $\mu_0 = 1$. Hence

(1.52)  $\Delta u_j = -\lambda_j u_j, \quad \lambda_j = \dfrac{1}{\mu_j} - 1 \nearrow \infty.$

Of course, $\lambda_0 = 0$, with corresponding $u_0 = \text{const}$. By (1.22), the regularity result
of Theorem 1.3 extends to this case; we have $T_1 : H^{k-1}(\Omega) \to H^{k+1}(\Omega)$, for
$k = 0, 1, 2, \dots$, giving a two-sided inverse of the operator $-\Delta + 1 : H^{k+1}(\Omega) \to
H^{k-1}(\Omega)$. By interpolation, we have

(1.53)  $T_1 : H^s(\Omega) \longrightarrow H^{s+2}(\Omega),$

for real $s \ge -1$, giving a two-sided inverse of

(1.54)  $-\Delta + 1 : H^{s+2}(\Omega) \longrightarrow H^s(\Omega).$

Since $T_1 = T_1^*$, by duality (1.53) holds for all real $s$.

Returning to the case of $\overline{\Omega}$ with nonempty smooth boundary, we remark that
boundary problems other than the Dirichlet problem arise naturally, such as the
Neumann problem. We discuss some of these other boundary problems later in
this chapter.


Exercises 


1. Prove the following local boundary regularity result. If $u \in H_0^1(\Omega)$ and $Lu = f$, as
in Theorem 1.3, and if $f|_{O} \in H^k(O)$, for some open $O \subset \Omega$, with $\overline{O} \cap \partial\Omega$ perhaps
nonempty, then $u \in H^{k+2}(O')$ for any open $O' \subset O$ such that $\overline{O'} \subset O \cup \partial\Omega$.

(Hint: Recall the observation about (1.22).)

2. Let $T$ be the operator inverting $\Delta$, as in Proposition 1.2. Show that the largest eigenvalue
$\mu_0$ of $-T$ satisfies

$\mu_0 = \sup \{(-Tu, u) : u \in L^2(\Omega),\ \|u\|_{L^2} = 1\},$

and this supremum is achieved for $u = u_0$; in fact any $v$ for which this supremum is
achieved satisfies $Tv = -\mu_0 v$. Deduce that

(1.55)  $\lambda_0 = \inf \{\|du\|^2_{L^2(\Omega)} : u \in H_0^1(\Omega),\ \|u\|_{L^2} = 1\},$

and furthermore, for any $v \in H_0^1(\Omega)$ for which this infimum is achieved, $v$ is a
$\lambda_0$-eigenfunction of $-\Delta$.

3. Suppose $\Omega$ is an open region in $\mathbb{R}^n$, lying between two hyperplanes $x_1 = A$ and $x_1 =
B$. If $\lambda_0$ is the smallest eigenvalue of $-\Delta$, as in (1.55), show that

$\lambda_0 \ge \dfrac{\pi^2}{(B - A)^2}.$

(Hint: First consider the case $n = 1$.)

4. Show that the argument preceding Proposition 1.2 has the following generalization.
Let $H^0$ be a Hilbert space, $H^1$ a dense linear subspace, with a Hilbert space structure,
continuously injected in $H^0$. Denote by $H^{-1}$ the conjugate dual of $H^1$, so there are
continuous inclusions

$H^1 \subset H^0 \subset H^{-1}.$

Suppose $L : H^1 \to H^{-1}$ is continuous, bijective, and Hermitian symmetric. Let $T$
denote the restriction of $L^{-1} : H^{-1} \to H^1$ to $H^0$:

$T : H^0 \longrightarrow H^0.$

Show that $T$ is a bounded self-adjoint operator on $H^0$. Relate this to the Friedrichs
extension method, discussed in §8 of Appendix A.

5. Extend Proposition 1.1 to the case where $\Delta$ is the Laplace operator on $\overline{\Omega}$, endowed with
a continuous metric tensor.

6. Show that Theorem 1.3 holds in the case $k = 1$, provided $g_{ij}$ are Lipschitz on $\overline{\Omega}$.

7. Show that if $\Omega$ is a bounded open set in $\mathbb{R}^n$ with a $C^{1,1}$-boundary, one can smooth
out the boundary, transforming the Laplace operator to an operator to which Problem 6
applies. Why doesn't this work if $\Omega$ merely has a Lipschitz boundary?

8. Consider $Lu = \Delta u - V(x)u$, that is, $L$ of the form (1.15) with $Xu = V(x)u$. Show
that (1.45), and hence Proposition 1.8, hold, provided

$V \in L^n(\Omega), \quad n \ge 3,$

where $n = \dim \Omega$, given that

$H^1(\Omega) \subset L^{2n/(n-2)}(\Omega), \quad \text{for } n \ge 3,$

a result that will be established in §1 of Chap. 13. Try to show that Proposition 1.8 holds
under the even weaker hypothesis

$V \in L^q(\Omega), \quad q > \dfrac{n}{2}.$


2. The weak and strong maximum principles 


In this section, we take $M$ to be a smooth, compact Riemannian manifold without
boundary, and $\Omega$ to be a connected open subset of $M$, with nonempty boundary.
We will derive several results related to the maximum principle for second-order
differential operators of the form

(2.1)  $L = \Delta + X,$

where $X$ is a real vector field on $M$. In local coordinates, $L$ has the form

(2.2)  $L = g^{jk}(x)\,\partial_j \partial_k + b^j(x)\,\partial_j,$

with $(g^{jk}(x))$ the metric on cotangent vectors, a positive-definite matrix, and
$b^j(x)$ smooth and real-valued. We begin with the following, a weak maximum
principle.


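Proposition 2.1. Suppose $\Omega$ is an open bounded domain in $\mathbb{R}^n$ and $L$ is given by
(2.2), with coefficients smooth on a neighborhood of $\overline{\Omega}$. If $u \in C(\overline{\Omega}) \cap C^2(\Omega)$ and

(2.3)  $Lu \ge 0$ on $\Omega$,

then

(2.4)  $\sup_{x \in \overline{\Omega}} u(x) = \sup_{y \in \partial\Omega} u(y).$

Furthermore, if

(2.5)  $Lu = 0$ on $\Omega$,

then also

(2.6)  $\sup_{x \in \overline{\Omega}} |u(x)| = \sup_{y \in \partial\Omega} |u(y)|.$

Proof. First note that if $Lu > 0$ on $\Omega$, an interior maximum is impossible, since
$\partial_j u(x) = 0$ and $(\partial_j \partial_k u(x))$ is negative semidefinite at any interior maximum.
So we certainly have (2.4) in that case. To show that (2.3) $\Rightarrow$ (2.4), note that if
$\Omega \subset \mathbb{R}^n$,

$L(e^{\gamma x_1}) = \bigl(\gamma^2 g^{11}(x) + \gamma b^1(x)\bigr) e^{\gamma x_1} > 0,$

for $\gamma > 0$ large enough. Fix $\gamma$ so large that $L(e^{\gamma x_1}) > 0$ on $\overline{\Omega}$. Then, for any $\varepsilon > 0$,
$L(u + \varepsilon e^{\gamma x_1}) > 0$, so we have

$\sup_{x \in \overline{\Omega}} \bigl(u(x) + \varepsilon e^{\gamma x_1}\bigr) = \sup_{y \in \partial\Omega} \bigl(u(y) + \varepsilon e^{\gamma y_1}\bigr),$

for each $\varepsilon > 0$. Passing to the limit $\varepsilon \searrow 0$ yields (2.4). If (2.5) holds, then (2.4)
also holds with $u$ replaced by $-u$, which gives (2.6).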
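The weak maximum principle has an exact discrete analogue, which gives a quick sanity check. The following is a minimal sketch, not part of the text's argument: it assumes the unit square, the flat Laplacian replaced by the standard 5-point stencil, and Gauss-Seidel sweeps; the function names and the boundary data $x^2 - y^2 + 1$ are illustrative choices.

```python
def solve_dirichlet(n, g, sweeps=500):
    """Gauss-Seidel sweeps for the 5-point discrete Laplace equation on [0,1]^2."""
    h = 1.0 / (n - 1)
    u = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i in (0, n - 1) or j in (0, n - 1):
                u[i][j] = g(i * h, j * h)  # boundary data
    for _ in range(sweeps):
        for i in range(1, n - 1):
            for j in range(1, n - 1):
                # each interior value becomes the average of its four neighbours
                u[i][j] = 0.25 * (u[i - 1][j] + u[i + 1][j]
                                  + u[i][j - 1] + u[i][j + 1])
    return u


def boundary_and_interior_max(u):
    """Return (max over boundary nodes, max over interior nodes)."""
    n = len(u)

    def on_bdry(i, j):
        return i in (0, n - 1) or j in (0, n - 1)

    bmax = max(u[i][j] for i in range(n) for j in range(n) if on_bdry(i, j))
    imax = max(u[i][j] for i in range(n) for j in range(n) if not on_bdry(i, j))
    return bmax, imax


u = solve_dirichlet(21, lambda x, y: x * x - y * y + 1.0)
bmax, imax = boundary_and_interior_max(u)
print(bmax, imax)
```

Since every interior value is an average of its neighbours, the interior maximum cannot exceed the boundary maximum, which mirrors (2.4) in the case $Lu = 0$.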


In the following proposition, we will not need to suppose $\Omega \subset \mathbb{R}^n$; we resume
the hypotheses on $\Omega$ made at the beginning of this section. Proposition 2.3 will
contain the extension of Proposition 2.1 to this general class of domains; it will
also be sharper than Proposition 2.1 in other respects.

The following result, sometimes called Zaremba's principle, has many important
uses, including providing a tool to establish the strong maximum principle.

Proposition 2.2. In addition to the hypotheses above, suppose $\partial\Omega$ is smooth and
$u \in C^1(\overline{\Omega}) \cap C^2(\Omega)$. If $Lu \ge 0$ and if $y \in \partial\Omega$ is a point such that

(2.7)  $u(y) > u(x)$, for all $x \in \Omega$,

then, if $\nu$ denotes the inward-pointing normal to $\partial\Omega$,

(2.8)  $\dfrac{\partial u}{\partial \nu}(y) < 0.$

FIGURE 2.1 The Boundary Point y


Proof. Pick a coordinate system centered at $y$. In this coordinate system, put a
small ball $O$ in $\Omega$, whose boundary is tangent to $\partial\Omega$ at $y$, as illustrated in Fig. 2.1.
Let $p$ denote the center of $O$; let $R$ denote the radius of $O$ (in this coordinate
system; forget about the Riemannian metric on $\Omega$). Let

(2.9)  $r(x)^2 = |x - p|^2,$

for $x \in O$. A short calculation gives

(2.10)  $L\bigl(e^{-\alpha r^2}\bigr) = e^{-\alpha r^2}\Bigl[4\alpha^2 g^{jk}(x_j - p_j)(x_k - p_k) - 2\alpha\bigl(g^{j}{}_{j} + b^j (x_j - p_j)\bigr)\Bigr].$

What can be deduced from this calculation is that if $\rho \in (0, R)$ is fixed, then, for
$\alpha > 0$ sufficiently large,

(2.11)  $w = e^{-\alpha r^2} - e^{-\alpha R^2}$

implies

(2.12)  $Lw > 0$ on the shell $A$,

defined by

(2.13)  $A = \{x \in O : r(x) > \rho\}$

(see Fig. 2.2). Consequently, for any $\varepsilon > 0$, if $w$ is given by (2.11), then $Lu \ge 0$
implies

(2.14)  $L(u + \varepsilon w) > 0$ on $A$,

so, by Proposition 2.1, we have

(2.15)  $\sup_{A} (u + \varepsilon w) = \sup_{\partial A} (u + \varepsilon w).$

FIGURE 2.2 The Shell

Note that $w = 0$ on $\partial O = \{r(x) = R\}$. Since, by the hypothesis (2.7),

(2.16)  $\sup_{\{r(x) = \rho\}} u(x) < u(y),$

we see that, for $\varepsilon > 0$ sufficiently small, the right side of (2.15) is equal to $u(y)$.
Fix $\varepsilon$, sufficiently small. Then (2.15) yields

(2.17)  $u(x) + \varepsilon w(x) \le u(y)$, for all $x \in A$,

and hence

(2.18)  $\dfrac{u(y) - u(x)}{|y - x|} \ge \varepsilon\, \dfrac{w(x)}{|y - x|} = \varepsilon\, \dfrac{w(x) - w(y)}{|y - x|},$

since $w(y) = 0$. Now the formula (2.11) for $w$ implies

(2.19)  $\dfrac{\partial w}{\partial \nu}(y) > 0,$

so letting $x \to y$ along the normal to $\partial\Omega$ at $y$ gives (2.8), as a consequence of
(2.18).


We can now elevate Proposition 2.1 to the strong maximum principle. In this
result, we do not need any smoothness on $\partial\Omega$.

Proposition 2.3. If $u \in C(\overline{\Omega}) \cap C^2(\Omega)$ and $Lu \ge 0$, then either $u$ is constant, or

(2.20)  $u(x) < \sup_{z \in \partial\Omega} u(z)$, for all $x \in \Omega$.

Proof. First we prove the weaker estimate (2.4) in this case. Indeed, if this
estimate fails, $u$ must have an interior maximum, say on a nonempty compact
set $K \subset \Omega$. If we put a ball $D$ in $\Omega \setminus K$, touching $K$ at a point $y$, then Zaremba's
principle (2.8), applied to functions on $D$, contradicts the fact that we must have
$du = 0$ at $y$. This shows that $K$ is empty.

Now, if $u$ is not constant, we see that $O$, the set of points $x \in \Omega$ where $u(x) <
\sup_{z \in \partial\Omega} u(z)$, is nonempty and open in $\Omega$. If $O$ is not all of $\Omega$, pick $p_0$ in the
boundary of $O$ in $\Omega$, as illustrated in Fig. 2.3. Then pick $q_0 \in O$ closer to $p_0$ than
to $\partial\Omega$, and let $D$ be the largest ball, centered at $q_0$, lying in $O$. Then $\partial D$ intersects
$\Omega \setminus O$ at (at least) one point; call it $y$.

Since $y \notin O$, we must have $u(y) = \sup_{z \in \partial\Omega} u(z) = \sup_{x \in \Omega} u(x)$. This implies both
that

(2.21)  $du(y) = 0$

and that

(2.22)  $u(y) > u(x)$, for all $x \in D$.

Again, this contradicts Zaremba's principle for $u \in C^1(\overline{D}) \cap C^2(D)$ satisfying
$Lu \ge 0$ in $D$. Proposition 2.3 is proved.


FIGURE 2.3 Applying Zaremba's Principle

In case $\overline{\Omega}$ is a smooth, connected, compact manifold with nonempty smooth
boundary, recall from §1 that we have a map

(2.23)  $\mathrm{PI} : C^\infty(\partial\Omega) \longrightarrow C^\infty(\overline{\Omega})$

with the property that, for $f \in C^\infty(\partial\Omega)$, $\mathrm{PI}\, f = u$ is the unique element of
$C^\infty(\overline{\Omega})$ satisfying

(2.24)  $\Delta u = 0$ on $\Omega$, $\quad u|_{\partial\Omega} = f.$

It follows from Proposition 2.3 that

(2.25)  $\sup_{\overline{\Omega}} |u| = \sup_{\partial\Omega} |f|.$

Consequently, the map (2.23) has a unique continuous extension to

(2.26)  $\mathrm{PI} : C(\partial\Omega) \longrightarrow C(\overline{\Omega}).$

We will discuss the situation where $\partial\Omega$ is not smooth in §5.

Using the strong maximum principle, we draw a conclusion about the fundamental
eigenspace of $\Delta$. Let $\lambda_0$ be the smallest eigenvalue of $-\Delta$, as in (1.13);
$\lambda_0 > 0$. Assume $\overline{\Omega}$ is a connected, compact manifold with nonempty smooth
boundary.


Proposition 2.4. If $u_0 \in H_0^1(\Omega)$ is an eigenfunction for $-\Delta$ corresponding to
$\lambda_0$, that is,

(2.27)  $\Delta u_0 = -\lambda_0 u_0,$

then $u_0$ is nowhere vanishing on the interior of $\Omega$.

Proof. We have $u_0 \in C^\infty(\overline{\Omega})$. Define $u_0^+$ and $u_0^-$, respectively, by

$u_0^+(x) = \max(u_0(x), 0), \quad u_0^-(x) = \min(u_0(x), 0).$

It is easy to see that

(2.28)  $u_0^+,\ u_0^- \in H_0^1(\Omega)$

and

(2.29)  $\|du_0^\pm\|^2_{L^2(\Omega)} = \int_{\Omega^\pm} |du_0|^2 \, dV,$

where

$\Omega^\pm = \{x \in \Omega : \pm u_0(x) > 0\}.$

Next we invoke the variational characterization (1.55) of $\lambda_0$ and associated
eigenfunctions. It follows that either $u_0^+$ or $u_0^-$ must be a $\lambda_0$-eigenfunction of $-\Delta$.
Therefore, Proposition 2.4 will be proved if we show that its conclusion holds
under the additional hypothesis that

(2.30)  $u_0(x) \ge 0$ on $\Omega$.

Indeed, if this holds, then (2.27) yields

(2.31)  $\Delta(-u_0) = \lambda_0 u_0 \ge 0$ on $\Omega$.

Thus Proposition 2.3 applies to $-u_0$, so, since $u_0|_{\partial\Omega} = 0$,

(2.32)  $-u_0(x) < 0$, for all $x \in \Omega$.

This finishes the proof of Proposition 2.4.

Corollary 2.5. If $\lambda_0$ is the smallest eigenvalue of $-\Delta$ for $\Omega$, with Dirichlet
boundary conditions, as in Proposition 2.4, then the corresponding $\lambda_0$-eigenspace
is one-dimensional.

Proof. If there were a $\lambda_0$-eigenvector $u_1$ orthogonal to $u_0$, then $u_1$ would have to
change sign in $\Omega$, contradicting Proposition 2.4.
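Proposition 2.4 and Corollary 2.5 can be observed numerically in the simplest case. The sketch below is only an illustration under stated assumptions: $\Omega = (0,1)$, the operator $-d^2/dx^2$ replaced by the usual second-difference matrix on a uniform grid, and the ground state computed by inverse power iteration (that is, by iterating a discrete stand-in for the operator $-T$ of §1). The helper names and the starting vector are hypothetical.

```python
import math


def solve_dirichlet_1d(rhs, h):
    """Solve A u = rhs by the Thomas algorithm, where A = tridiag(-1, 2, -1) / h^2
    discretizes -d^2/dx^2 on (0, 1) with Dirichlet boundary conditions."""
    n = len(rhs)
    a, b = -1.0 / h**2, 2.0 / h**2  # off-diagonal and diagonal entries
    cp, dp = [0.0] * n, [0.0] * n
    cp[0], dp[0] = a / b, rhs[0] / b
    for i in range(1, n):
        denom = b - a * cp[i - 1]
        cp[i] = a / denom
        dp[i] = (rhs[i] - a * dp[i - 1]) / denom
    u = [0.0] * n
    u[-1] = dp[-1]
    for i in range(n - 2, -1, -1):
        u[i] = dp[i] - cp[i] * u[i + 1]
    return u


def ground_state(n=99, iterations=60):
    """Inverse power iteration for the smallest eigenvalue of the discrete -Laplacian."""
    h = 1.0 / (n + 1)
    v = [(i % 3) - 0.7 for i in range(n)]  # arbitrary start that changes sign
    mu = 0.0
    for _ in range(iterations):
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        w = solve_dirichlet_1d(v, h)
        mu = sum(x * y for x, y in zip(v, w))  # Rayleigh quotient of the inverse
        v = w
    norm = math.sqrt(sum(x * x for x in v))
    return 1.0 / mu, [x / norm for x in v]


lam0, u0 = ground_state()
print(lam0, math.pi**2, min(u0), max(u0))
```

The computed eigenvalue is close to $\pi^2$, the smallest Dirichlet eigenvalue of $-d^2/dx^2$ on $(0,1)$, and the computed eigenvector has a single sign even though the starting vector did not.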


The following result, involving a zero-order term, is often useful. With $L$ as in
(2.2), let

(2.33)  $\widetilde{L} u = Lu - c(x)u.$

We assume $c \in C(\overline{\Omega})$, with $\Omega \subset \mathbb{R}^n$, bounded.

Proposition 2.6. Suppose $c(x) \ge 0$ in (2.33). For $u, v \in C^2(\Omega) \cap C(\overline{\Omega})$,

(2.34)  $\widetilde{L} u \le \widetilde{L} v$ on $\Omega$, $\ u \ge v$ on $\partial\Omega \ \Longrightarrow\ u \ge v$ on $\Omega$.

Proof. By linearity, it suffices to show that

$\widetilde{L} v \ge 0$ on $\Omega$, $\ v \le 0$ on $\partial\Omega \ \Longrightarrow\ v \le 0$ on $\Omega$.

If we let $O = \{x \in \Omega : v(x) > 0\}$, then $Lv \ge cv \ge 0$ on $O$, and $v = 0$ on $\partial O$.
But Proposition 2.1 implies $\sup_{O} v = \sup_{\partial O} v$. This is impossible if $O \ne \emptyset$.

Corollary 2.7. If $c(x) \ge 0$ and $\widetilde{L} u = 0$, then, with $a = \sup_{\partial\Omega} u$, we have

(2.35)  $a \ge 0 \Rightarrow \sup_{\Omega} u = a$, and $a \le 0 \Rightarrow \sup_{\Omega} u \le 0$.

Proof. The first implication follows from (2.34), with $(u, v)$ replaced by $(a, u)$,
since $a \ge 0 \Rightarrow \widetilde{L} a \le 0$. For the second implication, let $O = \{x \in \Omega : u(x) > 0\}$.
If $O \ne \emptyset$, we must have $\overline{O} \subset \Omega$ and $u = 0$ on $\partial O$. But now the first implication
of (2.35) applies to $u|_{O}$, so we have a contradiction.


In case $L = \Delta$, there is the following useful strengthening of Proposition 2.6.

Proposition 2.8. Assume $c \in C(\overline{\Omega})$ and that $\widetilde{L} = \Delta - c$ is negative-definite with
the Dirichlet boundary condition, that is,

(2.36)  $-\|du\|^2_{L^2} - (cu, u) < 0$, for nonzero $u \in H_0^1(\Omega)$.

Then, for $v \in H^1(\Omega)$,

(2.37)  $(\Delta - c)v \ge 0$ on $\Omega$, $\ v \le 0$ on $\partial\Omega \ \Longrightarrow\ v \le 0$ on $\Omega$.

Proof. Let $v_+ = \max(v, 0)$. Then the hypotheses in (2.37) imply that $v_+ \in
H_0^1(\Omega)$ and

$-(dv, dv_+) - (cv, v_+) \ge 0.$

Since $(dv, dv_+) = (dv_+, dv_+)$ and $(cv, v_+) = (cv_+, v_+)$ in this case, it follows that
$-(dv_+, dv_+) - (cv_+, v_+) \ge 0$. By (2.36) this implies $v_+ = 0$, proving the proposition.


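Further results involving zero-order terms are given in the exercises.
To close this section, we discuss the extension of (2.26) to

(2.38)  $\mathrm{PI} : L^\infty(\partial\Omega) \longrightarrow L^\infty(\Omega).$

In fact, by Proposition 1.8,

(2.39)  $\mathrm{PI} : L^2(\partial\Omega) \longrightarrow H^{1/2}(\Omega).$

Given $f \in L^\infty(\partial\Omega)$, we can use a partition of unity, and mollifiers, to construct

(2.40)  $f_\nu \in C^\infty(\partial\Omega), \quad \|f_\nu\|_{L^\infty} \le \|f\|_{L^\infty}, \quad f_\nu \to f,$

the convergence holding pointwise a.e., hence in $L^p$-norm for all $p < \infty$ (in
particular, for $p = 2$). Then $\mathrm{PI}\, f_\nu \in C^\infty(\overline{\Omega})$, $\mathrm{PI}\, f_\nu \to \mathrm{PI}\, f$ in $H^{1/2}(\Omega)$, and
$\|\mathrm{PI}\, f_\nu\|_{L^\infty} \le \|f_\nu\|_{L^\infty} \le \|f\|_{L^\infty}$, so $\|\mathrm{PI}\, f\|_{L^\infty(\Omega)} \le \|f\|_{L^\infty(\partial\Omega)}$. The following
is a consequence of the local regularity result stated in Proposition 1.8, together
with (2.26).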


Proposition 2.9. Let $\overline{\Omega}$ be a smooth, connected, compact manifold with
nonempty, smooth boundary, and let $O \subset \partial\Omega$ be open. If $f \in L^\infty(\partial\Omega)$ (or
more generally $f \in L^2(\partial\Omega)$) and $f$ is continuous on $O$, then $\mathrm{PI}\, f$ is continuous
on a neighborhood in $\overline{\Omega}$ of $O$.


Exercises


1. If $\widetilde{L}$ is as in (2.33) with $c(x) \ge 0$ and $u \in C^2(\Omega) \cap C(\overline{\Omega})$ satisfies $\widetilde{L} u = 0$, show
that $\|u\|_{L^\infty(\Omega)} = \|u\|_{L^\infty(\partial\Omega)}$. (Hint: Supplement (2.35) with the following for $\beta =
\inf_{\partial\Omega} u$:

$\beta \le 0 \Rightarrow \inf_{\Omega} u = \beta, \quad \beta \ge 0 \Rightarrow \inf_{\Omega} u \ge 0$.)

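2. Show that if $u, v \in C^2(\Omega) \cap C(\overline{\Omega})$, $f \in C^1(\mathbb{R})$, $f'(t) \ge 0$, and

$-Lu + f(u) \le -Lv + f(v)$ on $\Omega$, $\quad u \le v$ on $\partial\Omega$,

then $u \le v$ on $\Omega$. (Hint: Let $w = u - v$. Then $-Lw + c(x)w \le 0$, with
$c(x) = \int_0^1 f'\bigl(v(x) + t\,w(x)\bigr)\,dt \ge 0$.)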


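3. Suppose $u \in C^2(\Omega) \cap C(\overline{\Omega})$ satisfies

(2.41)  $Lu = f, \quad u|_{\partial\Omega} = g.$

Suppose $V \in C^2(\Omega) \cap C(\overline{\Omega})$ satisfies

(2.42)  $LV = 1, \quad V|_{\partial\Omega} = 0.$

(Note that $V \le 0$ in $\Omega$.) Show that, for $x \in \Omega$,

(2.43)  $u(x) \ge (\sup f_+)\,V(x) + (\inf g), \quad u(x) \le (\inf f_-)\,V(x) + (\sup g).$

(Hint: Compare $u$ respectively with $v = (\sup f_+)V + (\inf g)$ and with $v =
(\inf f_-)V + (\sup g)$. In the first case, show that $Lu \le Lv$ on $\Omega$ and $u \ge v$ on $\partial\Omega$.)

In case $\Omega$ is a bounded region in $\mathbb{R}^n$ and $\Delta$ is the flat Laplacian, apply this with

$V(x) = \dfrac{1}{2n}\bigl(|x - x_0|^2 - R^2\bigr), \quad R = \sup_{x \in \Omega} |x - x_0|.$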


4. Extend estimates of Exercise 3 to the case

(2.44)  $[L - c(x)]u = f, \quad u|_{\partial\Omega} = g,$

under the hypothesis $c(x) \ge 0$. Show that if $V$ satisfies (2.42), then (2.43) holds, with
$\inf g$ replaced by $\inf g_-$ and $\sup g$ replaced by $\sup g_+$.

In Exercises 5 and 6, we outline an approach to estimates for a solution $v$ to

(2.45)  $[\Delta - c(x)]v = f, \quad v|_{\partial\Omega} = g,$

where, rather than $c(x) \ge 0$, we assume that $\Delta - c(x)$ is negative-definite, with the
Dirichlet boundary condition, as in Proposition 2.8. For example, we might have

(2.46)  $c(x) \ge \mu > -\lambda_0,$

$\lambda_0$ being the smallest eigenvalue of $-\Delta$ on $\Omega$, with the Dirichlet boundary condition.

5. Set $v = Fu$ with $F \in C^2(\overline{\Omega})$, $F \ge 1$ on $\overline{\Omega}$. Show that (2.45) is equivalent to

$Lu - c_1(x)u = \dfrac{f}{F}, \quad u|_{\partial\Omega} = \dfrac{g}{F},$

where

$Lu = \Delta u + 2F^{-1}\langle \nabla F, \nabla u \rangle$, and $c_1(x) = -F^{-1}\,[\Delta F - c(x)F]$.

6. Estimates derived in Exercise 4 will apply to $u$ in Exercise 5, provided

(2.47)  $\Delta F - c(x)F \le 0$ on $\Omega$, $\quad F \ge 1$ on $\overline{\Omega}$.

By Proposition 2.8, this holds for $F = 1 + F_1$, provided

$\Delta F_1 - c(x)F_1 \le c_-(x)$ on $\Omega$, $\quad F_1 \ge 0$ on $\partial\Omega$.

Given (2.46), with $\mu < 0$, this holds provided

$(\Delta - \mu)F_1 \le \mu$ on $\Omega$, $\quad F_1 \ge 0$ on $\partial\Omega$.

Using these results, provide estimates for solutions to (2.45), under the hypothesis
(2.46).

3. The Poisson integral on the ball in $\mathbb{R}^n$


If $B = \{x \in \mathbb{R}^n : |x| < 1\}$ is the unit ball in $\mathbb{R}^n$, with boundary $\partial B = S^{n-1}$, the
unit sphere, we know there is a unique map

(3.1)  $\mathrm{PI} : C(S^{n-1}) \longrightarrow C(\overline{B}) \cap C^\infty(B)$

satisfying

(3.2)  $u = \mathrm{PI}\, f \iff \Delta u = 0$ in $B$, $\ u|_{S^{n-1}} = f.$

We also know that

(3.3)  $\mathrm{PI} : H^s(S^{n-1}) \longrightarrow H^{s+1/2}(B)$, for $s \ge -\tfrac{1}{2}$,

and in particular

(3.4)  $\mathrm{PI} : C^\infty(S^{n-1}) \longrightarrow C^\infty(\overline{B}).$

Our goal here is to produce an explicit integral formula for this solution operator.
Before deriving this explicit formula, we record the classical mean-value property,
which has been proved in §2 of Chap. 3.


Proposition 3.1. For $f \in C(S^{n-1})$, $u = \mathrm{PI}\, f$ satisfies

(3.5)  $u(0) = \operatorname{Avg}_{S^{n-1}} f = \dfrac{1}{A_{n-1}} \int_{S^{n-1}} f(\omega)\, dS(\omega),$

where $A_{n-1}$ is the area of $S^{n-1}$; $A_{n-1} = 2\pi^{n/2}/\Gamma(n/2)$.

In view of its fundamental nature, we give two more proofs of this result, one
based on the rotational symmetry of the Laplace operator, the other based on
Green's formula.

For the first proof, let

(3.6)  $v(x) = \operatorname{Avg}_{SO(n)} u(g \cdot x) = \int_{SO(n)} u(g \cdot x)\, dg$

be the average of the set of rotates of $u(x)$. Then, since $\Delta$ is rotationally invariant,
we have

(3.7)  $\Delta v = 0$ on $B$.

Now, clearly,

(3.8)  $v|_{S^{n-1}} = \operatorname{Avg}_{S^{n-1}} f = C$

and

(3.9)  $v(0) = u(0).$

But a solution to (3.7)–(3.8) is

(3.10)  $v_0(x) = C,$

and by the maximum principle this solution must be unique. Thus the conclusion
(3.5) follows from (3.9) and (3.10).

As was already noted in §2 of Chap. 3, we could also obtain uniqueness by
applying Green's formula

(3.11)  $(dw, dw) = -(\Delta w, w) + \int_{\partial B} w\, \dfrac{\partial w}{\partial \nu}\, dS$

to $w = v - v_0$, at least if we know $w \in C^2(\overline{B})$, which in this case would follow
from $u \in C^2(\overline{B})$. To pass to general $u \in C(\overline{B})$, harmonic in $B$, we can replace
$u(x)$ by $u_\rho(x) = u(\rho x)$ for $\rho < 1$, which belongs to $C^\infty(\overline{B})$ since we know
$u \in C^\infty(B)$. Then passing to the limit $\rho \nearrow 1$ yields another variation on the
proof of Proposition 3.1 (which is not counted as the second proof).


Our second proof uses Green's formula:

(3.12)  $(\Delta u, v)_{L^2(B)} = (u, \Delta v)_{L^2(B)} + \int_{\partial B} \Bigl(v\, \dfrac{\partial u}{\partial \nu} - u\, \dfrac{\partial v}{\partial \nu}\Bigr)\, dS.$

We will use this in a slightly different context than before, via the following result,
which is an exercise in distribution theory.

Lemma 3.2. The formula (3.12) is valid provided $u \in C^\infty(\overline{B})$ and $v$ is a
distribution on $B$, equal near $\partial B$ to a function in $C^\infty(\overline{B})$.

Thus, we apply (3.12) to $u = \mathrm{PI}\, f$, assumed to be in $C^\infty(\overline{B})$, and to

(3.13)  $v(x) = 1 - |x|^{2-n}$; $\quad v(x) = \log|x|$ if $n = 2$.

As shown in Proposition 4.9 of Chap. 3, we have

(3.14)  $\Delta v = C_n \delta, \quad C_n = (n-2)A_{n-1}, \quad C_2 = 2\pi.$

Since $v = 0$ on $\partial B$, while $\partial v/\partial r = n - 2$ on $\partial B$, (3.12) yields

(3.15)  $(n-2)A_{n-1}\, u(0) = (n-2) \int_{S^{n-1}} u(x)\, dS(x),$

with an obvious modification for $n = 2$. We can go from $u \in C^\infty(\overline{B})$ to $u \in C(\overline{B})$
by the limiting argument described above. This completes the second proof. See
the exercises for yet another proof.

Of course, one could use the mean-value property, established via the second
proof, to derive the maximum principle for harmonic functions on open regions in
$\mathbb{R}^n$, as was done in Chap. 3, §2. The advantage of the method of §2 of this chapter
is its much more general applicability.

We now tackle our main goal of this section, which is to obtain an explicit
integral formula for the map (3.1). First we recall analogous computations performed
in Chap. 3. As shown in (5.21) of Chap. 3,

$\mathrm{PI} : \mathcal{S}(\mathbb{R}) \longrightarrow C^\infty(\mathbb{R}^2_+)$

is given by

(3.16)  $u(y, x) = \dfrac{y}{\pi} \int_{-\infty}^{\infty} \dfrac{f(x')}{y^2 + (x - x')^2}\, dx'.$

Formula (5.24) of that chapter shows that, more generally,

$\mathrm{PI} : \mathcal{S}(\mathbb{R}^{n-1}) \longrightarrow C^\infty(\mathbb{R}^n_+)$

is given by

(3.17)  $u(y, x) = c_{n-1}\, y \int_{\mathbb{R}^{n-1}} \dfrac{f(x')}{\bigl[y^2 + |x - x'|^2\bigr]^{n/2}}\, dx'.$

Also, formula (2.5) of Chap. 3 shows that

$\mathrm{PI} : C^\infty(S^1) \longrightarrow C^\infty(D)$

is given by

(3.18)  $u(x) = \dfrac{1 - |x|^2}{2\pi} \int_{S^1} \dfrac{f(x')}{|x - x'|^2}\, dS(x').$

In order to define PI on $C^\infty(S^{n-1})$, there are systematic methods, involving
conformal transformations, which are used in many texts, such as [Hel] and [Keg].
The method we will use here is the method of the "inspired guess," based on
extrapolation from (3.16)–(3.18). Note that (3.16) and (3.17) differ only in the
constant factor in front and in the exponent on $(y^2 + |x - x'|^2)$ in the integrand.
The denominator in the integrand in (3.17) is the $n$th power of the distance from
$(0, x')$ to $(y, x)$ in $\mathbb{R}^n$. This makes it very tempting to try to generalize (3.18) to

(3.19)  $u(x) = c_n' \,(1 - |x|^2) \int_{S^{n-1}} \dfrac{f(x')}{|x - x'|^n}\, dS(x'),$

for $u = \mathrm{PI}\, f$, $f \in C^\infty(S^{n-1})$. We have only to show that this works. First we
show that $u$ is harmonic in $B$. This is a consequence of the following.

Lemma 3.3. For a given $x' \in S^{n-1}$ (i.e., $|x'| = 1$), set

(3.20)  $v(x) = (1 - |x|^2)\,|x - x'|^{-n}.$

Then $v$ is harmonic on $\mathbb{R}^n \setminus \{x'\}$.


One can apply $\Delta$ to (3.20) in a straightforward manner, but the formulas can
get very bulky if produced naively, so we give a clean route to the calculations. It
suffices to show that $w(x) = v(x + x')$ is harmonic on $\mathbb{R}^n \setminus 0$. Since $1 - |x + x'|^2 =
-(2x \cdot x' + |x|^2)$ provided $|x'| = 1$, we have

(3.21)  $-w(x) = 2(x' \cdot x)\,|x|^{-n} + |x|^{2-n}.$

That $|x|^{2-n}$ is harmonic on $\mathbb{R}^n \setminus 0$ we already know, as a consequence of the
formula for $\Delta$ in polar coordinates, which yields

(3.22)  $g(x) = \varphi(r) \ \Longrightarrow\ \Delta g = \varphi''(r) + \dfrac{n-1}{r}\,\varphi'(r).$

Now, applying $\partial/\partial x_j$ to a harmonic function on an open set in $\mathbb{R}^n$ gives another,
so the following are harmonic on $\mathbb{R}^n \setminus 0$:

(3.23)  $w_j(x) = \dfrac{\partial}{\partial x_j}\,|x|^{2-n} = (2 - n)\,x_j\,|x|^{-n}.$

For $n = 2$, we take

(3.24)  $w_j(x) = \dfrac{\partial}{\partial x_j}\,\log|x| = x_j\,|x|^{-2}.$

Thus the first term on the right side of (3.21) is a linear combination of these
functions, so the lemma is established.
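Lemma 3.3 is easy to test numerically. The sketch below is only a sanity check under stated assumptions: $n = 3$, a second-order central-difference approximation to $\Delta$ with step $h = 10^{-3}$, and an arbitrarily chosen sample point and pole; the function names are illustrative. As a control, the exponent $-n$ is replaced by $-(n-1)$, which destroys harmonicity.

```python
def v(x, xp):
    """v(x) = (1 - |x|^2) |x - x'|^{-n} from (3.20); n is the length of x."""
    n = len(x)
    r2 = sum(t * t for t in x)
    d2 = sum((a - b) ** 2 for a, b in zip(x, xp))
    return (1.0 - r2) * d2 ** (-n / 2.0)


def discrete_laplacian(f, x, h=1e-3):
    """Second-order central-difference approximation of the Laplacian of f at x."""
    total = 0.0
    fx = f(x)
    for j in range(len(x)):
        xp = list(x)
        xm = list(x)
        xp[j] += h
        xm[j] -= h
        total += (f(xp) - 2.0 * fx + f(xm)) / h**2
    return total


pole = [0.0, 0.0, 1.0]      # a point x' on the unit sphere S^2
point = [0.1, 0.2, -0.3]    # an arbitrary point away from the pole


def control(x):
    """Same shape as v, but with exponent -(n - 1) = -2 instead of -n = -3."""
    d2 = sum((a - b) ** 2 for a, b in zip(x, pole))
    return (1.0 - sum(t * t for t in x)) / d2


lap_v = discrete_laplacian(lambda x: v(x, pole), point)
lap_control = discrete_laplacian(control, point)
print(lap_v, lap_control)
```

The first value is zero up to discretization error, while the control is of order one, so the exponent $-n$ in (3.20) is essential.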

To justify (3.19), it remains to show that if $u$ is given by this formula, and $c_n'$ is
chosen correctly, then $u = f$ on $S^{n-1}$. Note that if we write $x = r\omega$, $\omega \in S^{n-1}$,
then (3.19) gives

(3.25)  $u(r\omega) = \int_{S^{n-1}} p(r, \omega, \omega')\, f(\omega')\, dS(\omega'),$

where

(3.26)  $p(r, \omega, \omega') = c_n'\,(1 - r^2)\,|r\omega - \omega'|^{-n}.$

It is clear that

(3.27)  $p(r, \omega, \omega') \to 0$ as $r \nearrow 1$ if $\omega \ne \omega'$.

We claim that

(3.28)  $\int_{S^{n-1}} p(r, \omega, \omega')\, dS(\omega') = c_n'',$

a constant independent of $r$. By rotational invariance, this integral is clearly
independent of $\omega$. Thus we could integrate with respect to $\omega$. But Lemma 3.3 implies
that

(3.29)  $p(r, x, \omega') = c_n'\,(1 - r^2|x|^2)\,|rx - \omega'|^{-n}$

is harmonic in $x$, for $|x| < 1/r$, so the mean-value theorem gives

(3.30)  $\dfrac{1}{A_{n-1}} \int_{S^{n-1}} p(r, \omega, \omega')\, dS(\omega) = c_n',$

for all $r < 1$, $\omega' \in S^{n-1}$. This implies (3.28), with $c_n'' = c_n' A_{n-1}$. Thus, in view
of (3.27), $p(r, \omega, \omega')$ is highly peaked near $\omega = \omega'$ as $r \nearrow 1$, so the limit of
(3.25) as $r \nearrow 1$ is equal to $c_n' A_{n-1} f(\omega)$, for any $f \in C(S^{n-1})$. This justifies the
formula (3.19) and fixes the constant:

(3.31)  $c_n' = \dfrac{1}{A_{n-1}}.$

We summarize:

Proposition 3.4. The map (3.1) is given by the Poisson integral formula

(3.32)  $u(x) = \dfrac{1 - |x|^2}{A_{n-1}} \int_{S^{n-1}} \dfrac{f(x')}{|x - x'|^n}\, dS(x').$
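As a numerical sanity check of the Poisson integral formula (3.32), one can take $n = 2$ (so $A_1 = 2\pi$) and approximate the integral over the unit circle by the trapezoid rule in the angle. This is a sketch under those assumptions; the function name, the number of quadrature nodes, and the test data $f(\theta) = \cos 2\theta$, the boundary values of the harmonic polynomial $x_1^2 - x_2^2$, are illustrative choices.

```python
import math


def poisson_integral_disk(f, x1, x2, m=400):
    """Approximate (3.32) for n = 2: u(x) = (1 - |x|^2) / (2 pi) times the integral
    of f(x') / |x - x'|^2 over the unit circle (trapezoid rule in the angle)."""
    r2 = x1 * x1 + x2 * x2
    total = 0.0
    for k in range(m):
        theta = 2.0 * math.pi * k / m
        c, s = math.cos(theta), math.sin(theta)
        total += f(theta) / ((x1 - c) ** 2 + (x2 - s) ** 2)
    arc = 2.0 * math.pi / m  # arc length attached to each quadrature node
    return (1.0 - r2) / (2.0 * math.pi) * total * arc


def f(theta):
    return math.cos(2.0 * theta)  # boundary values of x1^2 - x2^2


u_val = poisson_integral_disk(f, 0.5, 0.3)               # compare with 0.25 - 0.09
mass = poisson_integral_disk(lambda t: 1.0, 0.5, 0.3)    # total mass of the kernel
center = poisson_integral_disk(f, 0.0, 0.0)              # mean value of f
print(u_val, mass, center)
```

The three numbers illustrate, respectively, that (3.32) reproduces a known harmonic function, that the kernel has total mass one as in (3.28) and (3.31), and the mean-value property (3.5).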


Applications of the Poisson integral formula 


We will use the Poisson integral formula to establish some further results about
harmonic functions on domains in $\mathbb{R}^n$. We start with the following removable
singularity theorem. Take $B = B_1(0)$.

Proposition 3.5. Assume $u \in C^2(B \setminus 0) \cap C(\overline{B} \setminus 0)$ is harmonic on $B \setminus 0$ and
bounded, i.e., there exists $M < \infty$ such that

(3.33)  $|u(x)| \le M, \quad \forall\, x \in B \setminus 0.$

Then $u$ can be extended (in a unique fashion) to be harmonic on all of $B$.

Proof. Let $f = u|_{\partial B} \in C(\partial B)$ and set

(3.34)  $v = \mathrm{PI}\, f, \quad v \in C(\overline{B}) \cap C^2(B).$

We claim $v = u$ on $B \setminus 0$. To this end, consider $w = u - v$ on $B \setminus 0$. We have
$w \in C(\overline{B} \setminus 0) \cap C^2(B \setminus 0)$, $\Delta w = 0$ on $B \setminus 0$, and $w = 0$ on $\partial B$. Also, $|w| \le 2M$
on $B \setminus 0$. We claim $w = 0$. To show this, we can assume that $w$ is real valued.

Now bring in the function $H \in C(\overline{B} \setminus 0) \cap C^2(B \setminus 0)$, given by

(3.35)  $H(x) = |x|^{2-n} - 1$ if $n \ge 3$, $\quad H(x) = \log \dfrac{1}{|x|}$ if $n = 2$.

We see that $H$ is harmonic on $B \setminus 0$, $H > 0$ on $B \setminus 0$, $H = 0$ on $\partial B$, and
$H(x) \to +\infty$ as $x \to 0$. Hence, for each $\varepsilon > 0$, there exists $\delta_0 > 0$ such that

(3.36)  $\varepsilon H - w \ge 0$ on $\partial B_\delta(0)$, $\quad \forall\, \delta \in (0, \delta_0].$

The maximum principle implies that

(3.37)  $\varepsilon H - w \ge 0$

on $B \setminus B_\delta(0)$. Taking $\delta \searrow 0$ yields (3.37) on $B \setminus 0$. Then taking $\varepsilon \searrow 0$ yields

(3.38)  $w \le 0$ on $B \setminus 0$.

A similar argument gives $w \ge 0$ on $B \setminus 0$, hence $w = 0$, and the proof is complete.


We turn now to a converse to the mean value property of harmonic functions.
To state it, we say a function $u \in C(\Omega)$ has the mean value property provided

(3.39)  $u(x_0) = \dfrac{1}{V(B_R(x_0))} \int_{B_R(x_0)} u(x)\, dx,$

whenever the ball $B_R(x_0) \subset \Omega$. One point of abstracting this property is that such
functions satisfy the maximum principle:

Lemma 3.6. If $\Omega$ is bounded and $u \in C(\overline{\Omega})$ has the mean value property on $\Omega$,
then

(3.40)  $\sup_{x \in \Omega} |u(x)| = \sup_{y \in \partial\Omega} |u(y)|.$

In fact, the proof of Proposition 2.7 in Chapter 3 works without change here.
Here is our converse result.

Proposition 3.7. If $u \in C(\Omega)$ has the mean value property, then $u$ is harmonic
on $\Omega$. (In particular, $u \in C^\infty(\Omega)$.)

Proof. It suffices to show that if $\overline{B} \subset \Omega$ is a ball, then $u$ is harmonic on $B$.
Translating and dilating, we can assume $B = B_1(0)$. Now take

(3.41)  $f = u|_{\partial B}, \quad v = \mathrm{PI}\, f.$

It suffices to show that $v = u$ on $B$, or equivalently that $w = v - u = 0$ on $B$.
Indeed, $w$ has the mean value property on $B$, $w \in C(\overline{B})$, and $w = 0$ on $\partial B$, so
Lemma 3.6 implies $w = 0$.


The next result is known as the Schwarz reflection principle. To state it, let
$B = B_1(0)$ and set

(3.42)  $B_+ = \{x \in B : x_n > 0\}.$

Proposition 3.8. Assume $u \in C(\overline{B}_+)$ is harmonic on $B_+$ and

(3.43)  $u = 0$ on $S = B \cap \{x \in \mathbb{R}^n : x_n = 0\}.$

Define $v$ on $\overline{B}$ by

(3.44)  $v(x) = u(x)$ if $x \in \overline{B}_+$, $\quad v(x) = -u(\rho(x))$ if $\rho(x) \in \overline{B}_+$,

where

(3.45)  $\rho(x_1, \dots, x_n) = (x_1, \dots, -x_n).$

Then $v$ is harmonic on $B$.

Proof. The hypothesis (3.43) implies $v \in C(\overline{B})$. In particular, $f = v|_{\partial B}$ belongs
to $C(\partial B)$. Consider

(3.46)  $w = \mathrm{PI}\, f.$

We claim $w = v$. Since $x \mapsto \rho(x)$ is an isometry, we have $\mathrm{PI}(f \circ \rho) = w \circ \rho$, so

(3.47)  $w(\rho(x)) = -w(x).$

Hence $w = 0$ on $S$. Also $w = f = v$ on $\partial B$, so

(3.48)  $w = u$ on $\partial B_+$.

Since $u$ and $w$ are harmonic on $B_+$, we have

(3.49)  $w = u$ on $B_+$,

which, together with (3.47), yields $w = v$, completing the proof.
We turn to a circle of results on harmonic functions that satisfy one-sided
bounds. To start, let $u : B_1(0) \to \mathbb{R}$ be harmonic and assume $u \ge 0$. Also
assume $u \in C(\overline{B_1(0)})$, for now, so $u|_{S^{n-1}} = f \ge 0$ and $u$ is given by (3.32).
Hence, for $x \in B_1(0)$, using $|x - y| \le 1 + |x|$ for $|y| = 1$ and the mean-value property (3.5),

(3.50)  $u(x) = \dfrac{1 - |x|^2}{A_{n-1}} \int_{S^{n-1}} \dfrac{f(y)}{|x - y|^n}\, dS(y) \ \ge\ \dfrac{1 - |x|^2}{(1 + |x|)^n}\, u(0),$

so

(3.51)  $u(x) \ge \dfrac{1 - |x|}{(1 + |x|)^{n-1}}\, u(0), \quad \forall\, x \in B_1(0).$

If we omit the hypothesis that $u \in C(\overline{B_1(0)})$ and apply this reasoning to $u_b(x) =
u(bx)$ and let $b \nearrow 1$, we obtain (3.51) for this more general class. Going further,
we can apply translations and dilations, and obtain the following result, known as
Harnack's inequality. In 2D, such a result arose in Chapter 3, §2, Exercise 13.

Proposition 3.9. Assume $u$ is harmonic on $B_R(x_0)$ and $\ge 0$ there. Then, for all
$x_1 \in B_R(x_0)$,

(3.52)  $u(x_1) \ge \dfrac{1 - R^{-1}|x_1 - x_0|}{(1 + R^{-1}|x_1 - x_0|)^{n-1}}\, u(x_0).$

Using Harnack's inequality, we can establish the following stronger version of
Liouville's theorem.

Proposition 3.10. Assume $v : \mathbb{R}^n \to \mathbb{R}$ is harmonic and bounded from below:

$v(x) \ge -M, \quad \forall\, x \in \mathbb{R}^n.$

Then $v$ is constant.

Proof. The function $u(x) = v(x) + M$ is harmonic and $\ge 0$ on $\mathbb{R}^n$. Given $x_0, x_1 \in
\mathbb{R}^n$, we can take $R > |x_1 - x_0|$ and apply (3.52). Taking $R \to \infty$ then gives

$u(x_1) \ge u(x_0), \quad \forall\, x_0, x_1 \in \mathbb{R}^n.$

Reversing roles also gives $u(x_0) \ge u(x_1)$, so $u$ is constant, and so is $v$.


It is useful to complement Harnack's lower bound with an upper bound. To
start, assume $u \in C(\overline{B_1(0)})$ is harmonic and $\ge 0$, and complement (3.50) with

(3.53)  $u(x) = \dfrac{1 - |x|^2}{A_{n-1}} \int_{S^{n-1}} \dfrac{f(y)}{|x - y|^n}\, dS(y) \ \le\ \dfrac{1 - |x|^2}{(1 - |x|)^n}\, u(0),$

so

(3.54)  $u(x) \le \dfrac{1 + |x|}{(1 - |x|)^{n-1}}\, u(0), \quad \forall\, x \in B_1(0).$

We can remove the hypothesis of continuity on $\overline{B_1(0)}$ by the dilation argument
used above. Further translation and dilation gives the following complement to
Proposition 3.9.

Proposition 3.11. Assume $u$ is harmonic on $B_R(x_0)$ and $\ge 0$ there. Then, for all
$x_1 \in B_R(x_0)$,

(3.55)  $u(x_1) \le \dfrac{1 + R^{-1}|x_1 - x_0|}{(1 - R^{-1}|x_1 - x_0|)^{n-1}}\, u(x_0).$
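The two Harnack bounds can be checked on an explicit example. The sketch below assumes $n = 2$, $R = 1$, $x_0 = 0$, and takes for $u$ the Poisson kernel with pole at $(1, 0)$, which is positive and harmonic on the unit disk with $u(0) = 1$; the function names and the sampling grid are illustrative. For this $u$ both bounds are attained (at $x = -r e_1$ and $x = r e_1$), so the check uses a small relative tolerance.

```python
import math


def harnack_bounds(r, n):
    """Lower and upper Harnack factors, as in (3.51) and (3.54), for |x| = r < 1."""
    lower = (1.0 - r) / (1.0 + r) ** (n - 1)
    upper = (1.0 + r) / (1.0 - r) ** (n - 1)
    return lower, upper


def u(x1, x2):
    """Poisson kernel with pole (1, 0): positive and harmonic on the unit disk."""
    return (1.0 - x1 * x1 - x2 * x2) / ((x1 - 1.0) ** 2 + x2 * x2)


violations = 0
u0 = u(0.0, 0.0)
for i in range(1, 20):            # radii 0.05, 0.10, ..., 0.95
    r = 0.05 * i
    lower, upper = harnack_bounds(r, n=2)
    for k in range(72):           # angles in steps of 5 degrees
        t = 2.0 * math.pi * k / 72
        val = u(r * math.cos(t), r * math.sin(t))
        if not (lower * u0 * (1 - 1e-9) <= val <= upper * u0 * (1 + 1e-9)):
            violations += 1
print(violations)
```

No sample point violates either inequality, and the extreme values on each circle $|x| = r$ match the two Harnack factors, which shows that the constants in (3.51) and (3.54) cannot be improved when $n = 2$.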
The following result will lead to further extensions of Liouville's theorem.

Proposition 3.12. For each $n \ge 2$, there exist constants $K_n \in (0, \infty)$ with the
following property. Let $u$ be harmonic on $B_R(0) \subset \mathbb{R}^n$. Assume

(3.56)  $u(0) = 0, \quad u(x) \le M$ on $B_R(0)$.

Then

(3.57)  $u(x) \ge -K_n M$ on $B_{R/2}(0)$.

Proof. Apply Proposition 3.11, with $u$ replaced by $M - u$, which is $\ge 0$ on $B_R(0)$
and equal to $M$ at $0$. We see that

(3.58)  $|x| \le \dfrac{R}{2} \ \Longrightarrow\ M - u(x) \le \dfrac{3}{2} \cdot 2^{n-1} M,$

so (3.57) holds with $K_n = 3 \cdot 2^{n-2} - 1$.

We can use Proposition 3.12 to give a second proof of Proposition 3.10. Indeed,
if $v \ge 0$ is harmonic on $\mathbb{R}^n$, then $u(x) = v(0) - v(x)$ satisfies (3.56), with
$M = v(0)$, for all $R$, so (3.57) implies $u(x) \ge -K_n v(0)$ for all $x \in \mathbb{R}^n$. Hence
$u$ is harmonic and bounded on $\mathbb{R}^n$, so the fact that $u$ is constant follows from the
version of Liouville's theorem given in Corollary 4.7 of Chapter 3. An extension
of this argument gives the following.

Proposition 3.13. Assume that $u$ is harmonic on $\mathbb{R}^n$ and that there exist $C_0, C_1 \in
(0, \infty)$ and $k \in \mathbb{Z}^+$ such that

(3.59)  $u(x) \le C_0 + C_1 |x|^k, \quad \forall\, x \in \mathbb{R}^n.$

Then there exist $C_2, C_3 \in (0, \infty)$ such that

(3.60)  $u(x) \ge -C_2 - C_3 |x|^k, \quad \forall\, x \in \mathbb{R}^n.$

Proof. Apply Proposition 3.12 to $u(x) - u(0)$, $M = C_0 + |u(0)| + C_1 R^k$.

To proceed, we recall a result established in Proposition 4.6 of Chapter 3.

Proposition 3.14. Let $u : \mathbb{R}^n \to \mathbb{R}$ be harmonic and assume there exist $C_0$, $C_1$
and $k$ such that

(3.61)  $|u(x)| \le C_0 + C_1 |x|^k, \quad \forall\, x \in \mathbb{R}^n.$

Then $u(x)$ is a polynomial in $x$, of degree $\le k$.

Here is a punch line.

Corollary 3.15. In the setting of Proposition 3.13, $u$ is a polynomial of degree
$\le k$ on $\mathbb{R}^n$.

Bôcher's theorem

Let $O \subset \mathbb{R}^n$ be a connected open set, $p \in O$. Assume

(3.62)  $u \in C^2(O \setminus p), \quad \Delta u = 0, \quad u > 0$ on $O \setminus p$.

Examples of such functions include

(3.63)  $V(x) = |x - p|^{2-n}$, $n \ge 3$; $\quad V(x) = \log \dfrac{1}{|x - p|}$, $n = 2$,

the latter holding provided $O \subset B_1(p)$ (add a constant if $O$ is in a larger bounded
planar domain). Bôcher's theorem says the following.

Proposition 3.16. If $u$ satisfies (3.62), then there exists a harmonic function $h \in
C^\infty(O)$ and a constant $A \in [0, \infty)$ such that

(3.64)  $u(x) = A V(x) + h(x),$

with $V$ as in (3.63).

Proof. To begin, take $R > 0$ such that $\overline{B_{2R}(p)} \subset O$. Let $q \in \partial B_R(p)$, so
$B_R(q) \subset O \setminus p$. By Proposition 3.11, we have

(3.65)  $u(x) \le 2u(q)\,\bigl(1 - R^{-1}|x - q|\bigr)^{-(n-1)}, \quad \forall\, x \in B_R(q).$

Letting $q$ range over $\partial B_R(p)$, we deduce that, for some $C < \infty$,

(3.66)  $u(x) \le C|x - p|^{-(n-1)}$, for $0 < |x - p| \le R$.

Consequently the restriction of $u$ to $\Omega = B_R(p)$ satisfies

(3.67)  $u \in L^s(\Omega), \quad \forall\, s < \dfrac{n}{n-1}.$

Hence $\Delta u$ is a well defined distribution on $\Omega$. We have

(3.68)  $\Delta u = \mu \in \mathcal{D}'(\Omega), \quad \operatorname{supp} \mu \subset \{p\}.$

By Proposition 4.5 in Chapter 3, we have

(3.69)  $\mu = P\delta_p,$

for some constant coefficient differential operator $P$. Note that, for some $c \in \mathbb{R}$,
$P\delta_p = cP\Delta V = c\Delta PV$, hence

(3.70)  $\Delta(u - cPV) = 0$ on $\Omega$,

that is, $u - cPV$ is harmonic on $\Omega$. Hence, by (3.67),

(3.71)  $cPV \in L^s(\Omega), \quad \forall\, s < \dfrac{n}{n-1}.$

Now, if $P_k$ is a constant coefficient differential operator,

(3.72)  $P_k$ homogeneous of degree $k$ $\ \Longrightarrow\ $ $P_k V$ homogeneous of degree $2 - n - k$, about $p$,

so (3.71) implies $cP$ is a first order differential operator, i.e., $cP = X + A$, where
$X$ is a constant coefficient vector field and $A \in \mathbb{C}$, so $u$ differs from

(3.73)  $XV + AV$

by a function that is harmonic on $\Omega$. Since $u$ is real valued, $X$ is a real vector field.
Rotating coordinates, we can assume $X$ is a multiple of $\partial_1$. Then a calculation
gives

(3.74)  $\partial_1 V(x) = C_n (x_1 - p_1)\,|x - p|^{-n},$

so the hypothesis $u > 0$ implies $X = 0$. Then $A \ge 0$ in (3.73), and we have $u -
AV$ harmonic on a neighborhood of $p$, hence on all of $O$. This proves Proposition
3.16. See [Tay] for a variable coefficient extension.

Analyticity of harmonic functions

Another consequence of the Poisson integral formula (3.32) is that PI $f$ is real analytic on $\{x \in \mathbb{R}^n : |x| < 1\}$. In fact, we can continue PI $f$ into the complex domain, via

To see how this works, set $z = x + iy$, $x, y \in \mathbb{R}^n$. A computation gives, for $w \in S^{n-1}$,

(3.76) $(z - w)\cdot(z - w) = (x - w + iy)\cdot(x - w + iy) = |x - w|^2 - |y|^2 + 2i\,(x - w)\cdot y,$

hence

(3.77) $\operatorname{Re}\,(z - w)\cdot(z - w) = |x - w|^2 - |y|^2 \ge (1 - |x|)^2 - |y|^2,$

since $|w| = 1$, $|x| < 1 \Rightarrow |x - w| \ge 1 - |x|$. Thus we have


Proposition 3.17. Given $f \in C(S^{n-1})$, (3.75) defines PI $f(z)$ as a function holomorphic on

(3.78) $\{z = x + iy \in \mathbb{C}^n : |x| < 1,\ |y| < 1 - |x|\}.$

This leads immediately to our analytic regularity result.

Proposition 3.18. Given $\Omega$ open in $\mathbb{R}^n$, $u \in C^2(\Omega)$, harmonic on $\Omega$, it follows that $u$ is real analytic on $\Omega$.


This result is a special case of analytic regularity for solutions to $Pu = f$ for a general elliptic operator $P$ with real analytic coefficients (and real analytic $f$), refining the $C^\infty$ regularity obtained in this chapter for operators with $C^\infty$ coefficients. Such analytic regularity results can be found in [Mor]. We do not establish such general results here, but we can extend the scope of Proposition 3.18 a bit, via a simple trick, to obtain the following.

Proposition 3.19. Given $\Omega \subset \mathbb{R}^n$ open, $\lambda \in \mathbb{C}$, and $u \in C^2(\Omega)$, satisfying

(3.79) $\Delta u - \lambda^2 u = 0 \ \text{on } \Omega,$

it follows that $u$ is real analytic on $\Omega$.

Proof. We define $v$ on $\Omega \times \mathbb{R}$ by

(3.80) $v(x, y) = e^{i\lambda y}\, u(x).$

Then

(3.81) $(\Delta_x + \partial_y^2)\, v = 0 \ \text{on } \Omega \times \mathbb{R},$

so by Proposition 3.18, it is real analytic. Hence so is $u$.


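Here is another analyticity result.

Proposition 3.20. Let $\Omega \subset \mathbb{R}^n$ be open and assume $u \in C^\infty(\Omega)$ satisfies

(3.82) $\|\Delta^k u\|_{L^\infty(\Omega)} \le C M^k (2k)!, \quad \forall\, k \in \mathbb{Z}^+,$

for some $C, M \in (0, \infty)$. Then $u$ is real analytic on $\Omega$.

Proof. With $I = (-a, a)$, to be specified, we define $v$ on $\Omega \times I$ by

(3.83) $v(x, y) = \sum_{k=0}^{\infty} \dfrac{y^{2k}}{(2k)!}\, (-\Delta)^k u(x).$

(Formally, this is $(\cosh y\sqrt{-\Delta})\,u(x)$.) The hypothesis (3.82) implies this converges in $C^\infty(\Omega \times I)$, provided $a < 1/M$, and we have

(3.84) $\dfrac{\partial^2 v}{\partial y^2} = -\Delta_x v.$

Again Proposition 3.18 implies $v$ is real analytic on $\Omega \times I$, hence $u$ is real analytic on $\Omega$.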


REMARK. Proposition 3.20 is a special case of the result known as the Kotake- 
Narasimhan theorem, [KN]. The proof given above is taken from [T2]. 
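
The series (3.83) can be tried out numerically. The sketch below is an illustration only, under assumptions that are not taken from the text: it works in one space dimension with the sample function $u(x) = \sin 2x + \cos 3x$, for which $(-\Delta)^k u = 4^k \sin 2x + 9^k \cos 3x$ is known in closed form, and all function names are hypothetical. It compares a partial sum of (3.83) with $\cosh(2y)\sin 2x + \cosh(3y)\cos 3x$ and uses a five-point stencil to check that the sum is approximately harmonic in $(x, y)$, as (3.84) predicts.

```python
import math

def neg_laplacian_power(k, x):
    # (-d^2/dx^2)^k applied to the sample u(x) = sin(2x) + cos(3x), in closed form.
    return 4.0 ** k * math.sin(2.0 * x) + 9.0 ** k * math.cos(3.0 * x)

def v_series(x, y, terms=40):
    # Partial sum of (3.83): sum over k of y^(2k)/(2k)! * (-Delta)^k u(x).
    total = 0.0
    for k in range(terms):
        total += y ** (2 * k) / math.factorial(2 * k) * neg_laplacian_power(k, x)
    return total

def v_closed(x, y):
    # Closed form of cosh(y sqrt(-Delta)) u for this particular u.
    return math.cosh(2.0 * y) * math.sin(2.0 * x) + math.cosh(3.0 * y) * math.cos(3.0 * x)

def laplacian_fd(g, x, y, h=1e-3):
    # Five-point finite-difference approximation of (d_x^2 + d_y^2) g.
    return (g(x + h, y) + g(x - h, y) + g(x, y + h) + g(x, y - h) - 4.0 * g(x, y)) / (h * h)

x0, y0 = 0.7, 0.25
series_value = v_series(x0, y0)
closed_value = v_closed(x0, y0)
residual = laplacian_fd(v_series, x0, y0)  # should be small, up to discretization error
print(series_value, closed_value, residual)
```

For this sample $u$ the bound (3.82) holds trivially, so the example only illustrates the mechanism of the proof, not the sharpness of the hypothesis.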


Exercises 


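1. If $\Omega \subset\subset \mathbb{R}^n$ is smooth, and $u \in C^2(\overline{\Omega})$ is harmonic, show that

$\displaystyle\int_{\partial\Omega} \frac{\partial u}{\partial \nu}\, dS = 0.$

(Hint: Set $v = 1$ in Green's formula (3.12), with $B$ replaced by $\Omega$.)

2. Derive the mean-value property as follows. For $u$ harmonic on a neighborhood of $\overline{B}_R = \{x \in \mathbb{R}^n : |x| \le R\}$, if $0 < r \le R$, it follows from Exercise 1 that

$\displaystyle\frac{d}{dr} \int_{S^{n-1}} u(r\omega)\, dS(\omega) = \int_{S^{n-1}} \frac{\partial}{\partial r} u(r\omega)\, dS(\omega) = 0,$

so $\operatorname{Avg}_{\partial B_r} u$ is constant for $0 < r \le R$.

3. Modify the approach to Exercise 2 to show that, if $u$ is subharmonic (i.e., $\Delta u \ge 0$ on $B_R$), then

$u(0) \le \operatorname{Avg}_{\partial B_R} u.$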


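4. If $u = $ PI $f$ as in (3.1)-(3.2), show that, for any $\omega \in S^{n-1}$,

(3.85) $\displaystyle \omega \cdot \nabla u(0) = \frac{n}{A_{n-1}} \int_{S^{n-1}} (\omega \cdot y)\, f(y)\, dS(y).$

Deduce that if $u$ is harmonic on $\Omega \subset \mathbb{R}^n$ and $\overline{B_r(p)} \subset \Omega$, then

(3.86) $\displaystyle \omega \cdot \nabla u(p) = \frac{n}{A_{n-1}\, r^{n+1}} \int_{\partial B_r(p)} \omega \cdot (y - p)\, u(y)\, dS(y) = \frac{n}{r^2}\, \operatorname{Avg}_{\partial B_r(p)} \{\omega \cdot (y - p)\, u(y)\}.$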


5. Compare the Harnack inequality arguments and consequences established in this sec- 
tion with those proposed in the 2D setting in Exercises 13-16 of Chapter 3, §2. 


4. The Riemann mapping theorem (smooth boundary) 


Let $\Omega$ be a bounded domain in $\mathbb{C}$, with smooth boundary. Assume $\Omega$ is connected and simply connected. In particular, this implies that $\partial\Omega$ is connected, so diffeomorphic to the circle $S^1$. Let $p$ be a point in $\Omega$. We aim to construct a holomorphic function $\Phi$ on $\Omega$ such that $\Phi(p) = 0$ and $\Phi : \overline{\Omega} \to \overline{D}$ is a diffeomorphism, where $D = \{z \in \mathbb{C} : |z| < 1\}$ is the unit disk. This will be done via solving a Dirichlet problem for the Laplace operator on $\Omega$.

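Note that the function $\log|z - p|$ is harmonic on $\mathbb{C} \setminus p$. Let $G_0(x, y)$ be the solution to the Dirichlet problem

(4.1) $\Delta G_0 = 0 \ \text{in } \Omega, \quad G_0|_{\partial\Omega} = -\log|z - p|\,\big|_{\partial\Omega}.$

As we know, there is a unique such $G_0 \in C^\infty(\overline{\Omega})$. Then

(4.2) $G(x, y) = \log|z - p| + G_0(x, y)$

is harmonic on $\Omega \setminus \{p\}$ and vanishes on $\partial\Omega$. This is a Green function.

We next construct $H_0 \in C^\infty(\overline{\Omega})$, the harmonic conjugate of $G_0$. It is given by

(4.3) $\displaystyle H_0(z) = \int_p^z \Bigl[ -\frac{\partial G_0}{\partial y}\, dx + \frac{\partial G_0}{\partial x}\, dy \Bigr],$

the integral being along any path from $p$ to $z$ in $\Omega$. Green's theorem, and the harmonicity of $G_0$, imply that the integral is independent of the choice of path. Making appropriate choices of path, we readily see that

(4.4) $\dfrac{\partial H_0}{\partial x} = -\dfrac{\partial G_0}{\partial y}, \quad \dfrac{\partial H_0}{\partial y} = \dfrac{\partial G_0}{\partial x},$

so $G_0 + iH_0$ is holomorphic. Now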

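(4.5) $H(x, y) = \operatorname{Im} \log(z - p) + H_0(x, y)$

is multivalued, but

(4.6) $\Phi(z) = e^{G + iH} = (z - p)\, e^{G_0 + iH_0}$

is a single-valued holomorphic function on $\Omega$, with $\Phi(p) = 0$. Note that $z \in \partial\Omega \Rightarrow G = \operatorname{Re}(G + iH) = 0$, so

(4.7) $\Phi : \partial\Omega \longrightarrow S^1,$

and hence, by the maximum modulus principle,

(4.8) $\Phi : \Omega \longrightarrow D.$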

The Riemann mapping theorem (for this class of domains) asserts the following. 
Theorem 4.1. $\Phi$ is a holomorphic diffeomorphism of $\overline{\Omega}$ onto $\overline{D}$.


Proof. We must show that $\Phi : \overline{\Omega} \to \overline{D}$ is one-to-one and onto, with nowhere-vanishing derivative. This will be easy once we establish that

(4.9) $\varphi = \Phi|_{\partial\Omega} : \partial\Omega \longrightarrow S^1$

has nowhere-vanishing derivative. Note that since $G|_{\partial\Omega} = 0$, $\varphi = e^{iH}|_{\partial\Omega}$. In view of the Cauchy-Riemann equations yielding holomorphy of $G + iH$, to say that the tangential derivative of $H$ on $\partial\Omega$ is nowhere zero is equivalent to saying

(4.10) $\dfrac{\partial G}{\partial \nu}(z) \ne 0, \quad \text{for all } z \in \partial\Omega.$

On the other hand, since $G(z) \to -\infty$ as $z \to p$, $G(z)$ is maximal on $\partial\Omega$, and so Zaremba's principle implies (4.10). Thus (4.9) is a local diffeomorphism, and hence a covering map. To finish off the argument, we make use of the following result, known as the argument principle.


Proposition 4.2. Let $\Phi \in C^1(\overline{\Omega})$ be holomorphic inside $\Omega$, a bounded region in $\mathbb{C}$ with smooth boundary, $\partial\Omega = \gamma$. Take $q \in \mathbb{C}$, not in the image of $\gamma$ under $\Phi$. Then the number of points $p_j$ in $\Omega$, counting multiplicity, for which $\Phi(p_j) = q$ is equal to the winding number of the curve $\Phi(\gamma)$ about $q$.

Here, if $\Phi(p_j) = q$, we say $p_j$ has multiplicity 1 if $\Phi'(p_j) \ne 0$, and we say it has multiplicity $k + 1$ if $\Phi'(p_j) = \cdots = \Phi^{(k)}(p_j) = 0$ but $\Phi^{(k+1)}(p_j) \ne 0$. A proof of this elementary result can be found in most complex analysis texts, e.g., [Ahl], [Hil], or [T3]. A substantial generalization of this result can be found in Exercises 19-22 in the set of exercises on cohomology, after §9 of this chapter.

Now, to finish off the proof of Theorem 4.1, we show that the map (4.9) has winding number 1, by appealing to the argument principle, with $q = 0$. We see from (4.6) that $p$ is the unique zero of $\Phi$, a simple zero. Hence the map (4.9) is a diffeomorphism. Thus, again by the argument principle, any $q \in D$ is equal to $\Phi(w)$ for precisely one $w \in \Omega$. This implies $\Phi'(w) \ne 0$ for all $w \in \Omega$, and the proof of Theorem 4.1 is complete.

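Remark: A common proof of Proposition 4.2 starts like this. Suppose $q \notin \Phi(\gamma)$ and $\Phi(z) - q$ has $k$ roots in $\Omega$, counted with multiplicity; call them $p_j$, $1 \le j \le k$, with $p_j \in \Omega$, repeated according to multiplicity. Then, on $\Omega$,

(4.11) $\displaystyle \Phi(z) - q = \prod_{j=1}^{k} (z - p_j) \cdot \Psi(z),$

with $\Psi \in C^1(\overline{\Omega})$, holomorphic on $\Omega$ and nowhere zero on $\overline{\Omega}$. The Leibniz formula gives

(4.12) $\displaystyle \frac{\Phi'(z)}{\Phi(z) - q} = \sum_{j=1}^{k} \frac{1}{z - p_j} + \frac{\Psi'(z)}{\Psi(z)}.$

Hence

(4.13) $\displaystyle \frac{1}{2\pi i} \int_{\partial\Omega} \frac{\Phi'(z)}{\Phi(z) - q}\, dz = \sum_{j=1}^{k} \frac{1}{2\pi i} \int_{\partial\Omega} \frac{dz}{z - p_j} + \frac{1}{2\pi i} \int_{\partial\Omega} \frac{\Psi'(z)}{\Psi(z)}\, dz = k,$

the latter identity by the Cauchy integral theorem.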


Rather than identify (4.13) with the winding number of $\Phi(\gamma)$ about $q$, we can finish the proof of Theorem 4.1 as follows. The identity (4.13) shows that whenever $q \notin \Phi(\gamma)$, the left side of (4.13) is an integer. On the other hand, this quantity is clearly continuous in $q$ on each connected component of $\mathbb{C} \setminus \Phi(\gamma)$, hence constant on each such component. In the setting of Theorem 4.1, $\Phi(\gamma) = S^1$, and (4.13) is seen to be equal to 1 for $q = 0$. Hence (4.13) is equal to 1 for all $q \in D$.
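
The counting integral on the left side of (4.13) is easy to approximate numerically. The following sketch is an illustration under assumptions chosen here, not taken from the text: $\Omega$ is the unit disk, $\Phi$ is a sample cubic with two zeros inside the disk and one outside, and the contour integral is approximated by the trapezoid rule on $|z| = 1$. The function names are hypothetical.

```python
import cmath
import math

def phi(z):
    # Sample holomorphic function (an assumption): zeros at 0.3 and -0.5i inside
    # the unit disk, and at 2 outside it.
    return (z - 0.3) * (z + 0.5j) * (z - 2.0)

def dphi(z):
    # Derivative of phi by the product rule.
    return ((z + 0.5j) * (z - 2.0)
            + (z - 0.3) * (z - 2.0)
            + (z - 0.3) * (z + 0.5j))

def count_solutions(q, n=2000):
    # Trapezoid-rule approximation of the left side of (4.13) on |z| = 1:
    # (1 / (2 pi i)) * integral of phi'(z) / (phi(z) - q) dz.
    total = 0j
    for m in range(n):
        z = cmath.exp(2j * math.pi * m / n)
        dz = 1j * z * (2.0 * math.pi / n)
        total += dphi(z) / (phi(z) - q) * dz
    return total / (2j * math.pi)

for q in (0.0, 0.1 + 0.1j, 100.0):
    print(q, count_solutions(q))
```

The first two values of $q$ lie in the same component of the complement of $\Phi(\gamma)$, so the integral should be close to 2 for both; the third lies in the unbounded component, where the integral should be close to 0.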

Two smooth, bounded domains in $\mathbb{C}$ that are homeomorphic may not be holomorphically equivalent if they are not simply connected. We discuss the analogue of the Riemann mapping theorem in the next simplest case, when $\Omega$ is a smooth, bounded domain in $\mathbb{C}$ whose boundary has two connected components, say $\gamma_0$ and $\gamma_1$. Assume $\gamma_0$ is the "outer" boundary component, touching the unbounded component of $\mathbb{C} \setminus \Omega$.

In such a case, let $u(x, y)$ be the solution to the Dirichlet problem

(4.14) $\Delta u = 0 \ \text{on } \Omega, \quad u|_{\gamma_0} = 0, \quad u|_{\gamma_1} = -1.$

Given $c > 0$, consider $G = cu$, which will play a role analogous to the function $G$ in (4.2). Consider the 1-form

(4.15) $\beta = -\dfrac{\partial G}{\partial y}\, dx + \dfrac{\partial G}{\partial x}\, dy,$

which is closed by the harmonicity of $G$. If $\gamma_0$ is oriented in the counterclockwise direction, we have $\int_{\gamma_0} \beta = cA$, for

$\displaystyle A = \int_{\gamma_0} \frac{\partial u}{\partial \nu}\, ds > 0.$

Hence there is a unique value of $c \in (0, \infty)$ for which $\int_{\gamma_0} \beta = 2\pi$. In that case we can write $\beta = dH$, where $H$, a harmonic conjugate of $G$, is a smooth, real-valued "function" on $\Omega$, well defined mod $2\pi$. Hence

(4.16) $\Psi(z) = e^{G + iH}$

is a single-valued holomorphic function on $\Omega$. It maps $\gamma_0$ to the circle $|z| = 1$ and it maps $\gamma_1$ to the circle $|z| = e^{-c}$. Using Zaremba's principle as in the proof of Theorem 4.1, we see that $\Psi$ maps $\gamma_0$ to $S^1 = \{z : |z| = 1\}$ and $\gamma_1$ to $\{z : |z| = e^{-c}\}$ (which we denote $S^1_\rho$), locally diffeomorphically. The fact that $\int_{\gamma_0} \beta = 2\pi$ implies that $\gamma_0$ is mapped diffeomorphically onto $S^1$. Similarly, $\gamma_1$ is mapped diffeomorphically onto $S^1_\rho$. From here, an application of the argument principle yields:


Theorem 4.3. If $\Omega$ is a smooth, bounded domain in $\mathbb{C}$ with two boundary components, and $\Psi$ is constructed by (4.14)-(4.16), then $\Psi$ is a holomorphic diffeomorphism of $\overline{\Omega}$ onto the annular region

(4.17) $A_\rho = \{z \in \mathbb{C} : \rho \le |z| \le 1\}, \quad \rho = e^{-c}.$


It is easy to show that if $0 < \rho < \sigma < 1$, then $A_\rho$ and $A_\sigma$ are not holomorphically equivalent. If there were a holomorphic diffeomorphism $F : A_\rho \to A_\sigma$, then, using an inversion if necessary, we could assume $F$ maps $|z| = 1$ to itself and that it maps $|z| = \rho$ to $|z| = \sigma$. Then, applying the Schwarz reflection principle an infinite sequence of times, we can extend $F$ to a holomorphic diffeomorphism of $D = \{|z| < 1\}$ onto itself, preserving the origin. Then we must have $F(z) = az$, $|a| = 1$ (see Exercise 4 below), which would imply $\rho = \sigma$.

Exercises


1. Let $\Omega$ be the unit disk in $\mathbb{C}$, $f \in C^\infty(S^1)$, real-valued, with mean zero. Let $u = $ PI $f$. Show that, for $g \in C^\infty(S^1)$ real-valued, $v = $ PI $g$ is the harmonic conjugate of $u$ (satisfying $v(0) = 0$) if and only if

$g = -iHf,$

where $H$ is the operator (2.16) of Chap. 3, that is,

$\displaystyle Hf(\theta) = \sum_{k=-\infty}^{\infty} (\operatorname{sgn} k)\, \hat{f}(k)\, e^{ik\theta}.$


2. Let $\Omega$ be a bounded, simply connected domain in $\mathbb{C}$, and suppose $F : \overline{\Omega} \to \overline{D}$ is holomorphic, taking $\partial\Omega$ to $\partial D$. Suppose $F(p) = 0$, $p \in \Omega$, $F'(p) \ne 0$, and $F$ has no other zeros. Show that $F(z) = (z - p)e^{f(z)}$ with $f : \Omega \to \mathbb{C}$ holomorphic, and $e^{\operatorname{Re} f(z)} = |z - p|^{-1}$ on $\partial\Omega$. Use this to motivate the constructions used in this section to prove the Riemann mapping theorem.

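3. Given $a, b \in \mathbb{C}$, $|a|^2 - |b|^2 = 1$, set

$A = \begin{pmatrix} a & b \\ \bar{b} & \bar{a} \end{pmatrix}.$

We say $A \in SU(1,1)$. Define the map

$F_A(z) = \dfrac{az + b}{\bar{b} z + \bar{a}}.$

Show that each such $F_A$ maps $D$ one-to-one and onto itself. Show that $F_{AB}(z) = F_A(F_B(z))$. Show that, for any $q \in D$, there exists $A \in SU(1,1)$ such that $F_A(q) = 0$.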


4. Suppose $F : D \to D$ is a holomorphic diffeomorphism such that $F(0) = 0$. Show that $F(z) = az$, for some $a \in \mathbb{C}$, $|a| = 1$. (Hint: Consider the behavior of $F(z)/z$ and of

5. Deduce that every holomorphic diffeomorphism F' : D — D is of the form F'4 of 
Exercise 3. (Hint: First construct F'4, such that F'4, o Fo F'4,(0) = 0.) 

6. Given $p \in \Omega$, simply connected, etc., show that there is a unique holomorphic diffeomorphism $\Phi : \overline{\Omega} \to \overline{D}$ such that $\Phi(p) = 0$ and $\Phi'(p) > 0$.

Let $\Omega$ be an open connected subset of the interior of $\overline{M}$, a smooth, compact, connected Riemannian manifold with nonempty boundary. The boundary of $\Omega$ can be quite wild. We want to formulate and study the Dirichlet problem for the Laplace operator on $\Omega$.

Let us start with a function $\varphi \in C^\infty(\overline{M})$ given; let $\psi = \varphi|_{\partial\Omega}$. We want to find $u \in C^\infty(\Omega)$ such that

(5.1) $\Delta u = 0 \ \text{in } \Omega, \quad \text{and } u|_{\partial\Omega} = \psi,$ in some sense.

In the best cases, we will have $u \in C(\overline{\Omega})$, but not always.

A particular example of the problem we're interested in solving is the following, when $\Omega$ is a bounded open subset of $\mathbb{R}^n$. Let $E_p(x)$ be (a multiple of) the fundamental solution to the Laplace equation on $\mathbb{R}^n$, with pole at $p$, given by

(5.2)

We want to construct a function $G_p$, harmonic on $\Omega \setminus \{p\}$, with the same type of singularity as $E_p$ at $p$, so

$\Delta G_p = c_n \delta_p, \quad G_p|_{\partial\Omega} = 0,$ in some sense.

Recall from §4 that such a function was constructed for $\Omega \subset\subset \mathbb{R}^2$ with smooth boundary, as a tool to prove the Riemann mapping theorem for smooth, simply connected domains. One motivating force pushing our analysis here will be to generalize Theorem 4.1 to an arbitrary bounded, simply connected domain $\Omega$ in $\mathbb{C}$, with no smoothness assumptions whatsoever on $\partial\Omega$. To relate $G_p$ to (5.1), note that if we write

(5.3) $G_p = E_p + F,$

then $u = F$ solves (5.1), with $\psi = -E_p|_{\partial\Omega}$. We then have $\psi = \varphi|_{\partial\Omega}$, where $\varphi = -\chi E_p$, with $\chi \in C^\infty(\mathbb{R}^n)$ equal to zero on a neighborhood of $p$, 1 on a neighborhood of $\partial\Omega$.

A construction of the solution to (5.1) is given by

(5.4) $u = v + \varphi,$

where $v$ is defined by

(5.5) $v \in H_0^1(\Omega), \quad \Delta v = -\Delta\varphi = \Phi.$

See (1.47)-(1.48) for unique solvability of (5.5). We proceed to give a more precise sense to the assertion that $u|_{\partial\Omega} = \psi$.

We will analyze the behavior of the solution $u$ defined by (5.4)-(5.5) by the following limiting process. Pick a sequence of connected domains $\Omega_j$ with smooth boundary such that

(5.6) $\Omega_j \subset\subset \Omega_{j+1}, \quad \bigcup_j \Omega_j = \Omega.$

Then as shown in §1, we have $u_j \in C^\infty(\overline{\Omega}_j)$ such that

(5.7) $\Delta u_j = 0 \ \text{on } \Omega_j, \quad u_j|_{\partial\Omega_j} = \psi_j = \varphi|_{\partial\Omega_j}.$

Parallel to (5.4)-(5.5), we have

(5.8) $u_j = v_j + \varphi_j,$

where

(5.9) $\varphi_j = \varphi|_{\overline{\Omega}_j},$

and $v_j \in C^\infty(\overline{\Omega}_j)$ is uniquely determined by

(5.10) $v_j \in H_0^1(\Omega_j), \quad \Delta v_j = \Phi_j = \Phi|_{\Omega_j},$

with $\Phi \in C^\infty(\overline{M})$ defined as in (5.5). Extending each $v_j \in C^\infty(\overline{\Omega}_j)$ to be zero in $\Omega \setminus \Omega_j$, we can regard each $v_j$ as an element of $H_0^1(\Omega)$. We then have the following.


Lemma 5.1. The set $\{v_j\}$ is bounded in $H_0^1(\Omega)$.

Proof. We have

$\|v_j\|^2_{H^1(\Omega)} = \|v_j\|^2_{H^1(\Omega_j)} = \|dv_j\|^2_{L^2(\Omega_j)} + \|v_j\|^2_{L^2(\Omega_j)}.$

By (5.10),

(5.11) $\|dv_j\|^2_{L^2} = -(\Delta v_j, v_j) = -(\Phi, v_j) \le \|\Phi\|_{L^2} \|v_j\|_{L^2}.$

Now there is a constant $K$ such that

(5.12) $\|u\|_{L^2} \le K \|du\|_{L^2}, \quad \text{for all } u \in H_0^1(\Omega),$

indeed, for all $u \in H_0^1(M)$. Inserting this estimate into (5.11) and cancelling a factor of $\|dv_j\|_{L^2}$, we have

(5.13) $\|dv_j\|_{L^2} \le K \|\Phi\|_{L^2}.$

Appealing again to (5.12), we have a bound on the $H^1(\Omega)$-norm of $v_j$.
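
The uniform bound (5.13) can be observed in a small numerical experiment. The sketch below is only a one-dimensional finite-difference analogue, under assumptions made here: $\Omega = (0, 1)$, $\Omega_j = (1/(j+2),\, 1 - 1/(j+2))$, a sample right-hand side $\Phi(x) = \cos 3x + x$, and $K = 1/\pi$, the Poincaré constant of $(0, 1)$ in (5.12). The solver and the helper names are hypothetical.

```python
import math

def solve_dirichlet(a, b, source, n=200):
    # Solve v'' = source on (a, b) with v(a) = v(b) = 0 by second-order finite
    # differences; the tridiagonal system is solved with the Thomas algorithm.
    h = (b - a) / n
    xs = [a + i * h for i in range(1, n)]          # interior nodes
    rhs = [h * h * source(x) for x in xs]
    m = n - 1
    diag = [-2.0] * m                              # off-diagonal entries are all 1
    for i in range(1, m):
        w = 1.0 / diag[i - 1]
        diag[i] -= w
        rhs[i] -= w * rhs[i - 1]
    v = [0.0] * m
    v[-1] = rhs[-1] / diag[-1]
    for i in range(m - 2, -1, -1):
        v[i] = (rhs[i] - v[i + 1]) / diag[i]
    return h, [0.0] + v + [0.0]

def energy_norm(h, v):
    # Discrete version of ||dv||_{L^2}: sqrt(h * sum(((v[i+1] - v[i]) / h)^2)).
    return math.sqrt(sum((v[i + 1] - v[i]) ** 2 for i in range(len(v) - 1)) / h)

def source(x):
    # Sample right-hand side Phi (an assumption for this illustration).
    return math.cos(3.0 * x) + x

N = 2000
phi_norm = math.sqrt(sum(source((i + 0.5) / N) ** 2 for i in range(N)) / N)  # ||Phi||_{L^2(0,1)}
K = 1.0 / math.pi
bound = K * phi_norm

energies = []
for j in range(1, 21):
    a = 1.0 / (j + 2)                              # Omega_j = (a, 1 - a)
    h, v = solve_dirichlet(a, 1.0 - a, source)
    energies.append(energy_norm(h, v))
print(max(energies), bound)
```

The point of the experiment is that the bound does not depend on $j$: every $\|dv_j\|_{L^2}$ stays below the single number $K\|\Phi\|_{L^2(\Omega)}$, which is what makes the weak compactness argument that follows possible.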


Since any closed ball in the Hilbert space $H_0^1(\Omega)$ is compact and metrizable in the weak topology, any subsequence of $\{v_j\}$ in turn has a weakly convergent subsequence. Any limit must satisfy (5.5). Since the solution to (5.5) is unique, we have

(5.14) $v_j \longrightarrow v$ weakly in $H_0^1(\Omega)$.

Since $\Delta v_j = \Phi$ on $\Omega_j$, we deduce from interior regularity estimates the following.

Lemma 5.2. Let $\mathcal{O} \subset\subset \Omega$; say $\mathcal{O} \subset\subset \Omega_J$. Then $\{v_j|_{\mathcal{O}} : j \ge J\}$ is bounded in $C^\infty(\mathcal{O})$.

It follows that

(5.15) $v_j \longrightarrow v$ in $C^\infty(\mathcal{O})$.

Thus, by (5.4) and (5.8),

(5.16) $u_j \longrightarrow u$ in $C^\infty(\mathcal{O})$, for each $\mathcal{O} \subset\subset \Omega$.

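We can use this to obtain the following version of the strong maximum principle for $u$.

Proposition 5.3. The function $u$ defined by (5.4)-(5.5) satisfies

(5.17) $\displaystyle \inf_{\partial\Omega} \psi(y) < u(x) < \sup_{\partial\Omega} \psi(y), \quad x \in \Omega,$

unless $u$ is constant.

Proof. For $u_j \in C^\infty(\overline{\Omega}_j)$, the strong maximum principle established in §2 implies

(5.18) $\displaystyle \inf_{\partial\Omega_j} \varphi(y) < u_j(x) < \sup_{\partial\Omega_j} \varphi(y), \quad \text{for } x \in \Omega_J,\ j \ge J$

(unless $u_j$ is constant). It follows from (5.16) that

(5.19) $\displaystyle \inf_{\partial\Omega} \varphi(y) \le u(x) \le \sup_{\partial\Omega} \varphi(y),$

for all $x \in \Omega_J$, for all $J$, that is, (5.19) holds for all $x \in \Omega$. Since the strong maximum principle holds for $u|_{\overline{\Omega}_j}$, we see that, unless $u|_{\Omega_j}$ is constant,

$\displaystyle \inf_{\partial\Omega_j} u(y) < u(x) < \sup_{\partial\Omega_j} u(y), \quad \text{for } x \in \Omega_J,\ j > J,$

so the estimate (5.17) follows.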


One obvious consequence of Proposition 5.3, or even of (5.19), is that $u$ is uniquely determined by $\psi = \varphi|_{\partial\Omega}$, independent of the extension $\varphi$ to $\overline{M}$. We hence have a map

(5.20) PI $: \mathcal{E}(\partial\Omega) \longrightarrow L^\infty(\Omega) \cap C^\infty(\Omega),$

where $\mathcal{E}(\partial\Omega)$ denotes the space of restrictions of elements of $C^\infty(\overline{M})$ to $\partial\Omega$. In (5.20), PI $\psi = u$ for $\psi = \varphi|_{\partial\Omega}$, with $u$ given by (5.4)-(5.5). This map preserves the sup norm. It follows from the Stone-Weierstrass theorem that $\mathcal{E}(\partial\Omega)$ is dense in $C(\partial\Omega)$, so there is a unique continuous extension

(5.21) PI $: C(\partial\Omega) \longrightarrow L^\infty(\Omega) \cap C^\infty(\Omega).$

If $\psi \in C(\partial\Omega)$, $u = $ PI $\psi$ satisfies $\Delta u = 0$ in $\Omega$. It is clear that Proposition 5.3 continues to hold for $u = $ PI $\psi$, given $\psi \in C(\partial\Omega)$.

We now examine conditions for PI $\psi = u$ to be continuous at a given boundary point $z_0 \in \partial\Omega$, involving the use of barriers. By definition, a function $w \in C^2(\Omega)$ is a barrier at $z_0$ for $\Omega$ provided $\Delta w \le 0$ in $\Omega$, $w(x) \to 0$ as $x \to z_0$, and, for any neighborhood $U$ of $z_0$ in $\overline{M}$, there is a $\delta > 0$ such that

(5.22) $w(x) \ge \delta, \quad \text{for } x \in \Omega \setminus U.$

There are more general concepts of barriers, and we will use some of them later on, though for clarity we will give them different names, like "weak barriers," and so on. A point $z_0 \in \partial\Omega$ is called a regular point provided the conclusion of the following proposition holds.


Proposition 5.4. If there is a barrier at $z_0 \in \partial\Omega$, then, for $\psi \in C(\partial\Omega)$, $u = $ PI $\psi$,

(5.23) $\displaystyle \lim_{x \to z_0} u(x) = \psi(z_0).$

Proof. By a simple limiting argument, it suffices to prove the result for $\psi \in \mathcal{E}(\partial\Omega)$; suppose $\psi = \varphi|_{\partial\Omega}$, $\varphi \in C^\infty(\overline{M})$. Fix $\varepsilon > 0$. Then there exists $k > 0$ such that, for each $j$,

(5.24) $-\varepsilon - kw + \varphi(z_0) \le u_j(x) \le \varphi(z_0) + \varepsilon + kw$

on $\partial\Omega_j$. This is arranged by picking $k$ so large that $|\varphi(y) - \varphi(z_0)| \le \varepsilon + kw$ on $\partial\Omega_j$ for all $j$, so (5.24) holds on $\partial\Omega_j$. By the maximum principle, (5.24) must hold on $\Omega_j$, if $w$ satisfies $\Delta w \le 0$. Letting $j \to \infty$, by (5.16), we have

(5.25) $-\varepsilon - kw(x) + \varphi(z_0) \le u(x) \le \varphi(z_0) + \varepsilon + kw(x), \quad x \in \Omega.$

Since $w(x) \to 0$ as $x \to z_0$, this implies

(5.26) $\displaystyle \varphi(z_0) - \varepsilon \le \liminf_{x \to z_0} u(x) \le \limsup_{x \to z_0} u(x) \le \varphi(z_0) + \varepsilon,$

for all $\varepsilon > 0$, which proves the proposition.

It turns out to be easier to construct an object that we call a weak barrier, defined as follows. A function $w \in C^2(\Omega)$ is a weak barrier at $z_0 \in \partial\Omega$ provided $\Delta w \le 0$ in $\Omega$, $w(x) > 0$ for $x \in \Omega$, and $w(x) \to 0$ as $x \to z_0$.

We give a couple of examples of barriers and weak barriers, for planar domains.


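Proposition 5.5. Let $\Omega \subset \mathbb{R}^2 = \mathbb{C}$, $z_0 \in \partial\Omega$. Suppose there is a simple curve $\gamma$, lying in $\mathbb{C} \setminus \Omega$, connecting $z_0$ to $\infty$. Then there is a barrier at $z_0$, so $z_0$ is a regular point.

Proof. Cut $\mathbb{C}$ along $\gamma$; for any $K > 0$, $\log[(z - z_0)/K]$ can be defined as a single-valued holomorphic function in $\mathbb{C} \setminus \gamma$. For $K > \operatorname{diam} \Omega$, the harmonic function

(5.27) $V = -\operatorname{Re} \dfrac{1}{\log\bigl(\frac{z - z_0}{K}\bigr)}$

is easily verified to be a barrier, so $z_0$ is a regular point.

We note that if $(z - z_0)/K = re^{i\theta}$, with $\theta$ continuous on $\mathbb{C} \setminus \gamma$, then

$V = -\dfrac{\log r}{(\log r)^2 + \theta^2}.$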

A larger class of planar domains is treated by the following result.

Proposition 5.6. If $\Omega \subset \mathbb{C}$ is any bounded, simply connected domain, $z_0 \in \partial\Omega$, then we can define a single-valued branch of (5.27) on $\Omega$, which will be a weak barrier function.

Proof. This is clear. Note that the conclusion also holds if $\Omega$ is contained in a simply connected region $\Omega'$, with $z_0 \in \partial\Omega'$.

We remark that there exist domains satisfying the hypotheses of Proposition 5.6, for which $V$, given by (5.27), is not a genuine barrier, in the sense of the first definition. We indicate one example in Fig. 5.1. The region $\Omega$ illustrated there winds infinitely often around the circle that is its inner boundary, and $z_0$ lies on this circle. Below we will show that whenever a weak barrier exists, then a genuine barrier exists. Indeed, somewhat more will be demonstrated, in Proposition 5.12.

First, we show how to use the concept of weak barrier directly to examine the continuity at the boundary of Green functions (5.3). Let $G_{pj}$ be such Green functions defined on the domain $\Omega_j$, with smooth boundary, so

(5.29) $G_{pj} = E_p + F_j,$

where $F_j \in C^\infty(\overline{\Omega}_j)$ satisfies $\Delta F_j = 0$, $F_j|_{\partial\Omega_j} = -E_p|_{\partial\Omega_j}$. Thus

(5.30) $G_p - G_{pj} = F - F_j \ \text{on } \Omega_j,$

and hence, by (5.15),

(5.31) $G_p - G_{pj} \longrightarrow 0$, in $C^\infty(\mathcal{O})$, for $\mathcal{O} \subset\subset \Omega$.

FIGURE 5.1 A Winding Region

Since $G_{p\ell}(x) \to -\infty$ as $x \to p$ and $G_{p\ell}(x) = 0$ for $x \in \partial\Omega_\ell$, then, by the maximum principle,

$G_{p\ell}(x) < 0, \quad \text{for } x \in \Omega_\ell \setminus \{p\},$

and hence for $x \in \Omega_j \setminus \{p\}$, for all $\ell \ge j$, so we certainly have $G_p(x) \le 0$ on $\Omega \setminus \{p\}$. Applying the strong maximum principle on $\Omega_j$, $j \to \infty$, we can strengthen this to

(5.32) $G_p(x) < 0 \ \text{on } \Omega \setminus \{p\}.$


Now we show directly that weak barriers yield continuity of the Green function $G_p$ at boundary points.

Proposition 5.7. Let $z_0 \in \partial\Omega$. Suppose there exists a function $V \in C^2(\Omega)$ that is a weak barrier at $z_0$. Then $G_p(x) \to 0$ as $x \to z_0$.

Proof. Fix a compact set $K \subset \Omega$, containing a neighborhood of $p$. Then there exists a $k$ such that, for all $j$,

$-kV \le G_{pj} \ \text{on } \partial(\Omega_j \setminus K),$

since we know $\{G_{pj}\}$ is uniformly bounded on $\partial K$, and $G_{pj} = 0$ on $\partial\Omega_j$. Then, by the maximum principle,

$-kV \le G_{pj} \ \text{on } \Omega_j \setminus K.$

By (5.31) and (5.32), we have

(5.33) $-kV \le G_p < 0 \ \text{on } \Omega \setminus K,$

which yields the proposition.


Propositions 5.6 and 5.7 will suffice for our treatment of the Riemann mapping 
theorem for general, simply connected domains in the next section, but we will 
proceed with some further results. 

First, we consider local versions of barriers. A function $w \in C^2(\Omega)$ is a local barrier (resp., weak local barrier) at $z_0 \in \partial\Omega$ provided there is a neighborhood $U$ of $z_0$ in $\overline{M}$ such that $w|_{\Omega \cap U}$ is a barrier (respectively, weak barrier) at $z_0$ for $\Omega \cap U$.

The motivation for studying this concept is that local barriers and weak local 
barriers are frequently easier to construct than their global counterparts. However, 
when the local objects exist, their global counterparts do, too. This is easy to prove 
for (genuine) barriers. 


Proposition 5.8. If $w$ is a local barrier at $z_0$, then there exists a barrier for $\Omega$ at $z_0$, equal to $w$ in some neighborhood of $z_0$.

Proof. Let $f : \mathbb{R} \to \mathbb{R}$ be a $C^\infty$-function, with $f(0) = 0$, $f'(0) > 0$. A simple calculation shows

(5.34) $\Delta f(u) = f'(u)\Delta u + f''(u)|du|^2,$

where $|du|^2 = g^{jk}(x)\, \partial_j u\, \partial_k u$. Thus, if $\Delta w \le 0$ on $\Omega \cap U$, we have $\Delta f(w) \le 0$ on $\Omega \cap U$ provided

(5.35) $f'(u) \ge 0, \quad f''(u) \le 0.$

Take $f$ to be such a function, with the additional property of being identically 1 for $u \ge \delta$, so the graph of $f$ is as depicted in Fig. 5.2.
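
The identity (5.34) and the sign conclusion drawn from (5.35) can be checked numerically in the flat case. The sketch below is an illustration under assumptions made here: the metric is Euclidean on $\mathbb{R}^2$ (so $g^{jk} = \delta^{jk}$), the sample function is $u(x, y) = 1 + xy - x^2$ with $\Delta u = -2$, and $f(t) = 1 - e^{-t}$, which satisfies $f(0) = 0$, $f' > 0$, $f'' < 0$ but is not the cutoff function of Fig. 5.2. All names are hypothetical.

```python
import math

def u(x, y):
    # Sample superharmonic function: Laplacian of u is -2 everywhere.
    return 1.0 + x * y - x * x

def f(t):
    # Sample concave increasing function with f(0) = 0.
    return 1.0 - math.exp(-t)

def laplacian_fd(g, x, y, h=1e-3):
    # Five-point finite-difference approximation of the flat Laplacian.
    return (g(x + h, y) + g(x - h, y) + g(x, y + h) + g(x, y - h) - 4.0 * g(x, y)) / (h * h)

x0, y0 = 0.4, 0.3
lhs = laplacian_fd(lambda x, y: f(u(x, y)), x0, y0)   # Delta f(u), numerically

lap_u = -2.0
grad_u_sq = (y0 - 2.0 * x0) ** 2 + x0 ** 2            # |du|^2 for this u
fp = math.exp(-u(x0, y0))                             # f'(u)
fpp = -math.exp(-u(x0, y0))                           # f''(u)
rhs = fp * lap_u + fpp * grad_u_sq                    # right side of (5.34)
print(lhs, rhs)
```

Both terms on the right side of (5.34) are nonpositive here, which is exactly how (5.35) is used in the proof.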

If $w$ satisfies the barrier condition on $\Omega \cap U$ and in particular $w(x) \ge \delta$ outside $U_1 \subset\subset U$, define $w_1(x)$ by

(5.36) $w_1(x) = f(w(x))$, for $x \in U_1 \cap \Omega$; $\quad w_1(x) = 1$, for $x \in \Omega \setminus U_1$.

Then $w_1$ is a barrier for $\Omega$ at $z_0$.

The argument above does not work if $w$ is a weak local barrier. We now tackle the problem of dealing with weak local barriers.

Proposition 5.9. If $w$ is a weak local barrier for $\Omega$ at $z_0 \in \partial\Omega$, then there exists a local barrier for $\Omega$ at $z_0$.


Proof. Start with the function

(5.37) $\displaystyle h(x) = \sum_j (x_j - z_{0,j})^2$

in a local coordinate patch about $z_0$, chosen to be normal at $z_0$. Thus $h(z_0) = 0$ and this is the strict minimum of $h$. While $\Delta$ may not be the flat Laplacian $\Delta_0$ in these coordinates, their coefficients do coincide at $z_0$. Thus, if $U_1$ is a sufficiently small neighborhood of $z_0$ in $\overline{M}$,

(5.38) $\Delta h(x) \ge C > 0 \ \text{in } U_1.$

Also pick $U_1$ sufficiently small that $w$ is a weak barrier for $\Omega \cap U_1$ at $z_0$. Then define $w_1$ to be the Poisson integral of $h|_{\partial(\Omega \cap U_1)}$, where

PI $: C(\partial(\Omega \cap U_1)) \longrightarrow L^\infty(\Omega \cap U_1).$

We claim that $w_1$ is a barrier for $\Omega \cap U_1$, hence is the desired local barrier. Clearly $\Delta w_1 = 0$ on $\Omega \cap U_1$. Next we claim that

(5.39) $w_1(x) \ge h(x), \quad \text{for } x \in \Omega \cap U_1.$

Indeed, we can write

(5.40) $\displaystyle w_1(x) = \lim_{j \to \infty} u_j(x),$

where $u_j$ are harmonic on $\mathcal{O}_j \nearrow \Omega \cap U_1$, with $\partial\mathcal{O}_j$ smooth and

$u_j|_{\partial\mathcal{O}_j} = h(x),$

and we can apply the maximum principle, Proposition 2.1, to $h - u_j$. This proves (5.39). To prove Proposition 5.9, it remains to establish that

(5.41) $\displaystyle \lim_{x \to z_0} w_1(x) = h(z_0) = 0.$

FIGURE 5.2 Graph of the Function f

This takes some effort, so we will take a break and advertise this formally.

Lemma 5.10. The function $w_1$ constructed above satisfies (5.41) in $\Omega \cap U_1$.


Proof. For convenience, we relabel $\Omega \cap U_1$, calling it $\Omega$; also denote $\mathcal{O}_j$ above by $\Omega_j$. Recall that we are working in an exponential coordinate system centered at $z_0$; in particular, $g_{jk}(z_0) = \delta_{jk}$. Let $B_\rho$ be the ball of radius $\rho$ in $\mathbb{R}^n$ centered at the origin (identified with $z_0$), as illustrated in Fig. 5.3. Assume $\rho > 0$ is small. We can suppose that $\partial B_\rho \cap \Omega \ne \emptyset$. Let $F$ be a compact subset of $\partial B_\rho \cap \Omega$ such that the $(n-1)$-dimensional measure of $\partial B_\rho \cap \Omega \setminus F$ is less than $\rho/2$ times the measure of $\partial B_\rho$. Assume that $F \subset \partial B_\rho$ has a smooth $(n-2)$-dimensional boundary.

Let $f$ be the product of the characteristic function of $\partial B_\rho \cap \Omega$ with a nonnegative $C^\infty$ function, $\le 1$, equal to 1 on $\partial B_\rho \cap \Omega \setminus F$, such that

(5.42) $\displaystyle \frac{1}{A_\rho} \int_{\partial B_\rho} |f|^2\, dS \le \rho,$

where $A_\rho$ is the area of $\partial B_\rho$.

FIGURE 5.3 Setup for Local Barrier Construction

Then define

(5.43) $q \in C^\infty(B_\rho),$

by

(5.44) $\Delta q = 0 \ \text{on } B_\rho, \quad q|_{\partial B_\rho} = f.$

Results presented in §2 (cf. Proposition 2.9) imply that there is a unique such $q$, and such $q$ also satisfies $\|q\|_{L^\infty(B_\rho)} \le 1$, and

(5.45) $q(x) \longrightarrow f(y)$ as $x \to y$, $\forall\, y \in \partial B_\rho \setminus \partial\Omega$;

in particular,

(5.46) $q(x) \longrightarrow f(y)$ as $x \to y$, $\forall\, y \in \partial B_\rho \cap \Omega$.

The mapping property PI $: L^2(\partial B_\rho) \to C^\infty(B_\rho)$ (cf. Proposition 1.8) plus (5.42) give

(5.47) $0 \le q(z_0) \le C\rho^{1/2}.$

Remark: In case $\Delta = \Delta_0 = \partial_1^2 + \cdots + \partial_n^2$, the results (5.43)-(5.47) also follow from the formula for PI in §3. One can replace (5.42) by the condition that $f$ have mean value $\le \rho$ on $\partial B_\rho$, and then (5.47) is sharpened to $q(z_0) \le C\rho$, by the mean value property.


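Now let

(5.48) $\displaystyle M = \sup_{\Omega} h(x) = \sup_{\Omega} w_1(x),$

let

(5.49) $\displaystyle \kappa = \inf_{F} w(x) > 0,$

where $w$ is the weak (local) barrier hypothesized in Proposition 5.9, and consider

(5.50) $s(x) = w_1(x) - \rho - \dfrac{M}{\kappa}\, w(x) - M q(x), \quad x \in \mathcal{O}_\rho,$

where $\mathcal{O}_\rho = \Omega \cap B_\rho$. We know that

(5.51) $\displaystyle s(x) = \lim_{j \to \infty} s_j(x),$

where

(5.52) $s_j(x) = u_j(x) - \rho - \dfrac{M}{\kappa}\, w(x) - M q(x) \ \text{on } \overline{\Omega}_j \cap \overline{B}_\rho,$

$u_j$ being given by (5.40). Now $s_j(x)$ is continuous on $\overline{\Omega}_j \cap \overline{B}_\rho$, and $\Delta s_j \ge 0$ on the interior, by (5.47). Also, using (5.45), and noting that, on $\partial\Omega_j \cap B_\rho$, $u_j = h \le \rho^2 < \rho$ for $\rho$ small, we see that

(5.53) $s_j(x) \le 0$

on $\partial(\Omega_j \cap B_\rho) = (\partial\Omega_j \cap \overline{B}_\rho) \cup (\overline{\Omega}_j \cap \partial B_\rho)$. By the maximum principle, (5.53) holds on $\Omega_j \cap B_\rho$. Passing to the limit gives $s(x) \le 0$ on $\Omega \cap B_\rho$, hence

(5.54) $w_1(x) \le \rho + \dfrac{M}{\kappa}\, w(x) + M q(x), \quad \text{for } x \in \Omega \cap B_\rho.$

This implies

(5.55) $\displaystyle \limsup_{x \to z_0,\ x \in B_\rho \cap \Omega} w_1(x) \le \rho + MC\rho^{1/2},$

since $w(x) \to 0$ as $x \to z_0$, and hence

(5.56) $\displaystyle \limsup_{x \to z_0} w_1(x) \le 0.$

Together with (5.39), this gives (5.41). The proof that $w_1$ is a barrier is complete; hence so is the proof of Proposition 5.9.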


Combining Propositions 5.4, 5.8, and 5.9, we have the following conclusion, essentially due to G. Bouligand.

Proposition 5.11. Given $z_0 \in \partial\Omega$, the following are equivalent:

(5.57) there is a weak local barrier at $z_0$;
(5.58) there is a barrier at $z_0$;
(5.59) $z_0$ is a regular point.

Proof. To close the argument, we show that (5.59) $\Rightarrow$ (5.57). Indeed, given $z_0 \in \partial\Omega$, define $f \in C(\partial\Omega)$ as $f(x) = \operatorname{dist}(x, z_0)$, and set $w = $ PI $f$. We see that if $z_0$ is a regular point, then $w$ is a weak barrier, and a fortiori a weak local barrier, at $z_0$.


We next record the following consequence of localizability of the concept of a regular point.

Corollary 5.12. Suppose $\Omega$ and $\Omega'$ are open subsets of $M$, with a common boundary point $z_0$. Suppose there is a neighborhood $U$ of $z_0$ such that

(5.60) $\Omega' \cap U \supset \Omega \cap U.$

If $z_0$ is regular for $\Omega'$, then $z_0$ is regular for $\Omega$.

Proof. A barrier at $z_0$ for $\Omega'$ gives a local barrier at $z_0$ for $\Omega$.


As an application, we can localize the result on simply connected planar domains given in Proposition 5.6, to obtain the following result of H. Lebesgue.

Proposition 5.13. Let $\Omega \subset \mathbb{R}^2$ be an open, bounded, connected region, and let $z_0 \in \partial\Omega$. Suppose the connected component of $\partial\Omega$ containing $z_0$ consists of more than one point (i.e., is a "continuum"). Then $z_0$ is a regular point.

Proof. Let $\Gamma$ be the connected component of $\partial\Omega$ containing $z_0$. Let $\Omega'$ be the connected component of $\mathbb{C} \setminus \Gamma$ containing $\Omega$. Thus $\partial\Omega' = \Gamma$. If $\Omega'$ is bounded, then $\partial\Omega'$ connected implies that the planar domain $\Omega'$ is simply connected, so, by Proposition 5.6, $z_0$ is regular for $\Omega'$. Hence, by Corollary 5.12, $z_0$ is regular for $\Omega$ in this case.

On the other hand, if $\Omega'$ is not bounded, pick $z_1 \in \Gamma$, at a maximal distance from $z_0$ ($z_1 \ne z_0$, under the hypothesis of the proposition). Let $\rho$ denote the ray from $z_1$ to infinity, directly away from $z_0$. Then let $\Omega''$ be $\Omega' \setminus \rho$, intersected with some disk $D$ of large radius centered at $z_1$, so $\partial\Omega'' = \Gamma \cup (\rho \cap D) \cup \partial D$. Thus $\Omega''$ is simply connected. Since $\Omega''$ coincides with $\Omega'$ on a neighborhood of $z_0$, we again have $z_0$ regular for $\Omega$, and the proof is complete.


It is not hard to show that an isolated boundary point of $\Omega$ is always irregular, when $\dim \Omega \ge 2$. This can be obtained as a consequence of the following simple result.

Proposition 5.14. The space $\mathcal{F}$ of functions in $C_0^\infty(\mathbb{R}^n)$ vanishing in a neighborhood of a given point $p$ is dense in $H^1(\mathbb{R}^n)$ if $n \ge 2$.

Proof. The annihilator of $\mathcal{F}$ is the space of elements of $H^{-1}(\mathbb{R}^n)$ supported at $p$. But any distribution supported at $p$ is a linear combination of derivatives of the delta function $\delta_p$, and none of these belong to $H^{-1}(\mathbb{R}^n)$, except for 0.

More generally, we say a compact set $K$ in the interior of $M$ is negligible if it supports no nonzero elements of $H^{-1}(M)$. For example, a smooth submanifold of codimension $\ge 2$ is negligible.


Proposition 5.15. Suppose a boundary point $z_0 \in \partial\Omega$ has a neighborhood $U$ in $M$ whose intersection with $\partial\Omega$ is negligible. Then $z_0$ is an irregular boundary point.

Proof. Let $\Omega' = \Omega \cup U$. Thus $z_0$ is an interior point of $\Omega'$. The hypothesis implies

$H_0^1(\Omega') = H_0^1(\Omega).$

Now, pick $f \in \mathcal{E}(\partial\Omega)$ such that $f = 0$ near $z_0$, $f \ge 0$ on all of $\partial\Omega$, and $f > 0$ somewhere. We claim that if we consider

$u = $ PI $f$; $\quad$ PI $: C(\partial\Omega) \longrightarrow C^\infty(\Omega),$

then $u(x)$ does not tend to 0 as $x \to z_0$. Indeed, $u$ is simply the restriction to $\Omega$ of $u' = $ PI $f'$, where

PI $: C(\partial\Omega') \longrightarrow C^\infty(\Omega')$

and $f'$ is the restriction of $f$ to $\partial\Omega'$. The strong maximum principle implies that $u' > 0$ everywhere in $\Omega'$, including at $z_0$, so the proof is done.


N. Wiener obtained a precise characterization of regular and irregular points in
terms of "capacity," a certain countably subadditive set function defined on Borel
sets. A boundary point $z_0 \in \partial\Omega$ is irregular provided the capacity of $\partial\Omega$
intersected with a small ball centered at $z_0$ decreases fast enough. The negligible sets
defined above are precisely the compact sets of capacity zero. This characterization
has a natural probabilistic analysis, using the theory of Brownian motion. In
Chap. 11 we will discuss Brownian motion and present such a proof of Wiener's
theorem.

We will derive one more sufficient condition for $z_0 \in \partial\Omega$ to be a regular point,
due to S. Zaremba.


Proposition 5.16. Let $\Omega$ be a bounded, open, connected subset of $\mathbb{R}^n$, with its
flat metric. Suppose $z_0 \in \partial\Omega$ and there exists a cone $C$ with vertex at $z_0$ such that,
for some ball $B$ centered at $z_0$,


(5.61)  $B \cap C \setminus \{z_0\} \subset \mathbb{R}^n \setminus \Omega$.

Then $z_0$ is a regular point for $\Omega$.

Proof. By Corollary 5.12, it suffices to show that $z_0$ is a regular point for $B \setminus C$,
where $B$ is some ball centered at $z_0$. We can translate coordinates so that $z_0$ is the
origin. We will construct a weak barrier for $B \setminus C$ at $z_0 = 0$ of the form


(5.62)  $v(x) = r^{\alpha}\varphi_0(\omega)$,   $x = r\omega$,


where $\varphi_0(\omega)$ is an eigenfunction of the Laplace operator $\Delta_S$ on the region $O =
S^{n-1} \setminus C$, an open subset of the sphere with nonempty smooth boundary:


(5.63)  $\varphi_0 \in H_0^1(O)$,   $\Delta_S\varphi_0 = -\mu\varphi_0$,


and $\mu > 0$ is the smallest eigenvalue of $-\Delta_S$ on $O$. The formula for the Laplace
operator in polar coordinates

(5.64)  $\Delta v = \frac{\partial^2 v}{\partial r^2} + \frac{n-1}{r}\frac{\partial v}{\partial r} + \frac{1}{r^2}\Delta_S v$


shows that (5.62) defines a function harmonic in $\Omega_1 = B \setminus C$, continuous on $\overline{\Omega}_1$,
and vanishing at $z_0$, if we take


(5.65)  $\alpha = -\frac{n-2}{2} + \Bigl[\mu + \frac{(n-2)^2}{4}\Bigr]^{1/2}$.


Note that $\alpha > 0$. Furthermore, as shown in Proposition 2.4, since $\mu$ is minimal, the
eigenfunction $\varphi_0$ is nowhere vanishing on the interior of $O$; thus it can be taken
to be positive there. This makes $v$ a weak barrier and completes the proof. Note
that if the cone is shrunk, such construction produces a (genuine) local barrier for
$\Omega$ at $z_0$.
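
The choice of exponent in (5.65) can be verified by substituting (5.62) into the polar-coordinate formula (5.64). The following lines are a sketch of that substitution, using only the eigenvalue relation (5.63).

```latex
\Delta\bigl(r^{\alpha}\varphi_0(\omega)\bigr)
 = \bigl[\alpha(\alpha-1) + (n-1)\alpha\bigr] r^{\alpha-2}\varphi_0
   + r^{\alpha-2}\,\Delta_S\varphi_0
 = \bigl[\alpha^2 + (n-2)\alpha - \mu\bigr]\, r^{\alpha-2}\varphi_0 ,
\\
\alpha^2 + (n-2)\alpha - \mu = 0
 \;\Longrightarrow\;
 \alpha = -\frac{n-2}{2} \pm \Bigl[\mu + \frac{(n-2)^2}{4}\Bigr]^{1/2} .
```

Taking the plus sign gives $\alpha > 0$ because $\mu > 0$. As an illustration (an assumed special case, not taken from the text), for $n = 2$ with $O$ an arc of opening angle $\theta_0$ one has $\mu = (\pi/\theta_0)^2$ and $\alpha = \pi/\theta_0$, which recovers the familiar harmonic function $r^{\pi/\theta_0}\sin(\pi\theta/\theta_0)$ on a sector.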


We remark that, by considering $v_1 = f(v)$ with $f' > 0$, $f'' < 0$, $f(0) = 0$, we
can construct a weak barrier $v_1$ satisfying $\Delta_0 v_1 \le -C < 0$ on $B \setminus C$, where $\Delta_0 =
\partial_1^2 + \cdots + \partial_n^2$ is the flat Laplacian considered in Proposition 5.16. Such $v_1$ would
also be a weak barrier with the flat Laplacian $\Delta_0$ replaced by a small perturbation,
of the form (5.44). In this way we can obtain an analogue of Proposition 5.16 for
the general class of subdomains of a Riemannian manifold with boundary, which
we have been dealing with in this section. Details are left as an exercise.


Exercises 


1. Suppose $\Omega$ is a bounded region in $\mathbb{R}^n$. Suppose there is a point $p \in \mathbb{R}^n \setminus \overline{\Omega}$ such that
$z_0$ is the point in $\overline{\Omega}$ closest to $p$. Show that


(5.66)  $w(x) = -|x - p|^{2-n}$   ($\log |x - p|$ if $n = 2$)


is a barrier at $z_0$. Note that such $p$ exists provided there exists a sphere in $\mathbb{R}^n \setminus \Omega$,
touching $\partial\Omega$ precisely at $z_0$. If this happens, we say $\Omega$ satisfies the exterior sphere
condition at $z_0$. Show that this condition holds for every $C^2$-boundary, but not for every
$C^1$-boundary. Show it holds whenever $\Omega$ is convex.

2. Denote by $C^k(\partial\Omega)$ the space of restrictions to $\partial\Omega$ of elements of $C^k(\mathbb{R}^n)$. Let $f \in
C^2(\mathbb{R}^n)$, and assume that $\Omega$ satisfies the exterior sphere condition. Given $z_0 \in \partial\Omega$,
show that there exists a barrier of the form (5.66) and a $K < \infty$, such that, for all
$z \in \partial\Omega$,


$-K[w(z) - w(z_0)] \le f(z) - f(z_0) - (z - z_0)\cdot\nabla f(z_0) \le K[w(z) - w(z_0)]$,


with strict inequality except at $z_0 \in \partial\Omega$. Deduce from the maximum principle that such
an inequality holds inside $\Omega$, for $u = \mathrm{PI}(f)$. When can you deduce that


$\mathrm{PI} : C^2(\partial\Omega) \to \mathrm{Lip}(\overline{\Omega})$?


(Hint: Look for uniform estimates on (;.) 
When can you replace $\mathrm{Lip}(\overline{\Omega})$ by $C^1(\overline{\Omega})$?




3. Replace barriers (5.66) by barriers of the form (5.62), and obtain boundary regularity
results for more general domains $\Omega$ and less regular $f$, such as


$\mathrm{PI} : \mathrm{Lip}(\partial\Omega) \to C^{\alpha}(\overline{\Omega})$,


for an appropriate class of domains $\Omega \subset \mathbb{R}^n$, with $0 < \alpha < 1$.
For a systematic treatment of Hölder estimates, see Chap. 6 of [GT], and references
given there.

4. For the Green functions $G_{pj}$ on $\Omega_j \nearrow \Omega$, approaching $G_p$ as in (5.31), show that


$G_{pj} \searrow G_p$.


5. For the approximating solutions $v_j \in H_0^1(\Omega_j)$ in (5.9)-(5.14), show that (5.14) can be
strengthened to
$v_j \to v$ in the $H^1(\Omega)$-norm.
(Hint: Show that $\|dv_j\|_{L^2} \to \|dv\|_{L^2}$.)
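6. Show that if $\Omega \subset \mathbb{R}^n$ is open and bounded (with smooth boundary) and $\Delta u = f$ on $\Omega$,
with $u|_{\partial\Omega} = 0$, then

(5.67)  $\sum_{j,k} \int_\Omega |\partial_j\partial_k u(x)|^2\,dx = \int_\Omega |\Delta u(x)|^2\,dx + (n-1)\int_{\partial\Omega} \Bigl|\frac{\partial u}{\partial\nu}\Bigr|^2 H(x)\,dS(x)$,


where $H(x)$ is the mean curvature of $\partial\Omega$ (with respect to the outward-pointing normal).
This is known as Kadlec's formula.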
7. Using Exercise 6, deduce that, for $\Omega$ convex, but with no other regularity assumed,


$u \in H_0^1(\Omega)$,  $\Delta u = f \in L^2(\Omega) \Longrightarrow u \in H^2(\Omega)$.


(Hint: Look for uniform estimates on $\Omega_j$. Each mean curvature $H_j(x)$ is $\le 0$.)
Compare results in [Gri].


6. The Riemann mapping theorem (rough boundary) 


Let $\Omega$ be a bounded open domain in $\mathbb{R}^2 = \mathbb{C}$ which is connected and simply
connected. We aim to construct a one-to-one holomorphic map


(6.1)  $\Phi : \Omega \to D$


of $\Omega$ onto the unit disk $D$. The construction of $\Phi$ will be similar to that given in
§4 for domains with smooth boundary, but the proof that (6.1) is one-to-one and
onto will be slightly different from the smooth case, and of course the conclusion
will be weaker.

With $p \in \Omega$ given, we take the Green function $G = G_p$, constructed in §5.
Thus $\Delta G = c\,\delta_p$ ($c > 0$), $G(z) < 0$ on $\Omega \setminus \{p\}$. By Propositions 5.6 and 5.11,
every point of $\partial\Omega$ is regular, so $\lim_{z\to z_0} G(z) = 0$ for each $z_0 \in \partial\Omega$ (i.e., $G$ is
continuous on $\overline{\Omega} \setminus \{p\}$). We can write, for $z \in \Omega$,

(6.2)  $G(z) = \log |z - p| + G_0(z)$,


with $G_0 \in C(\overline{\Omega}) \cap C^\infty(\Omega)$, $\Delta G_0 = 0$, and we can construct the harmonic
conjugate of $G_0$,


(6.3)  $H_0 \in C^\infty(\Omega)$,


as


(6.4)  $H_0(z) = \int_p^z \Bigl[-\frac{\partial G_0}{\partial y}\,dx + \frac{\partial G_0}{\partial x}\,dy\Bigr]$,


the integral being along any path from $p$ to $z$ in $\Omega$, as before. As opposed to the
case where $\partial\Omega$ is smooth, in general we cannot guarantee that $H_0 \in C(\overline{\Omega})$. In
particular, there is no guarantee that $\Phi$ extends continuously to $\overline{\Omega}$ unless some
restrictions are placed on $\partial\Omega$. As before, $\Phi$ is defined by


(6.5)  $\Phi(z) = e^{G+iH} = (z - p)\,e^{G_0+iH_0}$,


where $H(z) = \mathrm{Im}\,\log(z - p) + H_0$. We aim to prove the following Riemann
mapping theorem.


Theorem 6.1. If $\Omega$ is a bounded, simply connected domain, then the map $\Phi :
\Omega \to D$ given by (6.5) is one-to-one and onto.


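Proof. Since $G$ is continuous on $\overline{\Omega} \setminus \{p\}$, we see that

(6.6)  $|\Phi| : \overline{\Omega} \to [0, 1]$


is continuous, hence uniformly continuous; it takes $\partial\Omega$ to $\{1\}$. Fix $\varepsilon > 0$. If
$\gamma_\varepsilon \subset \Omega$ is a simple closed curve, enclosing $p$, which stays sufficiently close to
$\partial\Omega$, then


(6.7)  $\sigma_\varepsilon = \Phi(\gamma_\varepsilon) \subset D \setminus D_{1-\varepsilon}$,


where $D_\rho = \{z \in \mathbb{C} : |z| < \rho\}$. By the argument principle, for any $c \in D_{1-\varepsilon}$,
the degree of $\sigma_\varepsilon$ about $c$ is equal to the number (counting multiplicities) of points
$q_j \in \Omega_\varepsilon$ (the region enclosed by $\gamma_\varepsilon$) such that $\Phi(q_j) = c$. This winding number is
independent of $c \in D_{1-\varepsilon}$. But for $c = 0$, we see from (6.5) that $p$ is the only zero
of $\Phi$, a simple zero, so the winding number is one. Thus, for all $c \in D_{1-\varepsilon}$, there
is a unique $q \in \Omega_\varepsilon$ such that $\Phi(q) = c$. Letting $\varepsilon \to 0$, we have the theorem.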


As noted in Exercise 6 of §4, such a map $\Phi$ is essentially unique. It is called
the Riemann mapping function.
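
As a consistency check, the construction (6.2)-(6.5) can be carried out by hand when $\Omega$ is the unit disk $D$ itself. The formulas below are a sketch under that assumption; the explicit Green function used is the standard one for the disk and is not derived in the text above.

```latex
G(z) = \log\left|\frac{z-p}{1-\bar{p}z}\right|
     = \log|z-p| + G_0(z), \qquad
G_0(z) = -\log|1-\bar{p}z| = -\operatorname{Re}\log(1-\bar{p}z),
\\
H_0(z) = -\operatorname{Im}\log(1-\bar{p}z), \qquad
G_0 + iH_0 = -\log(1-\bar{p}z),
\\
\Phi(z) = (z-p)\,e^{G_0+iH_0} = \frac{z-p}{1-\bar{p}z} .
```

Here $\log$ is the principal branch, which is well defined since $\operatorname{Re}(1-\bar{p}z) > 0$ on $D$, and $H_0(p) = 0$ as required by (6.4). In this case (6.5) reproduces the familiar linear fractional automorphism of $D$ taking $p$ to $0$.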

The Riemann mapping function $\Phi$ does not always extend to be a homeomorphism
of $\overline{\Omega}$ onto $\overline{D}$; clearly a necessary condition for this is that $\partial\Omega$ be
homeomorphic to $S^1$, that is, that it be a Jordan curve. In fact, C. Carathéodory
proved that this condition is also sufficient. A proof can be found in [Ts]. Here we
establish a simpler result.


FIGURE 6.1 Approaching the Boundary


Proposition 6.2. Assume $\Omega \subset \mathbb{R}^2$ is a simply connected region whose boundary
$\partial\Omega$ is a finite union of smooth curves. Then the Riemann mapping function $\Phi$
extends to a homeomorphism $\Phi : \overline{\Omega} \to \overline{D}$.


Proof. Local elliptic regularity implies $G_0$ and hence $H_0$ and $\Phi$ extend smoothly
to the smooth part of $\partial\Omega$. Also, an application of Zaremba's principle as in §4
shows that the smooth parts of $\partial\Omega$ are mapped diffeomorphically onto open
intervals in $S^1 = \partial D$. Let $J_1$ and $J_2$ be smooth curves in $\partial\Omega$, meeting at $p$,
as illustrated in Fig. 6.1, and denote by $I_\nu$ the images in $S^1$, $I_\nu = \Phi(J_\nu)$. It will
suffice to show that $I_1$ and $I_2$ meet, that is, the endpoints $q_1$ and $q_2$ pictured in
Fig. 6.1 coincide.

Let $\gamma_r$ be the intersection $\Omega \cap \{z : |z - p| = r\}$, and let $\ell(r)$ be the length of
$\Phi(\gamma_r) = \sigma_r$. Clearly, $|q_1 - q_2| \le \ell(r)$ for all (small) $r > 0$, so we would like to
show that $\ell(r)$ is small for (some) small $r$.

We have $\ell(r) = \int_{\gamma_r} |\Phi'(z)|\,|dz|$, and Cauchy's inequality implies


(6.8)  $\frac{\ell(r)^2}{r} \le 2\pi \int_{\gamma_r} |\Phi'(z)|^2\,ds$.


If $\ell(r) \ge \delta$ for $\varepsilon \le r \le R$, then integrating over $r \in [\varepsilon, R]$ implies


(6.9)  $\delta^2 \log\frac{R}{\varepsilon} \le 2\pi \iint_{\Omega(\varepsilon,R)} |\Phi'(z)|^2\,dx\,dy = 2\pi \cdot \mathrm{Area}\,\Phi(\Omega(\varepsilon, R)) \le 2\pi^2$,


where $\Omega(\varepsilon, R) = \Omega \cap \{z : \varepsilon \le |z - p| \le R\}$. Since $\log(1/\varepsilon) \to \infty$ as $\varepsilon \searrow 0$, this
implies that, for any $\delta > 0$, there exists arbitrarily small $r > 0$ such that $\ell(r) < \delta$.
Hence $|q_1 - q_2| < \delta$, so $q_1 = q_2$, as needed to complete the proof.


FIGURE 6.2 Fundamental Domains
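
The passage from (6.8) to (6.9) is a length-area argument. Written out, a sketch of the two steps reads as follows; the only inputs are the Cauchy-Schwarz inequality on $\gamma_r$, whose length is at most $2\pi r$, and integration in polar coordinates centered at the corner point $p$.

```latex
\ell(r)^2 = \Bigl(\int_{\gamma_r} |\Phi'(z)|\,ds\Bigr)^2
  \le \Bigl(\int_{\gamma_r} ds\Bigr)\Bigl(\int_{\gamma_r}|\Phi'(z)|^2\,ds\Bigr)
  \le 2\pi r \int_{\gamma_r}|\Phi'(z)|^2\,ds ,
\\
\delta^2 \log\frac{R}{\varepsilon}
  = \int_\varepsilon^R \frac{\delta^2}{r}\,dr
  \le \int_\varepsilon^R \frac{\ell(r)^2}{r}\,dr
  \le 2\pi \int_\varepsilon^R\!\!\int_{\gamma_r}|\Phi'(z)|^2\,ds\,dr
  = 2\pi \iint_{\Omega(\varepsilon,R)} |\Phi'(z)|^2\,dx\,dy .
```

Since $|\Phi'|^2$ is the Jacobian determinant of $\Phi$ and $\Phi$ is one-to-one by Theorem 6.1, the last integral is the area of the image, which lies in the unit disk and so is at most $\pi$.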


We next discuss a particularly important case of the Riemann mapping function
of a domain $\Omega$ whose boundary is not smooth. Namely, $\Omega$ is a subdomain of the
unit disk $D$, whose boundary consists of arcs of three circles, intersecting $\partial D$ at right
angles, at the points $\{1, e^{2\pi i/3}, e^{-2\pi i/3}\}$ (see Fig. 6.2). Denote by


(6.10)  $\Psi : \Omega \to D$


the Riemann mapping function that preserves each of these three points. By
Proposition 6.2, $\Psi$ extends to a homeomorphism of $\overline{\Omega}$ onto $\overline{D}$. If we denote by


$\varphi : D \to U = \{z \in \mathbb{C} : \mathrm{Im}\, z > 0\}$


the linear fractional transformation of $D$ onto $U$ with the property that $\varphi(1) = 0$,
$\varphi(e^{2\pi i/3}) = 1$, and $\varphi(e^{-2\pi i/3}) = \infty$, then we have


(6.11)  $\tilde{\Psi} = \varphi \circ \Psi \circ \varphi^{-1} : \tilde{\Omega} \to U$,


where $\tilde{\Omega} = \varphi(\Omega)$ is pictured in Fig. 6.3. $\tilde{\Psi}$ extends to map $\partial\tilde{\Omega}$ continuously onto
the real axis, with $\tilde{\Psi}(0) = 0$ and $\tilde{\Psi}(1) = 1$.
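
The linear fractional transformation $\varphi$ is determined by its three prescribed values, and it can be written as a cross-ratio. The following Python sketch is only an illustration: the function name `phi` and the variables `w`, `w_bar` are introduced here and do not come from the text. It evaluates the cross-ratio formula and checks numerically that the unit circle goes to the real axis and that the center of the disk lands in the upper half-plane.

```python
import cmath
import math

# The three marked boundary points are 1, w, and w_bar.
w = cmath.exp(2j * math.pi / 3)
w_bar = w.conjugate()


def phi(z):
    """Cross-ratio map with phi(1) = 0, phi(w) = 1, and a pole at w_bar."""
    return (z - 1) * (w - w_bar) / ((z - w_bar) * (w - 1))


if __name__ == "__main__":
    print("phi(1)  =", phi(1))
    print("phi(w)  =", phi(w))
    print("phi(0)  =", phi(0))  # image of the center of the disk
    for theta in (0.5, 1.0, 3.0):
        z = cmath.exp(1j * theta)  # a point on the unit circle
        print("theta =", theta, " Im phi =", phi(z).imag)
```

Because the three points $1, e^{2\pi i/3}, e^{-2\pi i/3}$ are traversed counterclockwise and are sent to $0, 1, \infty$ in increasing order along the real line, the interior of the disk is sent to the upper half-plane rather than the lower one.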

Now the Schwarz reflection principle can clearly be applied to $\tilde{\Psi}$, reflecting
across the vertical lines in $\partial\tilde{\Omega}$, to extend $\tilde{\Psi}$ to the regions $O_2$ and $O_3$ in Fig. 6.3.
A variant extends $\tilde{\Psi}$ to $O_1$. Note that this extension maps the closure in $U$ of
$\tilde{\Omega} \cup O_1 \cup O_2 \cup O_3$ onto $\mathbb{C} \setminus \{0, 1\}$. Now we can iterate this reflection process
indefinitely, obtaining


(6.12)  $\tilde{\Psi} : U \to \mathbb{C} \setminus \{0, 1\}$,


which is a holomorphic covering map. Composing on the right with $\varphi$ gives the
holomorphic covering map


(6.13)  $\sigma = \tilde{\Psi} \circ \varphi : D \to \mathbb{C} \setminus \{0, 1\}$.


FIGURE 6.3 Upper Half-Space Versions


The existence of such a covering map is very significant. One simple 
application is to the following result of Picard. 


Proposition 6.3. If $u : \mathbb{C} \to \mathbb{C} \setminus \{0, 1\}$ is holomorphic, then it is constant.


Proof. Using $\sigma$, we lift $u$ to a holomorphic function $v : \mathbb{C} \to D$, such that
$u = \sigma \circ v$. But Liouville's theorem implies that $v$ is constant.


With some more effort, one can prove the following result of Montel. 


Proposition 6.4. If $F$ is a family of holomorphic maps $u_\alpha : D \to S^2 = \mathbb{C} \cup \{\infty\}$
with range in $S^2 \setminus \{0, 1, \infty\}$, then $F$ is equicontinuous.


We leave the proof to the reader, with the comment that the trick is to make a
careful choice of lifts $v_\alpha : D \to D$.


Exercises 


1. With how little regularity of $\partial\Omega$ can you show that $G_0 \in C^1(\overline{\Omega})$? With how little
regularity can you show that $H_0 \in C^1(\overline{\Omega})$? When can you show that $\Phi : \overline{\Omega} \to \overline{D}$ is a
$C^1$-diffeomorphism?

2. Extend Proposition 6.2 to the case where $\partial\Omega$ is assumed only to be a Jordan curve.

3. Let $\Omega$ be the following (unbounded) region in $\mathbb{C}$:


$\Omega = \{z = x + iy : 0 < x < 1,\ 0 < y < x^{-1}\}$.


Consider a Riemann mapping function $\Phi : \Omega \to D$, with inverse $\Phi^{-1} : D \to \Omega$.
Show that $\mathrm{Re}\,\Phi^{-1}$ is continuous on $\overline{D}$, while $\mathrm{Im}\,\Phi^{-1}$ is unbounded.

4. Let $\Omega$ be a simply connected, unbounded region in $\mathbb{C}$, with nonempty complement.
Show that, given $z_0 \in \mathbb{C} \setminus \Omega$, the function $(z - z_0)^{1/2}$ can be defined on $\Omega$ as a
one-to-one holomorphic map of $\Omega$ onto a domain $O \subset \mathbb{C}$ whose complement has nonempty
interior.

(Hint: If $w \in O$, then $-w \notin O$.)
Using this, extend the Riemann mapping theorem to all such $\Omega$.
(Hint: Use an inversion to map $O$ to a bounded region.)

5. State and prove an analogue of Theorem 4.3 for bounded $\Omega \subset \mathbb{C}$ of the form $\Omega =
O_1 \setminus \overline{O}_2$, where $O_1$ and $O_2$ are simply connected, and $\overline{O}_2 \subset O_1$, in case $\partial\Omega$ is rough.

6. In the proof of Proposition 6.2, it was stated that it sufficed to show that $I_1$ and $I_2$ must
meet. Why can't they overlap?


7. The Neumann boundary problem 


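Let $\overline{\Omega}$ be a connected, compact manifold with nonempty smooth boundary, as in
§1. We want to study the existence and regularity of solutions to the Neumann
problem


(7.1)  $\Delta u = f$ on $\Omega$,   $\frac{\partial u}{\partial\nu} = 0$ on $\partial\Omega$.


Recall that, by Green's formula, if $u$ and $v$ are smooth on $\overline{\Omega}$,

(7.2)  $(-\Delta u, v) = (du, dv) - \int_{\partial\Omega} \overline{v}\,\frac{\partial u}{\partial\nu}\,dS$.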


By continuity, this identity holds for $u \in H^2(\Omega)$, $v \in H^1(\Omega)$. The boundary
integral vanishes if $\partial u/\partial\nu = 0$ on $\partial\Omega$, so we are motivated to consider the
operator

(7.3)  $L_N : H^1(\Omega) \to H^1(\Omega)^*$

defined by

(7.4)  $(L_N u, v) = (du, dv)$,   $u, v \in H^1(\Omega)$.

The operator $L_N$ is not injective, since it annihilates constants, but

(7.5)  $((L_N + 1)u, u) = \|du\|_{L^2}^2 + \|u\|_{L^2}^2$,


so we have


Proposition 7.1. The map


$L_N + 1 : H^1(\Omega) \to H^1(\Omega)^*$

is one-to-one and onto.

As in §1, it is clear that the inverse map

(7.6)  $T_N : H^1(\Omega)^* \to H^1(\Omega)$


restricts to a compact, self-adjoint operator on $L^2(\Omega)$, so there is an orthonormal
basis $u_j$ of $L^2(\Omega)$ consisting of eigenfunctions of $T_N$:


(7.7)  $T_N u_j = \mu_j u_j$,   $\mu_j \searrow 0$,   $u_j \in H^1(\Omega)$.


It follows that


(7.8)  $L_N u_j = \lambda_j u_j$,   $\lambda_j = \frac{1}{\mu_j} - 1 \nearrow \infty$.
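
A one-dimensional example makes (7.7)-(7.8) concrete. The following is a sketch under the assumption that $\Omega = (0, \pi)$ with the flat metric, so that $\Delta = d^2/dx^2$ and the Neumann condition reads $u'(0) = u'(\pi) = 0$.

```latex
u_0(x) = \pi^{-1/2}, \qquad u_j(x) = (2/\pi)^{1/2}\cos jx \quad (j \ge 1),
\\
-u_j'' = j^2 u_j, \qquad u_j'(0) = u_j'(\pi) = 0,
\\
\lambda_j = j^2, \qquad \mu_j = \frac{1}{1+j^2} \searrow 0 .
```

The eigenvalue $\lambda_0 = 0$ corresponds to the constants annihilated by $L_N$, and the relation $\lambda_j = 1/\mu_j - 1$ can be read off directly.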


Note that since (7.4) is equal to $(-\Delta u, v)$ for $u \in H^1(\Omega)$, $v \in C_0^\infty(\Omega)$, we have

(7.9)  $\iota(L_N u) = -\Delta u$ in $\mathcal{D}'(\Omega)$,


for $u \in H^1(\Omega)$, where $\iota : H^1(\Omega)^* \to \mathcal{D}'(\Omega)$ is the adjoint of the inclusion
$C_0^\infty(\Omega) \to H^1(\Omega)$, but $\iota$ is not injective. Nevertheless, (7.8) implies that, in the
distributional sense, the eigenvectors $u_j$ satisfy


(7.10)  $\Delta u_j = -\lambda_j u_j$ on $\Omega$.


We will establish regularity theorems that imply that each $u_j$ belongs to
$C^\infty(\overline{\Omega})$ and satisfies the Neumann boundary condition. The proof of such
regularity results is just slightly more elaborate than the proof of Theorem 1.3. We
divide it into two parts.


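Proposition 7.2. Given $f \in L^2(\Omega)$, $u = T_N f$ satisfies


(7.11)  $u \in H^2(\Omega)$,   $\frac{\partial u}{\partial\nu}\Big|_{\partial\Omega} = 0$,

and

(7.12)  $(-\Delta + 1)u = f$.

Furthermore, we have the estimate

(7.13)  $\|u\|_{H^2}^2 \le C\|\Delta u\|_{L^2}^2 + C\|u\|_{H^1}^2$,


for all $u$ satisfying (7.11).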
Proof. First we establish the estimate

(7.14)  $\|u\|_{H^1}^2 \le C\|L_N u\|_{H^{1*}}^2 + C\|u\|_{L^2}^2$.

Indeed, by (7.4) and Cauchy's inequality, we have


(7.15)  $\|u\|_{H^1}^2 = (L_N u, u) + \|u\|_{L^2}^2 \le \|L_N u\|_{H^{1*}} \cdot \|u\|_{H^1} + \|u\|_{L^2}^2 \le \frac{1}{2}\|u\|_{H^1}^2 + \frac{1}{2}\|L_N u\|_{H^{1*}}^2 + \|u\|_{L^2}^2$,

which readily gives (7.14). To proceed, we localize to coordinate patches, as in the
proof of Theorem 1.3. Suppose $\chi \in C^\infty(\overline{\Omega})$ is supported in a coordinate patch,
and either $\chi \in C_0^\infty(\Omega)$ or $\partial\chi/\partial\nu = 0$ on $\partial\Omega$. We need to analyze the commutator
$[L_N, M_\chi]$, where $M_\chi u = \chi u$. Note that $M_\chi$ acts continuously on $H^1(\Omega)$ and on
$H^1(\Omega)^*$. For $u, v \in H^1(\Omega)$,

(7.16)  $(L_N M_\chi u, v) = (d(\chi u), dv) = ((d\chi)u, dv) + (\chi\,du, dv)$,

while

(7.17)  $(M_\chi L_N u, v) = (L_N u, \chi v) = (du, (d\chi)v) + (du, \chi\,dv)$,

so


(7.18)  $([L_N, M_\chi]u, v) = ((d\chi)u, dv) - (\langle d\chi, du\rangle, v)$.


We can integrate the first term on the right by parts, using formula (9.17) of
Chap. 2, extended to $u, v \in H^1(\Omega)$. The boundary integral is


$\int_{\partial\Omega} \frac{\partial\chi}{\partial\nu}\,u\,\overline{v}\,dS = 0$,


by the hypothesis $\partial\chi/\partial\nu = 0$ on $\partial\Omega$, so we have

(7.19)  $[L_N, M_\chi]u = d^*((d\chi)u) - \langle d\chi, du\rangle = -(\Delta\chi)u - 2\langle d\chi, du\rangle$,

for $u \in H^1(\Omega)$, in view of the identity

(7.20)  $d^*(u\alpha) = u\,d^*\alpha - \langle du, \alpha\rangle$


when $u$ is a scalar function and $\alpha$ a 1-form. (Compare formula (2.19) of Chap. 2,
and also Exercise 7 in §10 of Chap. 2.) In particular,

(7.21)  $[L_N, M_\chi] : H^1(\Omega) \to L^2(\Omega)$.

Consequently, it suffices to prove Proposition 7.2 for $u$ supported in a coordinate
patch.
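
For the reader who wants the second equality in (7.19) spelled out, here is a sketch of the computation. It applies the identity (7.20) with $\alpha = d\chi$ and assumes the sign convention $\Delta = -d^*d$ on scalar functions, which is the one implicit in (7.4).

```latex
d^*\bigl((d\chi)u\bigr)
  = u\, d^*d\chi - \langle du, d\chi\rangle
  = -(\Delta\chi)u - \langle d\chi, du\rangle ,
\\
[L_N, M_\chi]u
  = d^*\bigl((d\chi)u\bigr) - \langle d\chi, du\rangle
  = -(\Delta\chi)u - 2\langle d\chi, du\rangle .
```

The right side involves only $u$ and its first derivatives, multiplied by smooth coefficients, which is the reason the commutator maps $H^1(\Omega)$ to $L^2(\Omega)$.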


We proceed by applying the estimate (7.14) to $D_{j,h}u$, as in the proof of Theorem
1.3, where $1 \le j \le n - 1$ if $u$ is supported in a coordinate patch with boundary.
Of course, interior regularity results proved in §1 apply here. We have


(7.22)  $\|D_{j,h}u\|_{H^1}^2 \le C\|L_N D_{j,h}u\|_{H^{1*}}^2 + C\|D_{j,h}u\|_{L^2}^2 \le C\|D_{j,h}L_N u\|_{H^{1*}}^2 + C\|[L_N, D_{j,h}]u\|_{H^{1*}}^2 + C\|D_{j,h}u\|_{L^2}^2$,


where $D_{j,h}$ is defined on $H^{1*}$ in a natural fashion by duality. We need to estimate
$[L_N, D_{j,h}]u$. We have


(7.23)  $(L_N D_{j,h}u, v) = (dD_{j,h}u, dv) = (D^{(1)}_{j,h}\,du, dv)$,


where, if translation $x \mapsto x + he_j$ is denoted $\tau_{j,h}$, we define $D^{(1)}_{j,h}$ on 1-forms as


(7.24)  $D^{(1)}_{j,h}\beta = h^{-1}(\tau_{j,h}^*\beta - \beta)$.


In order to analyze $(D_{j,h}L_N u, v)$, we simplify the calculation by requiring that
the coordinate map of a piece of $\Omega$ to a part of $\mathbb{R}^n_+$ preserve volume elements,
which is easily arranged. Then the adjoint of $D_{j,h}$ is $D_{j,-h}$, so


(7.25)  $(D_{j,h}L_N u, v) = (L_N u, D_{j,-h}v) = (du, D^{(1)}_{j,-h}\,dv)$,

and hence

(7.26)  $([L_N, D_{j,h}]u, v) = ([D^{(1)}_{j,h} - (D^{(1)}_{j,-h})^*]\,du, dv)$.

We have a uniform bound on the right side of (7.26):

Lemma 7.3. If $\beta$ is a 1-form on $\Omega$,


$\|[D^{(1)}_{j,h} - (D^{(1)}_{j,-h})^*]\beta\|_{L^2} \le C\|\beta\|_{L^2}$.


Proof. This is similar to Lemma 1.4, and we leave it as an exercise.


We note that, as $h \to 0$, $D^{(1)}_{j,h}\varphi$ tends to the Lie derivative $L_{\partial_j}\varphi$, for a 1-form
$\varphi$. Thus the uniform estimate is related to the fact that the difference between $L_{\partial_j}$
and $-L_{\partial_j}^*$ is a zero-order operator. Compare with formula (3.43) of Chap. 2.

Applying the lemma to (7.26), we have


(7.27)  $\|[L_N, D_{j,h}]u\|_{H^{1*}} \le C\|du\|_{L^2}$.

Hence, (7.22) yields

(7.28)  $\|D_{j,h}u\|_{H^1}^2 \le C\|D_{j,h}L_N u\|_{H^{1*}}^2 + C\|u\|_{H^1}^2$.


Given $L_N u = f_1 \in L^2(\Omega)$, we have $(D_{j,h}f_1, v) = (f_1, D_{j,-h}v)$ for $v \in H^1(\Omega)$,
and hence


(7.29)  $\|D_{j,h}f_1\|_{H^{1*}}^2 \le C\|f_1\|_{L^2}^2$,

so we get

(7.30)  $\|D_{j,h}u\|_{H^1}^2 \le C\|f_1\|_{L^2}^2 + C\|u\|_{H^1}^2$,


provided $L_N u = f_1 \in L^2$. Letting $h \to 0$, we get $D_j u \in H^1(\Omega)$, and an
accompanying norm estimate.

As in §1, the rest of the proof that $u \in H^2(\Omega)$ comes down to showing $D_n^2 u \in
L^2(\Omega)$. But by (7.9) we have $\Delta u = -f_1$ in the distributional sense, and


(7.31)  $g^{nn}(x)D_n^2 u = -f_1 - \sum_{(j,k)\neq(n,n)} g^{jk}(x)D_jD_k u - \sum_j b^j(x)D_j u$,


so the proof that $u \in H^2(\Omega)$ is complete.
It remains to show that $u$ satisfies the Neumann boundary condition. However,
for $u = T_N f \in H^2(\Omega)$, $v \in H^1(\Omega)$, the identity (7.2) holds, so


(7.32)  $(f, v) = (L_N u, v) + (u, v) = (du, dv) + (u, v) = ((-\Delta + 1)u, v) + \int_{\partial\Omega} \overline{v}\,\frac{\partial u}{\partial\nu}\,dS$.


This holds for all $v \in H^1(\Omega)$. Applying it for arbitrary $v \in C_0^\infty(\Omega)$ yields
$(-\Delta + 1)u = f$. Hence


$(f, v) = (f, v) + \int_{\partial\Omega} \overline{v}\,\frac{\partial u}{\partial\nu}\,dS$,


for all $v \in H^1(\Omega)$. This forces $\partial u/\partial\nu$ to vanish on $\partial\Omega$. The proof of Proposition
7.2 is complete.

To complete the parallel with Theorem 1.3, we have the following.


Proposition 7.4. For $k = 1, 2, 3, \ldots$, given $f_1 \in H^k(\Omega)$, a function $u \in
H^{k+1}(\Omega)$ satisfying


(7.33)  $\Delta u = f_1$ on $\Omega$,   $\frac{\partial u}{\partial\nu} = 0$ on $\partial\Omega$

belongs to $H^{k+2}(\Omega)$, and we have an estimate

(7.34)  $\|u\|_{H^{k+2}}^2 \le C\|\Delta u\|_{H^k}^2 + C\|u\|_{H^{k+1}}^2$,

for all $u \in H^{k+2}(\Omega)$ such that $\partial u/\partial\nu = 0$ on $\partial\Omega$.


Proof. We proceed from Proposition 7.2 inductively, using cut-offs $\chi$ and
difference operators $D_{j,h}$, as in the proof of Theorem 1.3. We need to require
$\partial\chi/\partial\nu = 0$ on $\partial\Omega$, so $\chi u$ satisfies the Neumann boundary condition. In order
for $D_{j,h}u$ to satisfy the Neumann boundary condition, this time pick coordinate
charts so that the normal $\nu$ to $\partial\Omega$ is mapped to $\partial/\partial x_n$. Then the proof works out
just as in Theorem 1.3.


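One can also analyze nonhomogeneous boundary problems, such as


(7.35)  $(-\Delta + 1)u = f$ in $\Omega$,   $\frac{\partial u}{\partial\nu} = g$ on $\partial\Omega$.

Given $g \in H^{k+1/2}(\partial\Omega)$, $k = 0, 1, 2, \ldots$, you can pick $h \in H^{k+2}(\Omega)$ such that
$\partial h/\partial\nu = g$ on $\partial\Omega$, and then write $u = v + h$, where $v$ solves


(7.36)  $(-\Delta + 1)v = f + (\Delta - 1)h$ in $\Omega$,   $\frac{\partial v}{\partial\nu} = 0$ on $\partial\Omega$.

Then $v \in H^{k+2}(\Omega)$ if $f \in H^k(\Omega)$, so also $u \in H^{k+2}(\Omega)$, and one has the
estimate


(7.37)  $\|u\|_{H^{k+2}(\Omega)}^2 \le C\|\Delta u\|_{H^k(\Omega)}^2 + C\Bigl\|\frac{\partial u}{\partial\nu}\Bigr\|_{H^{k+1/2}(\partial\Omega)}^2 + C\|u\|_{H^{k+1}(\Omega)}^2$,


valid for all $u \in H^{k+2}$. Let us formally record this as the following generalization
of Proposition 7.4.


Proposition 7.5. For $k = 0, 1, 2, \ldots$, given $f \in H^k(\Omega)$, $g \in H^{k+1/2}(\partial\Omega)$, there
is a unique solution $u \in H^{k+2}(\Omega)$ to (7.35), and the estimate (7.37) holds.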


We note that to prove this result, one could bypass Proposition 7.4 and proceed
as follows. For $k = 0$, the construction (7.36) gets the result as a consequence of
Proposition 7.2. Then you can proceed by induction on $k$, using cut-offs and
difference operators as in the proof of Theorem 1.3. The (slight) advantage of doing
this is that one does not need to preserve the homogeneous boundary condition, so
there is no need to arrange $\partial\chi/\partial\nu = 0$ on $\partial\Omega$ or use coordinate charts mapping $\nu$
to $\partial/\partial x_n$. In the case of more elaborate boundary conditions, such as considered
in §9, the flexibility gained by this sort of strategy will be of greater importance.

Returning to the original Neumann boundary problem (7.1), we see that the fact
that 0 is an eigenvalue in (7.8), with eigenspace consisting of constants, implies

Proposition 7.6. Given $f \in L^2(\Omega)$, the boundary problem (7.1) has a solution
$u \in H^2(\Omega)$ if and only if


(7.38)  $\int_\Omega f(x)\,dV(x) = 0$.


Provided this condition holds, the solution $u$ is unique up to an additive constant
and belongs to $H^{k+2}(\Omega)$ if $f \in H^k(\Omega)$, $k \ge 0$.


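We have an extension of this for the nonhomogeneous boundary problem


(7.39)  $\Delta u = f$ on $\Omega$,   $\frac{\partial u}{\partial\nu} = g$ on $\partial\Omega$.

Note that if we set $v = 1$ in (7.2), we get

(7.40)  $\int_\Omega (\Delta u)\,dV = \int_{\partial\Omega} \frac{\partial u}{\partial\nu}\,dS$.


Thus a necessary condition for (7.39) to have a solution is

(7.41)  $\int_\Omega f(x)\,dV(x) = \int_{\partial\Omega} g(x)\,dS(x)$.


This condition is also sufficient.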


Proposition 7.7. If $k \ge 0$, $f \in H^k(\Omega)$, and $g \in H^{k+1/2}(\partial\Omega)$, then (7.39) has a
solution $u \in H^{k+2}(\Omega)$ if and only if (7.41) holds.


Proof. Define the linear operator


(7.42)  $T : H^{k+2}(\Omega) \to H^k(\Omega) \oplus H^{k+1/2}(\partial\Omega)$,

(7.43)  $Tu = \Bigl(\Delta u, \frac{\partial u}{\partial\nu}\Big|_{\partial\Omega}\Bigr)$.


The estimate (7.37) implies that $T$ has closed range, by Proposition 6.7 in
Appendix A. We know that the kernel of $T$ consists of constants. The identity
(7.41) implies that


(7.44)  $(-1, 1) \in C^\infty(\overline{\Omega}) \oplus C^\infty(\partial\Omega)$


is orthogonal to the range $R(T)$. It remains to show that this is all of the orthogonal
complement of $R(T)$, which follows if we show that $T$ in (7.42) is Fredholm
of index zero. Now $T$ differs from


(7.45)  $T^\# : H^{k+2}(\Omega) \to H^k(\Omega) \oplus H^{k+1/2}(\partial\Omega)$,   $T^\# u = \Bigl((\Delta - 1)u, \frac{\partial u}{\partial\nu}\Big|_{\partial\Omega}\Bigr)$

by the operator $Ku = (-u, 0)$, which is compact, by Rellich's theorem. Proposition
7.5 implies that $T^\#$ is an isomorphism, and by Corollary 7.5 of Appendix
A, this implies that $T$ is Fredholm of index zero. This completes the proof of
Proposition 7.7.
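
The compatibility condition (7.41) is easy to test numerically in one dimension. The sketch below is an illustration only: it assumes $\Omega = (0, 1)$, where $\partial\Omega = \{0, 1\}$ and the outward normal derivative is $u'(1)$ at the right endpoint and $-u'(0)$ at the left one, and it uses a test function $u(x) = x^2 + \cos \pi x$ chosen here for convenience. The helper names are hypothetical.

```python
import math


def midpoint_integral(func, a, b, n=10000):
    """Approximate the integral of func over [a, b] by the midpoint rule."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))


def u_prime(x):
    """Derivative of the test function u(x) = x**2 + cos(pi x)."""
    return 2 * x - math.pi * math.sin(math.pi * x)


def f(x):
    """f = u'' for the same test function."""
    return 2 - math.pi ** 2 * math.cos(math.pi * x)


# Left side of (7.41): integral of f over the interval.
interior = midpoint_integral(f, 0.0, 1.0)
# Right side of (7.41): sum of outward normal derivatives at the two endpoints.
boundary = u_prime(1.0) + (-u_prime(0.0))

if __name__ == "__main__":
    print("integral of f        :", interior)
    print("boundary flux of u   :", boundary)
```

Changing `f` without changing the boundary data breaks the equality, which is the one-dimensional shadow of the fact that the range of $T$ has codimension one.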


Exercises 


1. Given two Riemannian manifolds $M$ and $N$ with boundary, $p \in \partial M$, and $q \in \partial N$,
show that there exists a diffeomorphism $\Phi$ from a neighborhood of $p$ to a neighborhood
of $q$, $\Phi(p) = q$, which preserves volumes. (Hint: Set up a first order PDE for one
component of $\Phi$.)

Use this to justify (7.25). Show that $\Phi$ can also be arranged to preserve unit normals to
the boundaries.

2. Give a detailed proof of Lemma 7.3. 

3. If $\overline{\Omega}$ is a compact Riemannian manifold with boundary, show that the Robin boundary
condition

$\frac{\partial u}{\partial\nu} = a(x)u(x)$, for $x \in \partial\Omega$,

given $a \in C^\infty(\partial\Omega)$, has the regularity properties established in this section for the
Neumann condition (which is the $a = 0$ case). (Hint: Make use of (7.37).)
If $a$ is real-valued, show that $\Delta$ is self-adjoint on $L^2(\Omega)$, with domain


$\mathcal{D}(\Delta) = \{u \in H^2(\Omega) : \partial_\nu u = a(x)u$ on $\partial\Omega\}$.


Reconsider this problem when reading §12.

4. Let $\Omega \subset \mathbb{R}^n$ be bounded, but do not assume $\partial\Omega$ is smooth. Note that the map $T_N$ in
(7.6) is well defined. Assume there exist smoothly bounded $\Omega_j \nearrow \Omega$ satisfying the
following hypotheses:


(i) There exist extension maps $E_j : H^1(\Omega_j) \to H^1(\Omega)$ of uniformly bounded norm.
(ii) The inclusion $H^1(\Omega) \to L^2(\Omega)$ is compact.
(iii) $\mathrm{Meas}(\Omega \setminus \Omega_j) \to 0$.


Then show that if $f \in L^2(\Omega)$, $f_j = f|_{\Omega_j}$, we have


$T_{Nj}f_j \to T_N f$ in $L^2(\Omega)$,


where $T_{Nj}$ is as in (7.6), with $\Omega$ replaced by $\Omega_j$, and we set $T_{Nj}f_j = 0$ on $\Omega \setminus \Omega_j$.
More information on this type of problem is given in [RT].


8. The Hodge decomposition and harmonic forms 


Let $M$ be a compact Riemannian manifold, without boundary. Recall from
Chap. 2 the Hodge Laplacian on $k$-forms,

(8.1)  $\Delta : C^\infty(M, \Lambda^k) \to C^\infty(M, \Lambda^k)$,

defined by

(8.2)  $-\Delta = (d + \delta)^2 = d\delta + \delta d$,


where $d$ is the exterior derivative operator and $\delta$ its formal adjoint, satisfying

(8.3)  $(du, v) = (u, \delta v)$,


for a smooth $k$-form $u$ and $(k+1)$-form $v$; $\delta = 0$ on 0-forms. The local coordinate
expression


(8.4)  $\Delta u = g^{j\ell}(x)\,\partial_j\partial_\ell u + Y_k u$,


where $Y_k$ are first-order differential operators, derived in (10.23) of Chap. 2,
indicates that the Hodge Laplacian on $k$-forms is amenable to an analysis similar
to that for the Laplace operator on functions in §1. Note that, for smooth $k$-forms,
by (8.3),


(8.5)  $-(\Delta u, v) = (du, dv) + (\delta u, \delta v)$.

Now we have $\Delta$ operating on Sobolev spaces; in particular,

(8.6)  $\Delta : H^1(M, \Lambda^k) \to H^{-1}(M, \Lambda^k)$,


and (8.5) holds for $u, v \in H^1(M, \Lambda^k)$. We want to study invertibility of the
operator $-\Delta + C_1$, where $C_1$ is a convenient positive constant, and to produce
consequences of this. Our first result is the following analogue of the estimates
(1.5), (1.49), and (7.15).


Proposition 8.1. There exist positive constants $C_0$ and $C_1$ such that

(8.7)  $-(\Delta u, u) \ge C_0\|u\|_{H^1}^2 - C_1\|u\|_{L^2}^2$

for a $k$-form $u \in H^1$.

Proof. Cover $M$ with coordinate patches $U_j$, and pick $\varphi_j \in C_0^\infty(U_j)$ such that
$\sum_j \varphi_j^2 = 1$, so

(8.8)  $-(\Delta u, u) = -\sum_j (\Delta(\varphi_j^2 u), u) = -\sum_j (\Delta(\varphi_j u), \varphi_j u) + (Yu, u)$,

where $Y$ is a first-order differential operator, built from the commutators $[\Delta, \varphi_j]$. The local coordinate
formula (8.4) and integration by parts yield


(8.9)  $-(\Delta(\varphi_j u), \varphi_j u) \ge C\|\varphi_j u\|_{H^1}^2 - C'\|\varphi_j u\|_{L^2}^2$,

and summing gives

(8.10)  $-(\Delta u, u) \ge C_0\|u\|_{H^1}^2 - C\|u\|_{L^2}^2 - C_2\|u\|_{H^1}\|u\|_{L^2}$.


We can dominate the last term in (8.10) by $\varepsilon\|u\|_{H^1}^2 + (C/\varepsilon)\|u\|_{L^2}^2$, and absorb
$\varepsilon\|u\|_{H^1}^2$ into the first term on the right side of (8.10), to prove (8.7).


From here, a number of results follow just as in §§1 and 7. We have the estimate
$\|(-\Delta + C_1)u\|_{H^{-1}} \ge C_0\|u\|_{H^1}$, and hence


(8.11)  $-\Delta + C_1 : H^1(M, \Lambda^k) \to H^{-1}(M, \Lambda^k)$

is injective with closed range. The annihilator of the range, in $H^{-1*} = H^1$,
belongs to the kernel of $-\Delta + C_1$, and so is zero, so the map (8.11) is bijective.
We have a two-sided inverse


(8.12)  $T : H^{-1}(M, \Lambda^k) \to H^1(M, \Lambda^k)$.


As in (1.8), $T = T^*$, and by Rellich's theorem $T$ is a compact self-adjoint operator
on $L^2(M, \Lambda^k)$. The identity (8.5) implies


(8.13)  $0 < (Tu, u) \le C_1^{-1}\|u\|_{L^2}^2$,


for nonzero $u$. The space $L^2(M, \Lambda^k)$ has an orthonormal basis $u_j^{(k)}$ consisting of
eigenfunctions of $T$:

(8.14)  $Tu_j^{(k)} = \mu_j^{(k)}u_j^{(k)}$;   $u_j^{(k)} \in H^1(M, \Lambda^k)$.

By (8.13), we have

(8.15)  $0 < \mu_j^{(k)} \le C_1^{-1}$.


For each $k$, we can order the $u_j^{(k)}$ so that $\mu_j^{(k)} \searrow 0$, as $j \nearrow \infty$. It follows that


(8.16)  $-\Delta u_j^{(k)} = \lambda_j^{(k)}u_j^{(k)}$,

with

(8.17)  $\lambda_j^{(k)} = \frac{1}{\mu_j^{(k)}} - C_1$,

so

(8.18)  $\lambda_j^{(k)} \ge 0$,   $\lambda_j^{(k)} \nearrow \infty$ as $j \to \infty$.


The local regularity results proved in Theorem 1.3 apply to $\Delta$, by (8.4), and
since $M$ has no boundary, we conclude that


(8.19)  $u_j^{(k)} \in C^\infty(M, \Lambda^k)$.

In particular, the 0-eigenspace of $\Delta$ on $k$-forms is finite-dimensional and consists
of smooth $k$-forms. These are called harmonic forms. We denote this 0-eigenspace
by $\mathcal{H}_k$. By (8.5), we see that

(8.20)  $u \in \mathcal{H}_k \Longleftrightarrow u \in C^\infty(M, \Lambda^k)$, $du = 0$, and $\delta u = 0$ on $M$.
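
The simplest illustration of the eigenvalue picture and of the criterion for harmonic forms is the circle. The lines below are a sketch assuming $M = S^1 = \mathbb{R}/2\pi\mathbb{Z}$ with its flat metric, so that $\delta(f\,dx) = -f'$ and $-\Delta$ acts as $-d^2/dx^2$ on the coefficient of both 0-forms and 1-forms.

```latex
-\Delta\, e^{ijx} = j^2 e^{ijx}, \qquad
-\Delta\,(e^{ijx}\,dx) = j^2 e^{ijx}\,dx \qquad (j \in \mathbb{Z}),
\\
\mathcal{H}_0 = \{\text{constants}\}, \qquad
\mathcal{H}_1 = \{c\,dx : c \in \mathbb{C}\} .
```

The form $dx$ is both closed and co-closed but is not the differential of any function on $S^1$, so it is a harmonic form that is not exact.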


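Denote by $P_k$ the orthogonal projection of $L^2(M, \Lambda^k)$ onto $\mathcal{H}_k$. We also define a
continuous linear map


(8.21)  $G : L^2(M, \Lambda^k) \to L^2(M, \Lambda^k)$

by

(8.22)  $Gu_j^{(k)} = 0$ if $\lambda_j^{(k)} = 0$,   $Gu_j^{(k)} = \frac{1}{\lambda_j^{(k)}}\,u_j^{(k)}$ if $\lambda_j^{(k)} > 0$.

Hence $-\Delta Gu_j^{(k)} = (I - P_k)u_j^{(k)}$. Since $\Delta : L^2(M, \Lambda^k) \to H^{-2}(M, \Lambda^k)$
continuously, it follows that

(8.23)  $-\Delta Gu = (I - P_k)u$, for $u \in L^2(M, \Lambda^k)$.

Now the local regularity implies

(8.24)  $G : L^2(M, \Lambda^k) \to H^2(M, \Lambda^k)$,

and more generally

(8.25)  $G : H^j(M, \Lambda^k) \to H^{j+2}(M, \Lambda^k)$,


for $j \ge 0$. Using (8.2), we write (8.23) in the following form, known as the Hodge
decomposition.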


Proposition 8.2. Given $u \in H^j(M, \Lambda^k)$, we have

(8.26)  $u = d\delta Gu + \delta dGu + P_k u$.

The three terms on the right are mutually orthogonal in $L^2(M, \Lambda^k)$.


Proof. Only the orthogonality remains to be established. But if $u \in H^1(M, \Lambda^{k-1})$
and $v \in H^1(M, \Lambda^{k+1})$, then


(8.27)  $(du, \delta v) = (d^2 u, v) = 0$,

and if $w \in \mathcal{H}_k$, so $dw = \delta w = 0$, we have

(8.28)  $(du, w) = (u, \delta w) = 0$ and $(\delta v, w) = (v, dw) = 0$,


so the orthogonality is established.


A smooth $k$-form $u$ is said to be exact if $u = dv$ for some smooth $(k-1)$-form
$v$, and closed if $du = 0$. Since $d^2 = 0$, every exact form is closed:


(8.29)  $\mathcal{E}^k(M) \subset \mathcal{C}^k(M)$,

where $\mathcal{E}^k(M)$ and $\mathcal{C}^k(M)$ respectively denote the spaces of exact and closed
$k$-forms. Similarly, a $k$-form $u$ is said to be co-exact if $u = \delta v$ for some smooth
$(k+1)$-form $v$, and co-closed if $\delta u = 0$, and since $\delta^2 = 0$ we have


(8.30)  $\mathcal{CE}^k(M) \subset \mathcal{CC}^k(M)$,


with obvious notation. The deRham cohomology groups are defined as quotient
spaces:


(8.31)  $\mathcal{H}^k(M) = \mathcal{C}^k(M)/\mathcal{E}^k(M)$.


The following is one of the most important consequences of the Hodge
decomposition (8.26).


Proposition 8.3. If $M$ is a compact Riemannian manifold, there is a natural
isomorphism


(8.32)  $\mathcal{H}^k(M) \approx \mathcal{H}_k$.

Proof. Since every harmonic form is closed, there is an injection

(8.33)  $j : \mathcal{H}_k \to \mathcal{C}^k(M)$,

which hence gives rise to a natural map


(8.34)  $J : \mathcal{H}_k \to \mathcal{H}^k(M)$,

by passing to the quotient (8.31). It remains to show that $J$ is bijective. The
orthogonality (8.28) shows that


$(\mathrm{Image}\ j) \cap \mathcal{E}^k(M) = 0$,


so $J$ is injective. Also (8.28) shows that if $u \in \mathcal{C}^k(M)$, then $\delta dGu = 0$ in (8.26),
so $u = d\delta Gu + P_k u$, or $u = P_k u$ mod $\mathcal{E}^k(M)$. Hence $J$ is surjective, and the
proof is complete.


Clearly, the space $\mathcal{H}^k(M)$ is independent of the Riemannian metric chosen for $M$.
Thus the dimension of the space $\mathcal{H}_k$ of harmonic $k$-forms is independent of the
metric. Indeed, since the isomorphism (8.32) is natural, we can say the following.
Given two Riemannian metrics $g$ and $g'$ for $M$, with associated spaces $\mathcal{H}_k$ and
$\mathcal{H}'_k$ of harmonic $k$-forms, there is a natural isomorphism $\mathcal{H}_k \approx \mathcal{H}'_k$. Otherwise
said, each $u \in \mathcal{H}_k$ is cohomologous to a unique $u' \in \mathcal{H}'_k$.

An important theorem of de Rham states that $\mathcal{H}^k(M)$, defined by (8.31), is
isomorphic to a certain singular cohomology group. See §C at the end of this
chapter for further discussion.

We now introduce the Hodge star operator


(8.35)  $* : C^\infty(M, \Lambda^k) \to C^\infty(M, \Lambda^{m-k})$   ($m = \dim M$),


in fact, a bundle map

$* : \Lambda^k T_x^* \to \Lambda^{m-k} T_x^*$,


which will be seen to relate $\delta$ to $d$. For (8.35) to be defined, we need to assume $M$
is an oriented Riemannian manifold, so there is a distinguished volume form


(8.36)  $\omega \in C^\infty(M, \Lambda^m)$.

Then the star operator (8.35) is uniquely specified by the relation

(8.37)  $u \wedge *v = \langle u, v\rangle\,\omega$,

where $\langle u, v\rangle$ is the inner product on $\Lambda^k T_x^*$, which was defined by (10.3) of
Chap. 2. In particular, it follows that $*1 = \omega$. Furthermore, if $\{e_1, \ldots, e_m\}$ is an
oriented, orthonormal basis of $T_x^* M$, we have


(8.38)  $*(e_{j_1} \wedge \cdots \wedge e_{j_k}) = (\mathrm{sgn}\ \pi)\, e_{\ell_1} \wedge \cdots \wedge e_{\ell_{m-k}}$,


where $\{j_1, \ldots, j_k, \ell_1, \ldots, \ell_{m-k}\} = \{1, \ldots, m\}$, and $\pi$ is the permutation
mapping the one ordered set to the other. It follows that


(8.39)  $** = (-1)^{k(m-k)}$ on $\Lambda^k(M)$,


where, for short, we are denoting $C^\infty(M, \Lambda^k)$ by $\Lambda^k(M)$. We denote (8.39) by
$w$, and also set


(8.40)  $\tilde{w} = (-1)^k$ on $\Lambda^k(M)$,

so

(8.41)  $d(u \wedge v) = du \wedge v + \tilde{w}(u) \wedge dv$.

It follows that if $u \in \Lambda^{k-1}(M)$, $v \in \Lambda^k(M)$, then $\tilde{w}(u) \wedge d{*v} = -u \wedge d{*\tilde{w}(v)}$,
so


(8.42)  $d(u \wedge *v) = du \wedge *v - u \wedge d * \tilde{w}(v) = du \wedge *v - u \wedge *\,w * d * \tilde{w}(v)$,


since $*\,w\,* = \mathrm{id}$, by (8.39). Integrating over $M$, since $\partial M = \emptyset$, we have, by
Stokes' formula, $\int_M d(u \wedge *v) = 0$ and hence


(8.43)  $(du, v) = \int_M du \wedge *v = \int_M u \wedge *\,w * d * \tilde{w}(v) = (u, w * d * \tilde{w}(v))$.

In other words,

(8.44)  $\delta = w * d * \tilde{w} = (-1)^{k(m-k)-m+k-1} * d\, *$ on $\Lambda^k(M)$.

Thus, by the characterization (8.20) of harmonic $k$-forms, we have

(8.45)  $* : \mathcal{H}_k \to \mathcal{H}_{m-k}$,


and, by (8.39), this map is an isomorphism. In view of Proposition 8.3, we have
the following special case of Poincaré duality.


Corollary 8.4. If $M$ is a compact, oriented Riemannian manifold, there is an
isomorphism of deRham cohomology groups


(8.46)  $\mathcal{H}^k(M) \approx \mathcal{H}^{m-k}(M)$.
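
The sign rules (8.38) and (8.39) can be checked mechanically on basis forms. The Python sketch below is an illustration under the assumption of an oriented orthonormal basis $e_1, \ldots, e_m$ of a single cotangent space; a basis $k$-form is represented by its increasing tuple of indices, and the function names are hypothetical, introduced only for this example.

```python
from itertools import combinations


def perm_sign(seq):
    """Sign of the permutation sending sorted(seq) to seq, via inversion count."""
    inversions = sum(
        1
        for a in range(len(seq))
        for b in range(a + 1, len(seq))
        if seq[a] > seq[b]
    )
    return -1 if inversions % 2 else 1


def hodge_star(indices, m):
    """Return (sign, complement) with *(e_j1 ^ ... ^ e_jk) = sign * e_l1 ^ ... ^ e_l(m-k)."""
    indices = tuple(indices)
    complement = tuple(i for i in range(1, m + 1) if i not in indices)
    return perm_sign(indices + complement), complement


def double_star_sign(indices, m):
    """Sign picked up by applying the star operator twice to a basis form."""
    s1, comp = hodge_star(indices, m)
    s2, back = hodge_star(comp, m)
    assert back == tuple(indices)
    return s1 * s2


if __name__ == "__main__":
    m = 3
    for k in range(m + 1):
        for idx in combinations(range(1, m + 1), k):
            print(idx, "->", hodge_star(idx, m), " ** sign:", double_star_sign(idx, m))
```

Applying the star twice moves a block of $k$ indices past a block of $m - k$ indices, which is where the exponent $k(m-k)$ in (8.39) comes from.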
As a further application of the Hodge decomposition, we prove the following
result on the deRham cohomology groups of a Cartesian product $M \times N$ of two
compact manifolds, a special case of the Künneth formula.


Proposition 8.5. If $M$ and $N$ are compact manifolds, of dimension $m$ and $n$
respectively, then, for $0 \le k \le m + n$,


(8.47)  $\mathcal{H}^k(M \times N) \approx \bigoplus_{i+j=k} [\mathcal{H}^i(M) \otimes \mathcal{H}^j(N)]$.

Proof. Endow $M$ and $N$ with Riemannian metrics, and give $M \times N$ the product
metric. If $\{u_\mu^{(i)}\}$ is an orthonormal basis of $L^2(M, \Lambda^i)$ and $\{v_\nu^{(j)}\}$ is an orthonormal
basis of $L^2(N, \Lambda^j)$, each consisting of eigenfunctions of the Hodge Laplace
operator, then the collection $\{u_\mu^{(i)} \wedge v_\nu^{(j)} : i + j = k\}$ is an orthonormal basis of
$L^2(M \times N, \Lambda^k)$, consisting of eigenfunctions of the Hodge Laplacian, and since
all these Laplace operators are negative-semidefinite, we have the isomorphism


(8.48)  $\mathcal{H}_k(M \times N) \approx \bigoplus_{i+j=k} [\mathcal{H}_i(M) \otimes \mathcal{H}_j(N)]$,


where $\mathcal{H}_i(M)$ denotes the space of harmonic $i$-forms on $M$, etc., and by (8.32)
this proves the proposition.


We define the $i$th Betti number of $M$ to be
(8.49) $b_i(M) = \dim H^i(M)$.
Thus, (8.47) implies the identity


(8.50) $b_k(M \times N) = \sum_{i+j=k} b_i(M)\, b_j(N)$.


This identity has an application to the Euler characteristic of a product. The Euler
characteristic of $M$ is defined by


(8.51) $\chi(M) = \sum_{i=0}^{m} (-1)^i\, b_i(M)$,


where $m = \dim M$. From (8.50) follows directly the product formula
(8.52) $\chi(M \times N) = \chi(M)\chi(N)$.
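The passage from (8.50) to (8.52) is a convolution of Betti number sequences, which is easy to try on small examples. The sketch below assumes the standard Betti numbers of spheres (1 in degrees 0 and $n$, 0 otherwise, as in (8.56)-(8.57) of the exercises); the function names are illustrative only.

```python
def kunneth_betti(b_M, b_N):
    """Betti numbers of M x N from those of M and N, following (8.50)."""
    out = [0] * (len(b_M) + len(b_N) - 1)
    for i, x in enumerate(b_M):
        for j, y in enumerate(b_N):
            out[i + j] += x * y
    return out

def euler_char(betti):
    """Alternating sum of Betti numbers, as in (8.51)."""
    return sum((-1) ** i * b for i, b in enumerate(betti))

circle = [1, 1]        # b_0, b_1 of S^1
sphere2 = [1, 0, 1]    # b_0, b_1, b_2 of S^2

torus = kunneth_betti(circle, circle)          # S^1 x S^1
s2_x_s2 = kunneth_betti(sphere2, sphere2)      # S^2 x S^2
```

In these examples the torus has Betti numbers 1, 2, 1 and Euler characteristic 0, while $S^2 \times S^2$ has Euler characteristic $4 = \chi(S^2)^2$, in agreement with (8.52).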


NOTE. A different definition of $\chi(M)$ was given in Chapter 1; see (20.11). These
two definitions are related in §8 of Appendix C, Connections and Curvature.


Exercises 


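1. Let $\alpha \in \Lambda^1(M^m)$, $\beta \in \Lambda^k(M^m)$. Show that


(8.53) $*(\iota_\alpha \beta) = \pm\, \alpha \wedge *\beta$.


Find the sign. (Hint: Start with the identity $\sigma \wedge \alpha \wedge *\beta = \langle \sigma \wedge \alpha, \beta\rangle\, \omega$, given
$\sigma \in \Lambda^{k-1}(M)$.)
Alternative: Show $*\delta = \pm d*$, which implies (8.53) by passing to symbols.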

2. Show that if $X$ is a smooth vector field on $M$, and $\beta \in \Lambda^k(M^n)$, then


$\nabla_X(*\beta) = *(\nabla_X \beta)$.

3. Show that if $F : M \to M$ is an isometry that preserves orientation, then
$F^*(*\beta) = *(F^*\beta)$.

4. If $f : M \to N$ is a smooth map between compact manifolds, show that the pull-back
$f^* : \Lambda^k(N) \to \Lambda^k(M)$ induces a homomorphism $f^* : H^k(N) \to H^k(M)$. If $f_t$,
$0 \le t \le 1$, is a smooth family of such maps, show that $f_0^* = f_1^*$ on $H^k(N)$.
(Hint: For the latter, recall formulas (13.60)-(13.64) of Chap. 1.)

5. If $M$ is compact, connected, and oriented, and $\dim M = n$, show that


$H^0(M) \approx H^n(M) \approx \mathbb{R}$.
Relate this to Proposition 20.5 of Chap. 1.


In Exercises 6-8, let $G$ be a compact, connected Lie group, endowed with a bi-invariant
Riemannian metric. For each $g \in G$, there are left and right translations
$L_g(h) = gh$, $R_g(h) = hg$. Let $\mathcal{B}_k$ denote the space of bi-invariant $k$-forms on $G$,


(8.54) $\mathcal{B}_k = \{\beta \in \Lambda^k(G) : R_g^*\beta = \beta = L_g^*\beta \text{ for all } g \in G\}$.


6. Show that every harmonic $k$-form on $G$ belongs to $\mathcal{B}_k$. (Hint: If $\beta \in \mathcal{H}_k$, show $R_g^*\beta$
and $L_g^*\beta$ are both harmonic and cohomologous to $\beta$.)

7. Show that every $\beta \in \mathcal{B}_k$ is closed (i.e., $d\beta = 0$). Also, show that $* : \mathcal{B}_k \to \mathcal{B}_{n-k}$
($n = \dim G$). Hence conclude


$\mathcal{B}_k = \mathcal{H}_k$.


(Hint: To show that $d\beta = 0$, note that if $\iota : G \to G$ is $\iota(g) = g^{-1}$, then $\iota^*\beta \in \mathcal{B}_k$, and
$\iota^*\beta(e) = (-1)^k \beta(e)$. Since also $d\beta \in \mathcal{B}_{k+1}$, deduce that $\iota^* d\beta$ equals both $(-1)^k d\beta$
and $(-1)^{k+1} d\beta$.)

8. With $G$ as above, show that $\mathcal{B}_1$ is linearly isomorphic to the center $\mathcal{Z}$ of the Lie algebra
$\mathfrak{g}$ of $G$. Conclude that if $\mathfrak{g}$ has trivial center, then $H^1(G) = 0$.


Exercises 9-10 look at $H^k(S^n)$.

9. Let $\beta$ be any harmonic $k$-form on $S^n$. Show that $g^*\beta = \beta$, where $g$ is any element of
$SO(n+1)$, the group of rotations on $\mathbb{R}^{n+1}$, acting as a group of isometries of $S^n$.
(Hint: Compare the argument used in Exercise 6.)

10. Consider the point $p = (0, \dots, 0, 1) \in S^n$. The group $SO(n)$, acting on $\mathbb{R}^n \subset \mathbb{R}^{n+1}$,
fixes $p$. Show that $H^k(S^n)$ is isomorphic to (a linear subspace of)


(8.55) $V_k = \{\beta \in \Lambda^k\mathbb{R}^n : g^*\beta = \beta \text{ for all } g \in SO(n)\}$.
Show that $V_k = 0$ if $0 < k < n$. Deduce that
(8.56) $H^k(S^n) = 0$ if $0 < k < n$.
(Hint: Given G € AIR", 1 < 7, < n, average g* over g in the group of rotations 
in the x; — ze plane.) 
Note: By Exercise 5, if $n \ge 1$,


(8.57) $H^k(S^n) = \mathbb{R}$ if $k = 0$ or $n$.


Recall the elementary proof of this, for $k = n$, in Proposition 20.5 of Chap. 1.
11. Suppose $M$ is compact, connected, but not orientable, $\dim M = n$. Show that
$H^n(M) = 0$. (Hint: Let $\tilde{M}$ be an orientable double cover, with natural involution
$\iota$. A harmonic $n$-form on $M$ would lift to a harmonic form on $\tilde{M}$, invariant under $\iota^*$;
but $\iota$ reverses orientation.)


Exercises on the div-curl Lemma 


This problem set will derive a result known as the "div-curl lemma" of Murat-Tartar
[Tar], an ingredient in the method of "compensated compactness." The approach here
follows [RRT]; a related approach is used in [Kic]. Further results are given in Chap. 13,
§§6, 11, and 12, and there are applications in Chaps. 14 and 16.

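1. Let $\alpha_{j\nu} \in \Lambda^{k_j}(M)$ be, for each $j$, a sequence of forms such that


(8.58) $\alpha_{j\nu} \to \alpha_j$ weakly in $H^1$ as $\nu \to \infty$.


Show that
$d\alpha_{1\nu} \wedge d\alpha_{2\nu} \to d\alpha_1 \wedge d\alpha_2$ weakly in $\mathcal{D}'$ as $\nu \to \infty$.


(Hint: Write $d\alpha_{1\nu} \wedge d\alpha_{2\nu} = d(\alpha_{1\nu} \wedge d\alpha_{2\nu})$; note that $\alpha_{1\nu} \to \alpha_1$ strongly in $L^2$.)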
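2. Let $\sigma_{j\nu} \in \Lambda^{\ell_j}(M)$ be, for each $j$, a sequence of forms such that


(8.59) $\sigma_{j\nu} \to \sigma_j$ weakly in $L^2$ as $\nu \to \infty$.
Suppose furthermore that
(8.60) $d\sigma_{j\nu}$ is compact in $H^{-1}$.


Show that you can write $\sigma_{j\nu} = d\alpha_{j\nu} + \beta_{j\nu}$, where $\alpha_{j\nu}$ satisfies (8.58) and $\{\beta_{j\nu}\}$ is
compact in $L^2$. (Hint: Use the Hodge decomposition


$\sigma = d\delta G\sigma + \delta d G\sigma + P\sigma = d\alpha + \beta$.


Note that $d\sigma = d\beta$, $\delta\beta = 0$. Then set $\alpha_{j\nu} = \delta G \sigma_{j\nu}$.)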
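3. Under the hypotheses on $\sigma_{j\nu}$ in Exercise 2, show that


$\sigma_{1\nu} \wedge \sigma_{2\nu} \to \sigma_1 \wedge \sigma_2$ weakly in $\mathcal{D}'$ as $\nu \to \infty$.


Show that this can fail in examples where (8.60) is violated.
(Hint: Write


$\sigma_{1\nu} \wedge \sigma_{2\nu} = d(\alpha_{1\nu} \wedge d\alpha_{2\nu}) + d\alpha_{1\nu} \wedge \beta_{2\nu} + \beta_{1\nu} \wedge d\alpha_{2\nu} + \beta_{1\nu} \wedge \beta_{2\nu}$.)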


4. Let $\dim M = 3$, and let $X_\nu$ and $Y_\nu$ be two sequences of vector fields such that


(i) $X_\nu \to X$, $Y_\nu \to Y$ weakly in $L^2$,
(ii) $\operatorname{div} X_\nu$ and $\operatorname{curl} Y_\nu$ are compact in $H^{-1}$.


Show that $X_\nu \cdot Y_\nu \to X \cdot Y$ weakly in $\mathcal{D}'$. Show that the conclusion can fail in
cases where (ii) is violated. (Hint: Produce equivalent 1-forms, and use the Hodge star
operator to deduce this as a special case of Exercise 3.)


Auxiliary exercises on the Hodge star operator 


In most of the exercises to follow, adopt the following notational convention. For a
vector field $u$ on an oriented Riemannian manifold, let $\tilde{u}$ denote the associated 1-form.
1. Show that
$f = \operatorname{div} u \iff f = * d * \tilde{u}$.


If $M = \mathbb{R}^3$, show that
$v = \operatorname{curl} u \iff \tilde{v} = * d\tilde{u}$.


2. If $u$ and $v$ are vector fields on $\mathbb{R}^3$, show that
$w = u \times v \iff \tilde{w} = *(\tilde{u} \wedge \tilde{v})$.


Show that, for $\tilde{u}, \tilde{v} \in \Lambda^1(M^n)$, $*(\tilde{u} \wedge \tilde{v}) = (*\tilde{u}) \rfloor v$.
If $u \times v$ is defined by this formula for vector fields on an oriented Riemannian 3-fold,
show that $u \times v$ is orthogonal to $u$ and $v$.

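3. Show that the identity


(8.61) $\operatorname{div}(u \times v) = v \cdot \operatorname{curl} u - u \cdot \operatorname{curl} v$,
for $u$ and $v$ vector fields on $\mathbb{R}^3$, is a special case of
$* d(\tilde{u} \wedge \tilde{v}) = \langle * d\tilde{u}, \tilde{v}\rangle - \langle \tilde{u}, * d\tilde{v}\rangle$, $\tilde{u}, \tilde{v} \in \Lambda^1(M^3)$.
Deduce this from $d(\tilde{u} \wedge \tilde{v}) = (d\tilde{u}) \wedge \tilde{v} - \tilde{u} \wedge d\tilde{v}$.
In Exercises 4-6, we produce a generalization of the identity


(8.62) $\operatorname{curl}(u \times v) = v \cdot \nabla u - u \cdot \nabla v + (\operatorname{div} v)u - (\operatorname{div} u)v = [v, u] + (\operatorname{div} v)u - (\operatorname{div} u)v$,


valid for $u$ and $v$ vector fields on $\mathbb{R}^3$.
4. For $\tilde{u}, \tilde{v} \in \Lambda^1(M^n)$, use Exercise 2 to show that
$d * (\tilde{u} \wedge \tilde{v}) = -(d * \tilde{u}) \rfloor v + \mathcal{L}_v(*\tilde{u})$.
5. If $\omega \in \Lambda^n(M^n)$ is the volume form, show that $*(\omega \rfloor v) = \tilde{v}$. Deduce that
$*[(d * \tilde{u}) \rfloor v] = (\operatorname{div} u)\tilde{v}$.
6. Applying $\mathcal{L}_v$ to $(*\tilde{u}) \wedge \tilde{w} = \langle u, w\rangle\, \omega$, show that
$* \mathcal{L}_v(*\tilde{u}) = \widetilde{[v, u]} + (\operatorname{div} v)\tilde{u}$,


and hence
$* d * (\tilde{u} \wedge \tilde{v}) = \widetilde{[v, u]} + (\operatorname{div} v)\tilde{u} - (\operatorname{div} u)\tilde{v}$,
generalizing (8.62).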


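In Exercises 7-10, we produce a generalization of the identity


(8.63) $\operatorname{grad}(u \cdot v) = u \cdot \nabla v + v \cdot \nabla u + u \times \operatorname{curl} v + v \times \operatorname{curl} u$,


valid for $u$ and $v$ vector fields on $\mathbb{R}^3$. Only Exercise 10 makes contact with the Hodge
star operator.
7. Noting that, for $\tilde{u}, \tilde{v} \in \Lambda^1(M^n)$, $d(\tilde{u} \rfloor v) = \mathcal{L}_v\tilde{u} - (d\tilde{u}) \rfloor v$, show that


$2\, d(\tilde{u} \rfloor v) = \mathcal{L}_v\tilde{u} + \mathcal{L}_u\tilde{v} - (d\tilde{u}) \rfloor v - (d\tilde{v}) \rfloor u$.
8. Show that


$\mathcal{L}_v\tilde{u} = \widetilde{[v, u]} + (\mathcal{L}_v g)(\cdot, u)$,


where $g$ is the metric tensor, and where $h(\cdot, u) = \tilde{w}$ means $h(X, u) = g(X, w) = \langle X, w\rangle$. Hence
$\mathcal{L}_v\tilde{u} + \mathcal{L}_u\tilde{v} = (\mathcal{L}_v g)(\cdot, u) + (\mathcal{L}_u g)(\cdot, v)$.


9. Show that


$(\mathcal{L}_v g)(\cdot, u) + (\mathcal{L}_u g)(\cdot, v) = d(\tilde{u} \rfloor v) + \nabla_u\tilde{v} + \nabla_v\tilde{u}$.


10. Deduce that
$d\langle u, v\rangle = \nabla_u\tilde{v} + \nabla_v\tilde{u} - (d\tilde{u}) \rfloor v - (d\tilde{v}) \rfloor u$.


To relate this to (8.63), show using Exercises 1 and 2 that, for vector fields on $\mathbb{R}^3$,
$w = v \times \operatorname{curl} u \iff \tilde{w} = -(d\tilde{u}) \rfloor v$.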
11. If $u, v \in \Lambda^k(M^n)$ and $w \in \Lambda^{n-k}(M^n)$, show that
$(w, *v) = (-1)^{k(n-k)} (*w, v)$


and
$(*u, *v) = (u, v)$.


12. Show that $* d = (-1)^{k+1} \delta *$ on $\Lambda^k(M)$.
13. Verify carefully that $\Delta * = * \Delta$. In particular, on $\Lambda^k(M^n)$,


$* \Delta = \Delta * = (\pm 1)\,[(\pm 1)\, d * d + (\pm 1) * d * d *]$.


Find the signs.


9. Natural boundary problems for the Hodge Laplacian 


Let M be a compact Riemannian manifold with boundary, dim M = m. We have 
the Hodge Laplace operator 


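$\Delta : C^\infty(M, \Lambda^k) \longrightarrow C^\infty(M, \Lambda^k)$.


As shown in §11 of Chap. 2, we have a generalization of Green's formula, expressing
$-(\Delta u, v)$ as $(du, dv) + (\delta u, \delta v)$ plus a boundary integral. Two forms of this,
equivalent to formula (10.18) of Chap. 2, are


(9.1) $-(\Delta u, v) = (du, dv) + (\delta u, \delta v) + \frac{1}{i} \int_{\partial M} \left[ \langle \sigma_d(x, \nu)\, \delta u, v\rangle + \langle du, \sigma_d(x, \nu) v\rangle \right] dS$


and

(9.2) $-(\Delta u, v) = (du, dv) + (\delta u, \delta v) + \frac{1}{i} \int_{\partial M} \left[ \langle \delta u, \sigma_\delta(x, \nu) v\rangle + \langle \sigma_\delta(x, \nu)\, du, v\rangle \right] dS$.


Recall from (10.12) to (10.14) of Chap. 2 that

(9.3) $\frac{1}{i}\sigma_d(x, \nu) u = \nu \wedge u$, $\frac{1}{i}\sigma_\delta(x, \nu) u = -\iota_\nu u$.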


We have studied the Dirichlet and Neumann boundary problems for $\Delta$ on 0-forms
in previous sections. Here we will see that, for each $k \in \{0, \dots, m\}$, there
is a pair of boundary conditions generalizing these. To begin, suppose $M$ is half
of a compact Riemannian manifold without boundary $N$, having an isometric
involution $\tau : N \to N$, fixing $\partial M$ and switching $M$ and $N \setminus M$. For short, we will
say $N$ is the isometric double of $M$. Note that elements of $C^\infty(N)$ that are odd
with respect to $\tau$ vanish on $\partial M$, hence satisfy the Dirichlet boundary condition,
while elements even with respect to $\tau$ have vanishing normal derivatives on $\partial M$,
hence satisfy the Neumann boundary condition. Now, if $u \in \Lambda^k(N)$, then the
hypothesis $\tau^* u = -u$ (which implies $\tau^* du = -du$ and $\tau^* \delta u = -\delta u$) implies


(9.4) $\sigma_d(x, \nu) u = 0$ and $\sigma_d(x, \nu)\, \delta u = 0$ on $\partial M$,
while the hypothesis $\tau^* u = u$ (hence $\tau^* du = du$ and $\tau^* \delta u = \delta u$) implies
(9.5) $\sigma_\delta(x, \nu) u = 0$ and $\sigma_\delta(x, \nu)\, du = 0$ on $\partial M$.


We call the boundary conditions (9.4) and (9.5) relative boundary conditions and
absolute boundary conditions, respectively. Thus, specialized to 0-forms, relative
boundary conditions are Dirichlet boundary conditions, and absolute boundary
conditions are Neumann boundary conditions.
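The specialization to 0-forms can be spelled out directly from (9.3). The following short computation is offered as a sketch; it uses only the symbol formulas in (9.3) and the fact that $\delta$ annihilates 0-forms, and it writes $\partial_\nu u$ for the normal derivative, a notation assumed here.

```latex
% Case k = 0: u is a function on M.
\begin{align*}
\text{Relative (9.4):}\quad
  & \tfrac{1}{i}\sigma_d(x,\nu)u = \nu \wedge u = u\,\nu = 0
    \iff u|_{\partial M} = 0, \\
  & \delta u = 0 \ \text{for every 0-form, so } \sigma_d(x,\nu)\,\delta u = 0
    \ \text{holds automatically.} \\[4pt]
\text{Absolute (9.5):}\quad
  & \tfrac{1}{i}\sigma_\delta(x,\nu)u = -\iota_\nu u = 0
    \ \text{holds automatically for a 0-form,} \\
  & \tfrac{1}{i}\sigma_\delta(x,\nu)\,du = -\iota_\nu\, du
    = -\langle \nu, du\rangle = -\partial_\nu u = 0
    \iff \partial_\nu u|_{\partial M} = 0 .
\end{align*}
```

So in degree 0 only one condition of each pair carries information: the Dirichlet condition in the relative case and the Neumann condition in the absolute case.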

It is easy to see that


(9.6) $\nu \wedge u|_{\partial M} = 0 \iff j^* u = 0$, where $j : \partial M \hookrightarrow M$.
Thus the relative boundary conditions (9.4) can be rewritten as

(9.7) $j^* u = 0$, $j^*(\delta u) = 0$.

Using (9.3), we can rewrite the absolute boundary conditions (9.5) as


(9.8) $u \rfloor \nu = 0$ and $(du) \rfloor \nu = 0$ on $\partial M$.


Also, from Exercise 1 of §8, it follows that

(9.9) $\sigma_d(x, \nu)(*u) = \pm * \sigma_\delta(x, \nu) u$, $\sigma_d(x, \nu)\, \delta * u = \pm * \sigma_\delta(x, \nu)\, du$.


Thus the Hodge star operator interchanges absolute and relative boundary conditions.
In particular, the absolute boundary conditions are also equivalent to


(9.10) $j^*(*u) = 0$, $j^*(\delta * u) = 0$.


Note that if $u$ and $v$ satisfy relative boundary conditions, then the boundary integral
in (9.1) vanishes. Similarly, if $u$ and $v$ satisfy absolute boundary conditions,
then the boundary integral in (9.2) vanishes.

We define the following closed subspaces of Sobolev spaces of $k$-forms:


(9.11)
$H^1_R(M, \Lambda^k) = \{u \in H^1(M, \Lambda^k) : \sigma_d(x, \nu) u|_{\partial M} = 0\}$,
$H^1_A(M, \Lambda^k) = \{u \in H^1(M, \Lambda^k) : \sigma_\delta(x, \nu) u|_{\partial M} = 0\}$,
$H^2_R(M, \Lambda^k) = \{u \in H^2(M, \Lambda^k) : \text{(9.4) holds}\}$,
$H^2_A(M, \Lambda^k) = \{u \in H^2(M, \Lambda^k) : \text{(9.5) holds}\}$.


We have the following simple result, whose proof is left as an exercise. 


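Lemma 9.1. Suppose $M$ has an isometric double $N$, as above. Given $u \in \Lambda^k(M)$, set


(9.12) $\mathcal{O}u = u$ on $M$, $-\tau^* u$ on $N \setminus M$; $\mathcal{E}u = u$ on $M$, $\tau^* u$ on $N \setminus M$.


Then, for $j = 1, 2$,
(9.13) $\mathcal{O} : H^j_R(M, \Lambda^k) \to H^j(N, \Lambda^k)$, $\mathcal{E} : H^j_A(M, \Lambda^k) \to H^j(N, \Lambda^k)$.


Now the estimates for $\Delta$ on $k$-forms on $N$ established in §8 consequently
imply the following.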


Lemma 9.2. If $M$ has an isometric double $N$, then we have an estimate
(9.14) $\|u\|^2_{H^1(M)} \le C\|du\|^2_{L^2(M)} + C\|\delta u\|^2_{L^2(M)} + C\|u\|^2_{L^2(M)}$,


both for all $u \in H^1_R(M, \Lambda^k)$ and for all $u \in H^1_A(M, \Lambda^k)$. Furthermore, with
$b = R$ or $A$, if


$u \in H^1_b(M, \Lambda^k)$ and $(du, dv) + (\delta u, \delta v) \le C\|v\|_{L^2(M)}$,


for all $v \in H^1_b(M, \Lambda^k)$, then $u \in H^2_b(M, \Lambda^k)$.


It is convenient to rewrite the estimate (9.14) as the following pair of estimates:

(9.15) $\|u\|^2_{H^1(M)} \le C\|du\|^2_{L^2(M)} + C\|\delta u\|^2_{L^2(M)} + C\|\sigma_d(x, \nu) u\|^2_{H^{1/2}(\partial M)} + C\|u\|^2_{L^2(M)}$

and

(9.16) $\|u\|^2_{H^1(M)} \le C\|du\|^2_{L^2(M)} + C\|\delta u\|^2_{L^2(M)} + C\|\sigma_\delta(x, \nu) u\|^2_{H^{1/2}(\partial M)} + C\|u\|^2_{L^2(M)}$,


both valid for all $u \in H^1(M, \Lambda^k)$.

So far, the estimates (9.15) and (9.16) have been shown to hold when $M$ has
an isometric double. Now any compact manifold $M$ with smooth boundary has
a double $N$, a smooth manifold without boundary, together with a smooth involution
$\tau$ fixing $\partial M$. Also, $N$ possesses Riemannian metrics invariant under $\tau$.
However, if $M$ is endowed with some Riemannian metric, it may not extend to
a smooth invariant Riemannian metric on $N$. For example, a necessary condition
for such a metric to exist on $N$ would be that $\partial M$ is totally geodesic in $M$. Our
next task will be to show that the estimates (9.15) and (9.16) hold in general.

To begin, if $\chi \in C^\infty(M)$ is a cut-off, since the commutators $[d, \chi]$ and $[\delta, \chi]$
are bounded on $L^2(M)$, we see that it suffices to prove the following.


Lemma 9.3. For any p € M, there is a neighborhood O of p in M such that the 
estimates (9.15) and (9.16) hold for u supported in O. 


Of course, for interior points $p$, such estimates follow from the analysis of §8,
so we need only consider $p \in \partial M$.

For $p \in \partial M$, choose a coordinate mapping of a neighborhood $\mathcal{O}$ of $p$ in $M$
to a neighborhood $U$ of 0 in $\overline{\mathbb{R}}^m_+$, such that the induced Riemannian metric $g_{jk}$ is
equal to $\delta_{jk}$ at 0. In addition to the induced metric $g$ on $U$ (which gives rise to
$\delta = \pm * d *$), we have the flat metric $g^0$ on $U$, $g^0_{jk} = \delta_{jk}$, and associated operator
$\delta^0$. The differential operators $\delta$ and $\delta^0$ are first-order differential operators whose
principal symbols agree at the origin 0. Of course, the exterior derivative operator
$d$ is independent of the metric; $d = d^0$. We also note that the unit normal $\nu$ to $\partial M$
with respect to the metric $g$ is equal to the normal $\nu^0 = dx_m$ with respect to the
flat metric, at the origin, so the 0-order operators $\sigma_d(x, \nu)$ and $\sigma_{d^0}(x, \nu^0)$ agree at
0, and so do the 0-order operators $\sigma_\delta(x, \nu)$ and $\sigma_{\delta^0}(x, \nu^0)$.

Now the reflection argument described above shows that if we have $u \in H^1(U, \Lambda^k)$,
vanishing on the upper boundary, then


(9.17) $\|u\|^2_{H^1(U)} \le C\|du\|^2_{L^2(U)} + C\|\delta^0 u\|^2_{L^2(U)} + C\|B^0 u\|^2_{H^{1/2}(\Gamma)} + C\|u\|^2_{L^2(U)}$,


where $\Gamma$ is $\mathbb{R}^{m-1} = \partial\mathbb{R}^m_+$, compactified into a torus by putting $U \cap \mathbb{R}^{m-1}$ in a
big box and identifying opposite sides. Also, $B^0$ in (9.17) is either $\sigma_{d^0}(x, \nu^0)$ or
$\sigma_{\delta^0}(x, \nu^0)$. On the other hand, if in addition the support of $u$ is in a sufficiently
small neighborhood of 0, we have
(9.18) $\|\delta u - \delta^0 u\|_{L^2(U)} \le \varepsilon\|u\|_{H^1(U)} + C_\varepsilon\|u\|_{L^2(U)}$
and
(9.19) $\|Bu - B^0 u\|_{H^{1/2}(\Gamma)} \le \varepsilon\|u\|_{H^{1/2}(\Gamma)} \le C_0\, \varepsilon\|u\|_{H^1(U)}$,


where $B$ is either $\sigma_d(x, \nu)$ or $\sigma_\delta(x, \nu)$, depending on the choice of $B^0$. Consequently,
for $u$ with sufficiently small support, the estimates (9.15) and (9.16)
follow from (9.17). This proves Lemma 9.3 and consequently, in view of the
observation on cut-offs, we have the following.


Proposition 9.4. If $M$ is a compact Riemannian manifold with smooth boundary,
then the estimates (9.15) and (9.16) hold for all $u \in H^1(M, \Lambda^k)$. Hence the
estimate (9.14) holds both for all $u \in H^1_R(M, \Lambda^k)$ and for all $u \in H^1_A(M, \Lambda^k)$.


In analogy with our treatment of the Neumann boundary condition in §7, we 
define an operator 


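(9.20) $L_R : H^1_R(M, \Lambda^k) \longrightarrow H^1_R(M, \Lambda^k)^*$
by
(9.21) $(L_R u, v) = (du, dv) + (\delta u, \delta v)$, $u, v \in H^1_R(M, \Lambda^k)$,


and we also define


(9.22) $L_A : H^1_A(M, \Lambda^k) \longrightarrow H^1_A(M, \Lambda^k)^*$
by
(9.23) $(L_A u, v) = (du, dv) + (\delta u, \delta v)$, $u, v \in H^1_A(M, \Lambda^k)$.


The estimates (9.15) and (9.16) show that, with $b = R$ or $A$, and some $C_0 > 0$,
(9.24) $((L_b + C_0)u, u) \ge C\|u\|^2_{H^1(M)}$, $u \in H^1_b(M, \Lambda^k)$,


which as before leads to the following.


Proposition 9.5. For $b = R$ or $A$, the maps
(9.25) $L_b + C_0 : H^1_b(M, \Lambda^k) \longrightarrow H^1_b(M, \Lambda^k)^*$


are one-to-one and onto.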


The maps


(9.26) $T_b : H^1_b(M, \Lambda^k)^* \longrightarrow H^1_b(M, \Lambda^k)$
giving two-sided inverses of (9.25) are compact, self-adjoint operators on $L^2(M, \Lambda^k)$,
so we have orthonormal bases $\{u^{(k)}_j\}$ and $\{v^{(k)}_j\}$ of $L^2(M, \Lambda^k)$ satisfying


(9.27) $T_R u^{(k)}_j = \mu^{(k)}_j u^{(k)}_j$, $u^{(k)}_j \in H^1_R(M, \Lambda^k)$,
and

(9.28) $T_A v^{(k)}_j = \nu^{(k)}_j v^{(k)}_j$, $v^{(k)}_j \in H^1_A(M, \Lambda^k)$.


Since clearly $((L_b + 1)u, u) \ge \|u\|^2_{L^2}$, we can take $C_0 = 1$ in (9.25). Then the
eigenvalues of $T_R$ and $T_A$ all have magnitude $\le 1$, and we can order them so that,
for each $k$, $\mu^{(k)}_j$ and $\nu^{(k)}_j \searrow 0$ as $j \to \infty$. It follows that, for each $k$,

(9.29) $L_R u^{(k)}_j = \rho^{(k)}_j u^{(k)}_j$, $\rho^{(k)}_j = \dfrac{1}{\mu^{(k)}_j} - 1 \nearrow \infty$,

and

(9.30) $L_A v^{(k)}_j = \alpha^{(k)}_j v^{(k)}_j$, $\alpha^{(k)}_j = \dfrac{1}{\nu^{(k)}_j} - 1 \nearrow \infty$.


Here, $\rho^{(k)}_j \ge 0$ and $\alpha^{(k)}_j \ge 0$, and only finitely many of these quantities are equal
to zero.
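The relation between the spectrum of the compact inverse $T_b = (L_b + 1)^{-1}$ and that of $L_b$ in (9.29)-(9.30) is a purely spectral statement, so it can be illustrated with a matrix. The sketch below is a finite-dimensional stand-in only: the random positive semidefinite matrix `L` is an assumption chosen for illustration and has nothing to do with a specific manifold.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for L_b: a symmetric positive semidefinite 6x6 matrix of rank 4,
# so it has a 2-dimensional kernel (the analogue of the harmonic forms).
A = rng.standard_normal((6, 4))
L = A @ A.T

# Stand-in for T_b with C_0 = 1: the inverse of L + I.
T = np.linalg.inv(L + np.eye(6))

mu = np.linalg.eigvalsh(T)            # eigenvalues of T, all in (0, 1]
recovered = np.sort(1.0 / mu - 1.0)   # eigenvalues of L via 1/mu - 1
direct = np.linalg.eigvalsh(L)        # eigenvalues of L computed directly

# Eigenvalue 1 of T corresponds to eigenvalue 0 of L.
kernel_dim = int(np.sum(np.isclose(mu, 1.0)))
```

Here `recovered` agrees with `direct` up to rounding, every entry of `mu` lies in $(0, 1]$, and `kernel_dim` counts the zero eigenvalues of `L`.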

We can produce higher-order regularity results by the same techniques as used
for the Dirichlet and Neumann problems. In analogy with Proposition 7.2, we
have

Proposition 9.6. With $b = R$ or $A$, given $f \in L^2(M, \Lambda^k)$, $u = T_b f$ satisfies
(9.31) $u \in H^2_b(M, \Lambda^k)$,

and there is the estimate


(9.32) $\|u\|^2_{H^2(M)} \le C\|\Delta u\|^2_{L^2(M)} + C\|B^{(0)}_b u\|^2_{H^{3/2}(\partial M)} + C\|B^{(1)}_b u\|^2_{H^{1/2}(\partial M)} + C\|u\|^2_{H^1(M)}$,


for all $u \in H^2(M, \Lambda^k)$, where


(9.33)
$B^{(0)}_R u = \sigma_d(x, \nu) u$, $B^{(0)}_A u = \sigma_\delta(x, \nu) u$,
$B^{(1)}_R u = \sigma_d(x, \nu)\, \delta u$, $B^{(1)}_A u = \sigma_\delta(x, \nu)\, du$.

This can be proved in the same way as Proposition 7.2. We give details on why
the boundary conditions hold in (9.31), which are slightly more involved than
before. We claim that, given $u \in H^1_b(M, \Lambda^k)$, with $\Delta u \in L^2(M, \Lambda^k)$, then the
boundary term in (9.1)-(9.2) vanishes for all $v \in H^1_b(M, \Lambda^k)$ if and only if all the
appropriate boundary data for $u$ vanish; for example, $\sigma_d(x, \nu)\, \delta u = 0$ on $\partial M$, in
case $b = R$. We need to establish the "only if" part. Take the case $b = R$. Pick
$\sigma \in C^\infty(M, \operatorname{Hom}(\Lambda^{k-1}, \Lambda^k))$ such that $\sigma(x) = \sigma_d(x, \nu)$, for $x \in \partial M$. Then,
for any $w \in \Lambda^{k-1}(M)$, we have $v = \sigma w \in H^1_R(M, \Lambda^k)$, and hence, for any
$u \in H^1_R(M, \Lambda^k)$, the boundary term in (9.1) is equal to


$\beta(u, v) = \frac{1}{i} \int_{\partial M} \langle \sigma_d(x, \nu)\, \delta u, \sigma_d(x, \nu) w\rangle\, dS = \frac{1}{i} \int_{\partial M} \langle \sigma_d(x, \nu)^* \sigma_d(x, \nu)(\delta u), w\rangle\, dS$.


This vanishes for all $w \in \Lambda^{k-1}(M)$ if and only if $\sigma_d(x, \nu)^* \sigma_d(x, \nu)(\delta u) = 0$ on
$\partial M$, which in turn occurs if and only if $\sigma_d(x, \nu)(\delta u) = 0$ on $\partial M$. Thus, obtaining
$u \in H^2(M, \Lambda^k)$ by the methods used in Proposition 7.2, we have (9.31), in
case $b = R$. The case $b = A$ is similar.

Next, the same arguments proving Proposition 7.5 and Theorem 1.3 establish 
the following. 


Proposition 9.7. Given $f \in H^j(M, \Lambda^k)$, $j = 1, 2, 3, \dots$, a $k$-form $u \in H^{j+1}(M, \Lambda^k)$
satisfying


(9.34) $\Delta u = f$ on $M$


and either of the boundary conditions (9.4) or (9.5), belongs to $H^{j+2}(M, \Lambda^k)$.
Furthermore, we have estimates


(9.35) $\|u\|^2_{H^{j+2}(M)} \le C\|\Delta u\|^2_{H^j(M)} + C\|B^{(0)}_b u\|^2_{H^{j+3/2}(\partial M)} + C\|B^{(1)}_b u\|^2_{H^{j+1/2}(\partial M)} + C\|u\|^2_{H^{j+1}(M)}$


for all $u \in H^{j+2}(M, \Lambda^k)$, where $B^{(j)}_b$ are given by (9.33).


One corollary of this is that the eigenfunctions $u^{(k)}_j$ and $v^{(k)}_j$ are in $C^\infty(M, \Lambda^k)$
and satisfy the boundary conditions (9.4) and (9.5), respectively. The 0-eigenspaces
of $L_R$ and $L_A$ are finite-dimensional spaces in $C^\infty(M, \Lambda^k)$; denote them
by $\mathcal{H}^R_k$ and $\mathcal{H}^A_k$, respectively. We see that, for $b = R$ or $A$,


(9.36) $u \in \mathcal{H}^b_k \iff u \in C^\infty(M, \Lambda^k)$, $B^{(0)}_b u = 0$ on $\partial M$, and $du = \delta u = 0$ on $M$.


Again, $B^{(0)}_b$ are given by (9.33). Equivalently,
(9.37) $B^{(0)}_R u = \nu \wedge u$, $B^{(0)}_A u = u \rfloor \nu$.
Also recall that we can replace $\nu \wedge u$ by $j^* u$. To state the result slightly differently,
(9.38) $u \in \mathcal{H}^b_k \iff u \in H^1_b(M, \Lambda^k)$ and $du = \delta u = 0$.
We call $\mathcal{H}^R_k$ and $\mathcal{H}^A_k$ the spaces of harmonic $k$-forms, satisfying relative and absolute
boundary conditions, respectively.

Denote by $P^R_h$ and $P^A_h$ the orthogonal projections of $L^2(M, \Lambda^k)$ onto $\mathcal{H}^R_k$ and
$\mathcal{H}^A_k$. Parallel to (8.22) and (8.23) we have continuous linear maps
(9.39) $G^b : L^2(M, \Lambda^k) \longrightarrow H^2_b(M, \Lambda^k)$, $b = R$ or $A$,
such that $G^b$ annihilates $\mathcal{H}^b_k$ and inverts $-\Delta$ on the orthogonal complement of $\mathcal{H}^b_k$:
(9.40) $-\Delta G^b u = (I - P^b_h) u$, for $u \in L^2(M, \Lambda^k)$,
and furthermore, for $j \ge 0$,
(9.41) $G^b : H^j(M, \Lambda^k) \longrightarrow H^{j+2}(M, \Lambda^k)$.


The identity (9.40) then produces the following two Hodge decompositions for a
compact Riemannian manifold with boundary.


Proposition 9.8. Given $u \in H^j(M, \Lambda^k)$, $j \ge 0$, we have

(9.42) $u = d\delta G^R u + \delta d G^R u + P^R_h u = P^R_d u + P^R_\delta u + P^R_h u$
and

(9.43) $u = d\delta G^A u + \delta d G^A u + P^A_h u = P^A_d u + P^A_\delta u + P^A_h u$.


In both cases, the three terms on the right side are mutually orthogonal in
$L^2(M, \Lambda^k)$.

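Proof. It remains only to check orthogonality, which requires a slightly longer
argument than that used in Proposition 8.2. By continuity, it suffices to check the
orthogonality for $u \in C^\infty(M, \Lambda^k)$. We will use the identity

(9.44) $(du, v) = (u, \delta v) + \gamma(u, v)$,


for $u \in \Lambda^{j-1}(M)$ and $v \in \Lambda^j(M)$, with


(9.45) $\gamma(u, v) = \frac{1}{i} \int_{\partial M} \langle \sigma_d(x, \nu) u, v\rangle\, dS = \frac{1}{i} \int_{\partial M} \langle u, \sigma_\delta(x, \nu) v\rangle\, dS$.
Note that $\gamma(u, v) = 0$ if either $u \in H^1_R(M, \Lambda^{j-1})$ or $v \in H^1_A(M, \Lambda^j)$. In particular,
we see that


(9.46)
$u \in H^1_R(M, \Lambda^{j-1}) \Longrightarrow du \perp \ker \delta \cap H^1(M, \Lambda^j)$,
$v \in H^1_A(M, \Lambda^j) \Longrightarrow \delta v \perp \ker d \cap H^1(M, \Lambda^{j-1})$.
From the definitions, we have


(9.47)
$\delta : H^2_R(M, \Lambda^j) \longrightarrow H^1_R(M, \Lambda^{j-1})$,
$d : H^2_A(M, \Lambda^j) \longrightarrow H^1_A(M, \Lambda^{j+1})$,
so
(9.48)
$d\delta H^2_R(M, \Lambda^k) \perp \ker \delta \cap H^1(M, \Lambda^k)$,
$\delta d H^2_A(M, \Lambda^k) \perp \ker d \cap H^1(M, \Lambda^k)$.


Now (9.48) implies for the ranges:

(9.49) $\mathcal{R}(P^R_d) \perp \mathcal{R}(P^R_\delta) + \mathcal{R}(P^R_h)$, $\mathcal{R}(P^A_\delta) \perp \mathcal{R}(P^A_d) + \mathcal{R}(P^A_h)$.
Furthermore, if $u \in \mathcal{H}^R_k$ and $v = dG^R w$, then $\gamma(u, v) = 0$, so $(u, \delta v) =
(du, v) = 0$. Similarly, if $v \in \mathcal{H}^A_k$ and $u = \delta G^A w$, then $\gamma(u, v) = 0$, so
$(du, v) = (u, \delta v) = 0$. Thus

(9.50) $\mathcal{R}(P^R_\delta) \perp \mathcal{R}(P^R_h)$, $\mathcal{R}(P^A_d) \perp \mathcal{R}(P^A_h)$.

The proposition is proved.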
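The algebra behind a three-term orthogonal decomposition of this kind can be seen in a finite-dimensional model. The sketch below is only an analogue and rests on assumptions not made in the text: it uses a small simplicial cochain complex (a hypothetical example with four vertices, five edges, and one filled triangle), transposes in place of $\delta$, and a pseudoinverse in place of the Green operator. It has no boundary conditions, so it models the mechanism rather than the relative or absolute theory itself.

```python
import numpy as np

# Hypothetical toy complex: vertices 0..3, oriented edges, one filled triangle (0,1,2).
edges = [(0, 1), (1, 2), (0, 2), (2, 3), (0, 3)]

# d0: 0-cochains -> 1-cochains, (d0 f)(a, b) = f(b) - f(a).
d0 = np.zeros((len(edges), 4))
for row, (a, b) in enumerate(edges):
    d0[row, a], d0[row, b] = -1.0, 1.0

# d1: 1-cochains -> 2-cochains; the boundary of (0,1,2) is (0,1) + (1,2) - (0,2).
d1 = np.array([[1.0, 1.0, -1.0, 0.0, 0.0]])
assert np.allclose(d1 @ d0, 0.0)  # d composed with d vanishes

# With the standard inner products, the adjoint of d is the transpose.
lap = d0 @ d0.T + d1.T @ d1          # plays the role of -Delta on 1-cochains
G = np.linalg.pinv(lap)              # inverts lap on the complement of its kernel
P_h = np.eye(len(edges)) - lap @ G   # orthogonal projection onto the kernel

def hodge_parts(u):
    """Return the analogues of (d delta G u, delta d G u, P_h u)."""
    Gu = G @ u
    return d0 @ (d0.T @ Gu), d1.T @ (d1 @ Gu), P_h @ u

u = np.array([1.0, -2.0, 0.5, 3.0, 4.0])
exact, coexact, harmonic = hodge_parts(u)
```

The three pieces sum to `u` and are pairwise orthogonal, and the kernel of `lap` is one-dimensional, matching the single unfilled cycle 0-2-3 in this toy complex.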


We can produce an analogue of Proposition 8.3, relating the spaces $\mathcal{H}^b_k$ to cohomology
groups. We first look at the case $b = R$. Set


(9.51) $C^\infty_r(M, \Lambda^k) = \{u \in C^\infty(M, \Lambda^k) : j^* u = 0\}$.
Since $d \circ j^* = j^* \circ d$, it is clear that

(9.52) $d : C^\infty_r(M, \Lambda^k) \longrightarrow C^\infty_r(M, \Lambda^{k+1})$.
Our spaces of "closed" and "exact" forms are


(9.53)
$\mathcal{C}^k_R(M) = \{u \in C^\infty_r(M, \Lambda^k) : du = 0\}$,
$\mathcal{E}^k_R(M) = d\, C^\infty_r(M, \Lambda^{k-1})$.


We set
(9.54) $H^k(M, \partial M) = \mathcal{C}^k_R(M) / \mathcal{E}^k_R(M)$.


Proposition 9.9. If $M$ is a compact Riemannian manifold with boundary, there
is a natural isomorphism


(9.55) $H^k(M, \partial M) \approx \mathcal{H}^R_k$.
Proof. By (9.36) we have an injection

$j : \mathcal{H}^R_k \longrightarrow \mathcal{C}^k_R(M)$,

which yields a map

$J : \mathcal{H}^R_k \longrightarrow H^k(M, \partial M)$,
by composing with (9.54). The orthogonality of the terms in (9.42) implies
$(\text{Image } j) \cap \mathcal{E}^k_R(M) = 0$, so $J$ is injective. Furthermore, if $u \in \mathcal{C}^k_R(M)$, then
$u$ is orthogonal to $\delta v$ for any $v \in C^\infty(M, \Lambda^{k+1})$, so the term $\delta(dG^R u)$ in (9.42)
vanishes, and hence $J$ is surjective. This proves the proposition.


As in §8, it is clear that $H^k(M, \partial M)$ is independent of a metric on $M$. Thus
the dimension of $\mathcal{H}^R_k$ is independent of such a metric.

Associated to absolute boundary conditions is the family of spaces
(9.56) $C^\infty_a(M, \Lambda^k) = \{u \in C^\infty(M, \Lambda^k) : \iota_\nu u = \iota_\nu(du) = 0\}$,
replacing (9.51); we have
(9.57) $d : C^\infty_a(M, \Lambda^k) \longrightarrow C^\infty_a(M, \Lambda^{k+1})$,


and, with $\mathcal{C}^k_A(M)$ the kernel of $d$ in (9.57) and $\mathcal{E}^{k+1}_A(M)$ its image, we can form
quotients. The following result is parallel to Proposition 9.9.


Proposition 9.10. There is a natural isomorphism
(9.58) $\mathcal{H}^A_k \approx \mathcal{C}^k_A(M) / \mathcal{E}^k_A(M)$.
Proof. This is exactly parallel to the proof of Proposition 9.9.


We have refrained from denoting the right side of (9.58) by $H^k(M)$, since the
deRham cohomology of $M$ has the standard definition


(9.59) $H^k(M) = \mathcal{C}^k(M) / \mathcal{E}^k(M)$,


where $\mathcal{C}^k(M)$ is the kernel and $\mathcal{E}^{k+1}(M)$ the image of $d$ in
(9.60) $d : C^\infty(M, \Lambda^k) \longrightarrow C^\infty(M, \Lambda^{k+1})$.


Note that no boundary conditions are imposed here. We now establish that (9.58)
is isomorphic to $H^k(M)$.


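Proposition 9.11. The quotient spaces $\mathcal{C}^k_A(M) / \mathcal{E}^k_A(M)$ and $H^k(M)$ are naturally
isomorphic. Hence


(9.61) $\mathcal{H}^A_k \approx H^k(M)$.
Proof. It is clear that there is a natural map
$\kappa : \mathcal{C}^k_A(M) / \mathcal{E}^k_A(M) \longrightarrow H^k(M)$,


since $\mathcal{C}^k_A(M) \subset \mathcal{C}^k(M)$ and $\mathcal{E}^k_A(M) \subset \mathcal{E}^k(M)$. To show that $\kappa$ is surjective, let
$\alpha \in C^\infty(M, \Lambda^k)$ be closed; we want $\tilde{\alpha} \in \mathcal{C}^k_A(M)$ such that $\alpha - \tilde{\alpha} = d\beta$ for some
$\beta \in C^\infty(M, \Lambda^{k-1})$.

To arrange this, we use a 1-parameter family of maps


(9.62) $\varphi_t : M \longrightarrow M$, $0 \le t \le 1$,


such that $\varphi_0$ is the identity map, and as $t \to 1$, $\varphi_t$ retracts a collar neighborhood
$\mathcal{O}$ of $\partial M$ onto $\partial M$, along geodesics normal to $\partial M$. Set $\tilde{\alpha} = \varphi_1^* \alpha$. It is easy to
see that $\tilde{\alpha} \in \mathcal{C}^k_A(M)$. Furthermore, $\alpha - \tilde{\alpha} = d\beta$ with


(9.63) $\beta = -\int_0^1 \varphi_t^* \left( \alpha \rfloor X(t) \right) dt \in C^\infty(M, \Lambda^{k-1})$,


where $X(t) = (d/dt)\varphi_t$. Compare the proof of the Poincaré lemma, Theorem
13.2 of Chap. 1, and formulas (13.61)-(13.64) of that chapter. It follows that $\kappa$ is
surjective.

Consequently, we have a natural surjective homomorphism


(9.64) $\tilde{\kappa} : \mathcal{H}^A_k \longrightarrow H^k(M)$.


It remains to prove that $\tilde{\kappa}$ is injective. But if $\alpha \in \mathcal{H}^A_k$ and $\alpha = d\beta$, $\beta \in
C^\infty(M, \Lambda^{k-1})$, then the identity (9.44) with $u = \beta$, $v = \alpha$ implies $(\alpha, \alpha) = 0$,
hence $\alpha = 0$. This completes the proof.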

One can give a proof of (9.61) without using such a homotopy argument, in
fact without using $\mathcal{C}^k_A(M) / \mathcal{E}^k_A(M)$ at all. See Exercise 5 in the set of exercises on
cohomology after this section. Such an argument will be useful in Chap. 12. On
the other hand, homotopy arguments similar to that used above are also useful,
and will arise in a number of problems in this set of exercises.

We can now establish the following Poincaré duality theorem, whose proof
is immediate, since by (9.9) the Hodge star operator interchanges absolute and
relative boundary conditions.


Proposition 9.12. If $M$ is an oriented, compact Riemannian manifold with
boundary, then


(9.65) $* : \mathcal{H}^R_k \longrightarrow \mathcal{H}^A_{m-k}$


is an isomorphism, where $m = \dim M$. Consequently,
(9.66) $H^k(M, \partial M) \approx H^{m-k}(M)$.


We end this section with a brief description of a sequence of maps on cohomology,
associated to a compact manifold $M$ with boundary. The sequence takes
the form


(9.67) $\cdots \to H^{k-1}(\partial M) \xrightarrow{\ \delta\ } H^k(M, \partial M) \xrightarrow{\ \pi\ } H^k(M) \xrightarrow{\ \iota\ } H^k(\partial M) \xrightarrow{\ \delta\ } \cdots$
These maps are defined as follows. The inclusion
$C^\infty_r(M, \Lambda^k) \hookrightarrow C^\infty(M, \Lambda^k)$,


yielding $\mathcal{C}^k_R(M) \subset \mathcal{C}^k(M)$ and $\mathcal{E}^k_R(M) \subset \mathcal{E}^k(M)$, gives rise to $\pi$ in a natural
fashion. The map $\iota$ comes from the pull-back


$j^* : C^\infty(M, \Lambda^k) \longrightarrow C^\infty(\partial M, \Lambda^k)$,


which induces a map on cohomology since $j^* d = d j^*$. Note that $j^*$ annihilates
$C^\infty_r(M, \Lambda^k)$, so $\iota \circ \pi = 0$.

The "coboundary map" $\delta$ is defined on the class $[\alpha] \in H^{k-1}(\partial M, \mathbb{R})$ of a
closed form $\alpha \in \Lambda^{k-1}(\partial M)$ by choosing a form $\beta \in C^\infty(M, \Lambda^{k-1})$ such that
$j^*\beta = \alpha$ and taking the class $[d\beta]$ of $d\beta \in \mathcal{C}^k_R(M)$. Note that $d\beta$ might not belong
to $\mathcal{E}^k_R(M)$ if $j^*\beta$ is not exact. If another $\tilde{\beta}$ is picked such that $j^*\tilde{\beta} = \alpha + d\gamma$, then
$d(\beta - \tilde{\beta})$ does belong to $\mathcal{E}^k_R(M)$, so $\delta$ is well defined:


$\delta[\alpha] = [d\beta]$, with $j^*\beta = \alpha$.


Note that if $[\alpha] = \iota[\beta]$, via $\alpha = j^*\beta$ with $\beta \in \mathcal{C}^{k-1}(M)$, then $d\beta = 0$, so $\delta \circ \iota = 0$.
Also, since $d\beta \in \mathcal{E}^k(M)$, $\pi \circ \delta = 0$.

In fact, the sequence (9.67) is exact, that is, the image of each map is equal to
the kernel of the map that follows. This "long exact sequence" in cohomology is
a useful computational tool. Exactness will be sketched in some of the following
exercises on cohomology.

Another important exact sequence, the Mayer-Vietoris sequence, is discussed
in Appendix B at the end of this chapter.
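Exactness of (9.67) forces the alternating sum of the dimensions along the sequence to vanish, and this can be checked on the ball. The following sketch takes as given the values recorded in (9.70), (9.71), and (9.75) in the exercises below for $M = B^{n+1}$, $\partial M = S^n$, with $n \ge 1$; the function names are illustrative only.

```python
def sphere_betti(n):
    """dim H^k(S^n), k = 0..n, for n >= 1, as in (9.75)."""
    return [1 if k in (0, n) else 0 for k in range(n + 1)]

def ball_betti(m):
    """dim H^k(B^m), k = 0..m: R in degree 0, zero otherwise, as in (9.70)."""
    return [1] + [0] * m

def ball_relative_betti(m):
    """dim H^k(B^m, S^{m-1}), k = 0..m: R in degree m only, as in (9.71)."""
    return [0] * m + [1]

def euler_char(dims):
    """Alternating sum of a list of dimensions indexed by degree."""
    return sum((-1) ** k * d for k, d in enumerate(dims))

def exact_sequence_defect(n):
    """chi(M, dM) - chi(M) + chi(dM) for M = B^{n+1}; exactness of (9.67) makes it 0."""
    m = n + 1
    return (
        euler_char(ball_relative_betti(m))
        - euler_char(ball_betti(m))
        + euler_char(sphere_betti(n))
    )

def duality_holds(m):
    """Check dim H^k(M, dM) = dim H^{m-k}(M) for M = B^m, as in (9.66)."""
    rel, absolute = ball_relative_betti(m), ball_betti(m)
    return all(rel[k] == absolute[m - k] for k in range(m + 1))
```

For each $n \ge 1$ the defect is 0, and the duality check passes, which is consistent with (9.66) and with the exactness claimed for (9.67).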


Exercises 


1. Let $u$ be a 1-form on $M$ with associated vector field $U$. Show that the relative boundary
conditions (9.4) are equivalent to
$U \perp \partial M$ and $\operatorname{div} U = 0$ on $\partial M$.
If $\dim M = 3$, show that the absolute boundary conditions (9.5) are equivalent to
$U \parallel \partial M$ and $\operatorname{curl} U \perp \partial M$.


Treat the case $\dim M = 2$.


2. Let $b = R$ or $A$. Consider the unbounded operator $D_b$ on $H = \bigoplus_k L^2(M, \Lambda^k)$:


$D_b = d + \delta$, $\mathcal{D}(D_b) = \bigoplus_k H^1_b(M, \Lambda^k)$.


Here $\mathcal{D}(D_b)$ denotes the domain of $D_b$. Show that $D_b$ is self adjoint, that $\mathcal{D}(D_b^2) =
\bigoplus_k H^2_b(M, \Lambda^k)$, and that $D_b^2 = -\Delta$ on this domain. Show that


$u = (d + \delta) G^b (d + \delta) u + P^b_h u$, for $u \in H^1_b(M, \Lambda^k)$.


Reconsider this problem after reading §§11 and 12. For a discussion of unbounded
operators defined on dense domains, see §8 in Appendix A.
3. Show that $d$ and $\delta$ map $\mathcal{D}(D_b^{j+1})$ to $\mathcal{D}(D_b^j)$, for $j \ge 0$.
4. Form the orthogonal projections $P^b_d = d\delta G^b$, $P^b_\delta = \delta d G^b$. With $b = R$ or $A$, show
that the four operators

$G^b$, $P^b_h$, $P^b_d$, and $P^b_\delta$
all commute. Deduce that one can arrange the eigenfunctions $u^{(k)}_j$, forming an orthonormal
basis of $L^2(M, \Lambda^k)$, such that each one appears in exactly one term in the Hodge
decomposition (9.42), and that the same can be done with the eigenfunctions $v^{(k)}_j$,
relative to the decomposition (9.43).
5. If $M$ is oriented, and $*$ the Hodge star operator, show that


$T_A * = * T_R$,
where $T_A$ and $T_R$ are as in (9.26). Show that
$P^A_h * = * P^R_h$ and $G^A * = * G^R$.
Also, with $P^b_d$ and $P^b_\delta$ the projections defined above, show that


$P^A_d * = * P^R_\delta$ and $P^A_\delta * = * P^R_d$.

Exercises on cohomology


1. Let $M$ be a compact, connected manifold with nonempty boundary, and double $N$.
Endow $N$ with a Riemannian metric invariant under the involution $\tau$. Show that


(9.68) $H^k(M, \partial M) \approx \{u \in \mathcal{H}_k(N) : \tau^* u = -u\}$.
Deduce that if $M$ is also orientable,
(9.69) $H^n(M, \partial M) = \mathbb{R}$, $n = \dim M$.
2. If $M$ is connected, show directly that
$H^0(M) = \mathbb{R}$.


By Poincaré duality, this again implies (9.69), when $M$ is orientable.
3. Show that if $M$ is connected and $\partial M \ne \emptyset$,


$H^0(M, \partial M) = 0$.


Deduce that if $M$ is also orientable, $n = \dim M$, then


$H^n(M) = 0$.


Give a proof of this that also works in the nonorientable case.
4. Show directly, using the proof of the Poincaré lemma, Theorem 13.2 of Chap. 1, that


(9.70) $H^k(B^n) = 0$, $1 \le k \le n$,
where $B^n$ is the closed unit ball in $\mathbb{R}^n$, with boundary $S^{n-1}$. Deduce that


(9.71) $H^k(B^n, S^{n-1}) = 0$ for $0 \le k < n$, and $H^k(B^n, S^{n-1}) = \mathbb{R}$ for $k = n$.


5. Use (9.48) to show directly from Proposition 9.8 (not using Proposition 9.11) that, if
$\alpha \in C^\infty(M, \Lambda^k)$ is closed, then $\alpha = d\beta + P^A_h \alpha$ for some $\beta \in C^\infty(M, \Lambda^{k-1})$, in
fact, for $\beta = \delta G^A \alpha$. Hence conclude that


$\mathcal{H}^A_k \approx H^k(M)$
without using the homotopy argument of Proposition 9.11.


Let $M$ be a smooth manifold without boundary. The cohomology with compact
supports $H^k_c(M)$ is defined via


(9.72) $d : C^\infty_0(M, \Lambda^k) \longrightarrow C^\infty_0(M, \Lambda^{k+1})$,


as
$H^k_c(M) = \mathcal{C}^k_c(M) / \mathcal{E}^k_c(M)$,
where the kernel of $d$ in (9.72) is $\mathcal{C}^k_c(M)$ and its image is $\mathcal{E}^{k+1}_c(M)$.


In Exercises 6 and 7, we assume $M$ is the interior of a compact manifold with
boundary $\overline{M}$.
6. Via $C^\infty_0(M, \Lambda^k) \hookrightarrow C^\infty_r(\overline{M}, \Lambda^k)$, we have a well-defined homomorphism
$\rho : H^k_c(M) \longrightarrow H^k(\overline{M}, \partial M)$.


Show that $\rho$ is injective. (Hint: Let $\varphi_t : \overline{M} \to \overline{M}$ be as in (9.62); also, given $K \subset\subset
M$, arrange that each $\varphi_t$ is the identity on $K$. If $\alpha \in \mathcal{C}^k_c(M)$ has support in $K$ and
$\alpha = d\beta$, with $\beta \in C^\infty_r(\overline{M}, \Lambda^{k-1})$, show that $\tilde{\beta} = \varphi_1^*\beta$ has compact support and
$d\tilde{\beta} = \alpha$.)


7. Show that $\rho$ is surjective, and hence
(9.73) $H^k_c(M) \approx H^k(\overline{M}, \partial M)$.


(Hint: If $\alpha \in \mathcal{C}^k_R(\overline{M})$, set $\tilde{\alpha} = \varphi_1^*\alpha$ and parallel the argument using (9.63), in the
proof of Proposition 9.11.)
8. If $M$ is connected and oriented, and $\dim M = n$, show that

$H^n_c(M) \approx \mathbb{R}$,

even if $M$ cannot be compactified to a manifold with smooth boundary.
(Hint: If $\alpha \in C^\infty_0(M, \Lambda^n)$ and $\int_M \alpha = 0$, fit the support of $\alpha$ in the interior $Y$ of
a compact, smooth manifold with boundary $\overline{Y} \subset M$. Then apply arguments outlined
above.)

9. Let $X$ be a compact, connected manifold; given $p \in X$, let $M = X \setminus \{p\}$. Then
$C^\infty_0(M, \Lambda^k) \hookrightarrow C^\infty(X, \Lambda^k)$ induces a homomorphism


$\gamma : H^k_c(M) \longrightarrow H^k(X)$.


Show that $\gamma$ is an isomorphism, for $0 < k \le \dim X$. (Hint: Construct a family of maps
$\psi_t : X \to X$, with properties like $\varphi_t$ used in Exercises 6 and 7, this time collapsing
a neighborhood $\mathcal{O}$ of $p$ onto $p$ as $t \to 1$. Establish the injectivity and surjectivity of
$\gamma$ by arguments similar to those used in Exercises 6 and 7, noting that the analogue of
the argument in Exercise 7 fails in this case when $k = 0$.)

10. Using Exercise 9, deduce that


(9.74) $H^k(S^n) \approx H^k_c(\mathbb{R}^n)$, $0 < k \le n$.
In light of Exercises 4 and 7, show that this leads to


(9.75) $H^k(S^n) = 0$ if $0 < k < n$, and $H^k(S^n) = \mathbb{R}$ if $k = 0$ or $n$,


provided $n \ge 1$, giving therefore a demonstration of (8.56)-(8.57) different from that
suggested in Exercise 9 of §8.


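Exercises 11-13 establish the exactness of the sequence (9.67).
11. Show that $\ker \iota \subset \operatorname{im} \pi$. (Hint: Given $u \in \mathcal{C}^k(\overline{M})$, $j^*u = dv$, pick $w \in \Lambda^{k-1}(\overline{M})$
such that $j^*w = v$, to get $u - dw \in C_r^\infty(\overline{M}, \Lambda^k)$, closed.)
12. Show that $\ker \delta \subset \operatorname{im} \iota$. (Hint: Given $\alpha \in \mathcal{C}^k(\partial M)$, if $\alpha = j^*\tilde\beta$ with $[d\tilde\beta] = 0$ in
$H^{k+1}(\overline{M}, \partial M)$, that is, $d\tilde\beta = d\beta$, $\beta \in C_r^\infty(\overline{M}, \Lambda^k)$, show that $[\alpha] = \iota[\tilde\beta - \beta]$.)
13. Show that $\ker \pi \subset \operatorname{im} \delta$. (Hint: Given $u \in \mathcal{C}_r^k(\overline{M})$, if $u = dv$, $v \in \Lambda^{k-1}(\overline{M})$, show
that $[u] = \delta[j^*v]$.)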
14. Applying (9.67) to $\overline{M} = \overline{B}^{n+1}$, the closed unit ball in $\mathbb{R}^{n+1}$, yields
(9.76) $H^k(\overline{B}^{n+1}) \xrightarrow{\iota} H^k(S^n) \xrightarrow{\delta} H^{k+1}(\overline{B}^{n+1}, S^n) \xrightarrow{\pi} H^{k+1}(\overline{B}^{n+1})$.


Deduce that
$H^k(S^n) \approx H^{k+1}(\overline{B}^{n+1}, S^n)$, for $k \ge 1$,


since by (9.70) the endpoints of (9.76) vanish for $k \ge 1$. Then, by (9.71), there follows
a third demonstration of the computation (9.75) of $H^k(S^n)$.

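15. Using Exercise 3, show that if $\overline{M}$ is connected and $\partial M \ne \emptyset$, the long exact sequence
(9.67) begins with


$0 \to H^0(\overline{M}) \xrightarrow{\iota} H^0(\partial M) \xrightarrow{\delta} H^1(\overline{M}, \partial M) \to \cdots$
and ends with
$\cdots \xrightarrow{\pi} H^{n-1}(\overline{M}) \xrightarrow{\iota} H^{n-1}(\partial M) \xrightarrow{\delta} H^n(\overline{M}, \partial M) \to 0$.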
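16. Define the relative Euler characteristic


$\chi(\overline{M}, \partial M) = \sum_{k \ge 0} (-1)^k \dim H^k(\overline{M}, \partial M)$.


Define $\chi(\overline{M})$ and $\chi(\partial M)$ as in (8.51). Show that
$\chi(\overline{M}) = \chi(\overline{M}, \partial M) + \chi(\partial M)$.


(Hint: Show that, for any exact sequence of the form


$0 \to V_1 \to \cdots \to V_N \to 0$,


with $V_k$ finite-dimensional vector spaces over $\mathbb{R}$, $\sum_k (-1)^k \dim V_k = 0$.)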
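17. Using Poincaré duality show that if $\overline{M}$ is orientable, $n = \dim M$,


$\chi(\overline{M}) = (-1)^n \chi(\overline{M}, \partial M)$.


Deduce that if $n$ is odd and $M$ orientable, $\chi(\partial M) = 2\chi(\overline{M})$.
18. If $N$ is the double of $\overline{M}$, show that


$\dim H^k(N) = \dim H^k(\overline{M}) + \dim H^k(\overline{M}, \partial M)$.
Deduce that if $M$ is orientable and dim $M$ is even, then $\chi(N) = 2\chi(\overline{M})$.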


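In Exercises 19-21, let $\Omega_j$ be compact, oriented manifolds of dimension $n$, with
boundary. Assume that $\partial\Omega_j \ne \emptyset$ and that $\Omega_2$ is connected. Let $F : \Omega_1 \to \Omega_2$ be a
smooth map with the property that $f = F|_{\partial\Omega_1} : \partial\Omega_1 \to \partial\Omega_2$. Recall that we have
defined Deg $f$ in §20 of Chap. 1, when $\partial\Omega_2$ is connected.

19. Let $\sigma \in \Lambda^n(\Omega_2)$ satisfy $\int_{\Omega_2} \sigma = 1$. Show that $\int_{\Omega_1} F^*\sigma$ is independent of the choice
of such $\sigma$, using $H^n(\Omega_j, \partial\Omega_j) = \mathbb{R}$. Compare Lemma 20.6 of Chap. 1. Define


Deg $F = \int_{\Omega_1} F^*\sigma$.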


20. Produce a formula for Deg $F$, similar to (20.16) of Chap. 1, making use of $F^{-1}(y_0)$,
with $y_0 \in \Omega_2$.

21. Prove that Deg $F$ = Deg $f$, assuming $\partial\Omega_2$ is connected.
(Hint: Pick $\omega \in \Lambda^{n-1}(\partial\Omega_2)$ such that $\int_{\partial\Omega_2} \omega = 1$, pick $\tilde\omega \in \Lambda^{n-1}(\Omega_2)$ such that
$j^*\tilde\omega = \omega$, and let $\sigma = d\tilde\omega$. Formulate an extension of this result to cases where $\partial\Omega_2$
has several connected components.)

22. Using the results of Exercises 19-21, establish the "argument principle," used in the
proof of the Riemann mapping theorem in §4. (Hint: A holomorphic map is always
orientation preserving.)


In Exercise 23, we assume that $\overline{M}$ is a compact manifold with boundary, with interior
$M$. Define $H^k(M)$ via the deRham complex, $d : \Lambda^k(M) \to \Lambda^{k+1}(M)$. It is
desired to establish the isomorphism of this with $H^k(\overline{M})$.

23. Let $C$ be a small collar neighborhood of $\partial M$, so $\overline{M}_1 = \overline{M} \setminus C$ is diffeomorphic to
$\overline{M}$. With $j : \overline{M}_1 \to M$, show that the pull-back $j^* : \Lambda^k(M) \to \Lambda^k(\overline{M}_1)$ induces an
isomorphism of cohomology:


$H^k(M) \approx H^k(\overline{M}_1)$.
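(Hint: For part of the argument, it is useful to consider a smooth family


$\varphi_t : \overline{M} \to \overline{M}_t$, $0 \le t \le 1$,


of diffeomorphisms of $\overline{M}$ onto manifolds $\overline{M}_t$, with $\overline{M}_0 = \overline{M}$ and $\varphi_0 = \mathrm{id}$. If $\beta \in
\Lambda^k(M)$ and $d\beta = 0$, and if $\beta_t = \varphi_t^* j^*\beta$, then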


a=h-a(f ialxt at), 


where $X(t)(x) = (d/dt)\varphi_t(x)$. Contrast this with the proof of Proposition 9.11.)


Exercises on spaces of gradient and divergence-free vector fields 


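In this problem set, we will work with the spaces


(9.77) $V_\sigma = \{v \in C^\infty(\overline{M}, \Lambda^1) : \delta v = 0 \text{ on } M,\ \iota_\nu v = 0 \text{ on } \partial M\} \subset H_A^1(M, \Lambda^1)$


and


(9.78) $G = \{dp : p \in H^1(M)\}$.


We assume that $\overline{M}$ is a compact Riemannian manifold with boundary. These are spaces
of 1-forms rather than vector fields, but recall that under the correspondence induced
by the Riemannian metric, $\delta v \leftrightarrow \operatorname{div} V$ and $dp \leftrightarrow \operatorname{grad} p$.

1. Show that $V_\sigma \perp G$.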


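2. Suppose $v \in L^2(M, \Lambda^1)$ is orthogonal to $G$. Show that $\delta v = 0$ on $M$, that $\iota_\nu v$ exists
on $\partial M$, and that $\iota_\nu v = 0$ on $\partial M$, as the identity


$(v, dp)_{L^2} = (\delta v, p)_{L^2} + \int_{\partial M} (\iota_\nu v)\, p \, dS$

is valid under these hypotheses. Conclude that $G^\perp \subset \widetilde{V}_\sigma$, where
$\widetilde{V}_\sigma = \{v \in L^2(M, \Lambda^1) : \delta v = 0 \text{ on } M,\ \iota_\nu v = 0 \text{ on } \partial M\}$.


Show that actually $G^\perp = \widetilde{V}_\sigma$. (Hint: The space $\{dp : p \in C^\infty(\overline{M})\}$ is dense in $G$.)
3. Show that $\iota_\nu\, du|_{\partial M} = 0 \Rightarrow \iota_\nu\, \delta du|_{\partial M} = 0$. (Hint: Use (9.6) and (9.9).)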
4. Show that $v \in L^2(M, \Lambda^1)$ is orthogonal to $G$ if and only if its Hodge decomposition
(9.43) takes the form

$v = \delta d G^A v + P_h^A v$.


(Hint: Show that $\delta d H_A^2(M, \Lambda^1) \perp G$. To see this, use either (9.48) or Exercises 2-3.)
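5. Deduce that


(9.79) $G^\perp = \widetilde{V}_\sigma = \delta d H_A^2(M, \Lambda^1) \oplus \mathcal{H}_1^A = \overline{V_\sigma}$,
where $\overline{V_\sigma}$ denotes the closure of $V_\sigma$ in $L^2(M, \Lambda^1)$, and that the decomposition
(9.80) $L^2(M, \Lambda^1) = G \oplus \overline{V_\sigma}$


is implemented by the Hodge decomposition (9.43), for $k = 1$.
(Hint: $H_A^2(M, \Lambda^1)$ has a dense subspace of smooth forms on $\overline{M}$.)

6. Deduce that if $u \in H^j(M, \Lambda^1)$, then its $L^2$-orthogonal projections onto $G$ and onto $\overline{V_\sigma}$
belong to $H^j(M, \Lambda^1)$, $j \ge 0$.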

7. From Exercise 4, it follows that $dH^1(M) = G = d\delta H_A^2(M, \Lambda^1)$. Establish that in fact


$H^1(M) = \delta H_A^2(M, \Lambda^1) + \mathbb{R}$,
via the Hodge decomposition for 0-forms,
$L^2(M) = \delta d H_A^2(M) \oplus \mathcal{H}_0^A$; $\mathcal{H}_0^A = \mathbb{R}$


(provided $M$ is connected), where $H_A^2(M) = H_A^2(M, \Lambda^0)$ is given by the Neumann
boundary condition. We have $u = \delta d G^A u + P_h^A u$, where


$G^A : H^j(M) \to \{v \in H^{j+2}(M) : \int_M v \, dV = 0\}$


comes from solving the Neumann problem.


10. Isothermal coordinates and conformal structures on 2D 
surfaces 


Let $M$ be an oriented manifold of dimension 2, endowed with a Riemannian metric
$g$. We aim to apply some results on the Dirichlet problem to prove the following
result.


Proposition 10.1. There exists a covering $U_j$ of $M$ and coordinate maps
(10.1) $\varphi_j : U_j \to O_j \subset \mathbb{R}^2$


that are conformal (and orientation preserving).
By definition, a map $\varphi : U \to O$ between two manifolds, with Riemannian
metrics $g$ and $g_0$, is conformal provided


(10.2) $\varphi^* g_0 = \lambda g$,

for some positive $\lambda \in C^\infty(U)$. In (10.1), $O_j$ is of course given the flat metric
$dx^2 + dy^2$. Coordinates (10.1) that are conformal are also called "isothermal
coordinates." It is clear that the composition of conformal maps is conformal, so
if Proposition 10.1 holds, then the transition maps


(10.3) $\psi_{jk} = \varphi_j \circ \varphi_k^{-1} : O_{jk} \to O_{kj}$


are conformal, where $O_{jk} = \varphi_k(U_j \cap U_k)$. This is particularly significant, in view
of the following fact:


Proposition 10.2. An orientation-preserving conformal map
(10.4) $\psi : O \to O'$
between two open domains in $\mathbb{R}^2 = \mathbb{C}$ is a holomorphic map.

One way to see this is with the aid of the Hodge star operator $*$, introduced in
§8, which maps $\Lambda^1(M)$ to $\Lambda^1(M)$ if dim $M = 2$. Note that, for $M = \mathbb{R}^2$, with
its standard orientation and flat metric,


(10.5) $*dx = dy$, $*dy = -dx$.


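Since the action of a map (10.4) on 1-forms is given by


(10.6) $\psi^* dx = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy$, $\psi^* dy = \frac{\partial g}{\partial x}\,dx + \frac{\partial g}{\partial y}\,dy$,


if $\psi(x, y) = (f, g)$, then the Cauchy-Riemann equations


(10.7) $\frac{\partial f}{\partial x} = \frac{\partial g}{\partial y}$, $\frac{\partial g}{\partial x} = -\frac{\partial f}{\partial y}$ (i.e., $*df = dg$)


are readily seen to be equivalent to the commutativity relation
(10.8) $* \circ (\psi^*) = (\psi^*) \circ *$ on 1-forms.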
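As a numerical sanity check of the equivalence between (10.7) and (10.8), the following sketch compares the two sides of (10.8) at a single point. The two test maps ($z \mapsto z^2$ and complex conjugation) and all function names are illustrative choices, not taken from the text.

```python
# 1-forms a dx + b dy on R^2 are stored as pairs (a, b).

def star(form):
    """Hodge star on flat, standardly oriented R^2: *dx = dy, *dy = -dx, as in (10.5)."""
    a, b = form
    return (-b, a)

def pullback(jac, form):
    """Pull back a dx + b dy by (f, g) with Jacobian ((f_x, f_y), (g_x, g_y)), as in (10.6)."""
    (fx, fy), (gx, gy) = jac
    a, b = form
    return (a * fx + b * gx, a * fy + b * gy)

def jac_square(x, y):
    """Jacobian of psi(x, y) = (x^2 - y^2, 2xy), i.e. z -> z^2 (holomorphic)."""
    return ((2 * x, -2 * y), (2 * y, 2 * x))

def jac_conjugate(x, y):
    """Jacobian of psi(x, y) = (x, -y), i.e. complex conjugation (orientation reversing)."""
    return ((1.0, 0.0), (0.0, -1.0))

def commutes(jac, form):
    """True when *(psi^* form) equals psi^*(* form), which is relation (10.8)."""
    return star(pullback(jac, form)) == pullback(jac, star(form))

if __name__ == "__main__":
    point = (0.5, -1.25)
    for form in [(1.0, 0.0), (0.0, 1.0), (2.0, -3.0)]:
        print(form, commutes(jac_square(*point), form), commutes(jac_conjugate(*point), form))
```

The holomorphic map satisfies (10.8) on every sample form, while the conjugation map, which violates the sign in (10.7), does not.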


Thus Proposition 10.2 is a consequence of the following: 
Proposition 10.3. If $M$ is oriented and of dimension 2, then the Hodge star operator
$* : T_p^*M \to T_p^*M$ is conformally invariant.


In fact, in this case, $*$ can be simply characterized as counterclockwise rotation
by 90°, as can be seen by picking a coordinate system centered at $p \in M$ such that
$g_{jk} = \delta_{jk}$ at $p$ and using (10.5). This characterization of $*$ is clearly conformally
invariant.

Thus Proposition 10.1 implies that an oriented, two-dimensional Riemannian 
manifold has an associated complex structure. A manifold of (real) dimension two 
with a complex structure is called a Riemann surface. 

To begin the proof of Proposition 10.1, we note that it suffices to show that, for 
any p € M, there exists a neighborhood U of p and a coordinate map 


(10.9) $\varphi = (f, g) : U \to O \subset \mathbb{R}^2$,


which is conformal. If df(p) and dg(p) are linearly independent, the map (f, g) 
will be a coordinate map on some neighborhood of p, and (f, g) will be conformal 
provided 


(10.10) $*df = dg$.

Note that if $df(p) \ne 0$, then $df(p)$ and $dg(p)$ are linearly independent. Suppose
$f \in C^\infty(U)$ is given. Then, by the Poincaré lemma, if $U$ is diffeomorphic to a
disk, there will exist a $g \in C^\infty(U)$ satisfying (10.10) precisely when

(10.11) $d * df = 0$.

Now, as we saw in §8, the Laplace operator on $C^\infty(M)$ is given by


(10.12) $\Delta f = -\delta\, df = - * d * df$,


when dim M = 2, so (10.11) is simply the statement that f is a harmonic function 
on U. Thus Proposition 10.1 will be proved once we establish the following. 
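To illustrate how (10.10) and (10.11) produce the second coordinate function, here is a minimal sketch in the flat case: given a harmonic $f$, it recovers $g$ by integrating $*df = f_x\,dy - f_y\,dx$ along a segment from the origin. The sample function $f = e^x \cos y$, the midpoint quadrature, and the function names are assumptions made for the example.

```python
import math

def harmonic_conjugate(fx, fy, x, y, steps=2000):
    """Integrate *df = f_x dy - f_y dx along the segment from (0, 0) to (x, y).

    fx, fy are the partial derivatives of a function f that is harmonic on a
    disk about the origin; the result g satisfies dg = *df with g(0, 0) = 0.
    """
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) / steps            # midpoint rule in the parameter t
        px, py = t * x, t * y
        total += (-fy(px, py) * x + fx(px, py) * y) / steps
    return total

# Illustrative choice: f(x, y) = e^x cos y, harmonic for the flat metric.
def fx(x, y):
    return math.exp(x) * math.cos(y)

def fy(x, y):
    return -math.exp(x) * math.sin(y)

if __name__ == "__main__":
    x, y = 0.3, 0.7
    g = harmonic_conjugate(fx, fy, x, y)
    # g should be close to e^x sin y, the classical harmonic conjugate of f.
    print(g, math.exp(x) * math.sin(y))
```

Path independence of the integral is exactly the closedness condition (10.11); on a curved surface the same construction applies with the metric-dependent star operator.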


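Proposition 10.4. There is a neighborhood $U$ of $p$ and a function $f \in C^\infty(U)$
such that $\Delta f = 0$ on $U$ and $df(p) \ne 0$.


Proof. In a coordinate system $x = (x_1, x_2)$, we have


$\Delta f(x) = g(x)^{-1/2}\, \partial_j \big( g^{jk}(x)\, g(x)^{1/2}\, \partial_k f \big)$
$\qquad = g^{jk}(x)\, \partial_j \partial_k f + b^k(x)\, \partial_k f$.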


Pick some coordinate system centered at $p$, identifying the unit disk $D \subset \mathbb{R}^2$ with
some neighborhood $U_1$ of $p$. Now dilate the variables by a factor $\varepsilon$, to map the
small neighborhood $U_\varepsilon$ of $p$ (the image of the disk $D_\varepsilon$ of radius $\varepsilon$ in the original
coordinate system) onto the unit disk $D$. In this dilated coordinate system, we have


(10.13) $\Delta f(x) = g^{jk}(\varepsilon x)\, \partial_j \partial_k f + \varepsilon\, b^k(\varepsilon x)\, \partial_k f$.


Now we define $f = f_\varepsilon$ to be the harmonic function on $U_\varepsilon$ equal to $x_1/\varepsilon$ on $\partial U_\varepsilon$
(in the original coordinate system), hence to $x_1$ on $\partial D$ in the dilated coordinate
system. We need only show that, for $\varepsilon > 0$ sufficiently small, we can guarantee
that $df_\varepsilon(p) \ne 0$.
To see this note that, in the dilated coordinate system, we can write


(10.14) $f_\varepsilon = x_1 - \varepsilon v_\varepsilon$ on $\overline{D}$,
where $v_\varepsilon$ is defined by


(10.15) $\Delta_\varepsilon v_\varepsilon = b^1(\varepsilon x)$ on $D$, $v_\varepsilon|_{\partial D} = 0$,


and $\Delta_\varepsilon$ is given by (10.13). Now the regularity estimates of Theorem 1.3 hold
uniformly in $\varepsilon \in (0, 1]$ in this case, so we have uniform estimates in $H^k(D)$ on
$v_\varepsilon$ as $\varepsilon \to 0$ for each $k$, and consequently uniform estimates on $v_\varepsilon$ in $C^1(\overline{D})$ as
$\varepsilon \to 0$. This shows that $df_\varepsilon(p) \ne 0$ for $\varepsilon$ small and completes the proof.


Having given the 2D Riemannian manifold M the structure of a Riemann sur- 
face, we make further observations about holomorphic functions on M. To start, 
we write the Cauchy-Riemann equations for a holomorphic function f = u + iv 
on M (u, v real) as 


(10.16) $*du = dv$, $*dv = -du$,
or equivalently
(10.17) $*df = -i\, df$.


Here $df = du + i\, dv$ is a complex-valued 1-form on $M$. If we denote by $\mathcal{O}(M)$
the space of holomorphic functions $f : M \to \mathbb{C}$, we have


(10.18) $f \in \mathcal{O}(M) \Longrightarrow df \in \mathcal{O}^1(M)$,
where


(10.19) $\mathcal{O}^1(M)$ = space of complex-valued 1-forms $\beta$ on $M$ such that $d\beta = 0$, $*\beta = -i\beta$.


Note that


(10.20) $\beta \in \mathcal{O}^1(M) \Longrightarrow d{*}\beta = d\beta = 0 \Longrightarrow \Delta\beta = 0$,
where $\Delta$ denotes the Hodge Laplacian on (complex-valued) 1-forms on $M$. The
following is a useful complement to (10.18).


Proposition 10.5. Let $M$ be an oriented 2D Riemannian manifold. If $M$ is connected
and simply connected, and $p \in M$, then


(10.21) $\beta \in \mathcal{O}^1(M)$, $f(z) = \int_p^z \beta \Longrightarrow f \in \mathcal{O}(M)$.


Proof. The formula for $f$ is well defined if $M$ is simply connected, and we have
(10.22) $df = \beta$,


hence (10.19) implies the Cauchy-Riemann equations (10.17).


Let us tie this in with the Hodge theory developed in §8. As in that section, $\mathcal{H}_1$
denotes the space of real-valued harmonic 1-forms on $M$. We have


(10.23) $* : \mathcal{H}_1 \to \mathcal{H}_1$, $** = -I$,


so * endows the real vector space H, with a complex structure. This leads to the 
following. 


Proposition 10.6. Let $M$ be a compact, oriented, 2D Riemannian manifold. Then
(10.24) $\dim_{\mathbb{R}} H^1(M) = \dim_{\mathbb{R}} \mathcal{H}_1 = 2g$
is even.


Proof. In fact, (10.23) gives $\mathcal{H}_1$ the structure of a finite-dimensional complex
vector space. If its complex dimension is $g$, then its real dimension is $2g$.


REMARK. Compare (10.24) with the computation in (B.9) of $H^1(M)$ for a surface
obtained from $S^2$ by adding $g$ handles, established via the Mayer-Vietoris
sequence.


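Let now $\mathcal{H}_1^{\mathbb{C}}$ denote the complexification of the real space $\mathcal{H}_1$, and extend $*$ to
act $\mathbb{C}$-linearly on $\mathcal{H}_1^{\mathbb{C}}$. This is a $\mathbb{C}$-linear isometry satisfying $** = -I$, so $\mathcal{H}_1^{\mathbb{C}}$ decomposes
into its $\mp i$-eigenspaces:


(10.25) $\mathcal{H}_1^{\mathbb{C}} = \mathcal{O}^1(M) \oplus \overline{\mathcal{O}}^1(M)$,
where $\mathcal{O}^1(M)$ is as in (10.19) and


(10.26) $\overline{\mathcal{O}}^1(M) = \{\beta \in \mathcal{H}_1^{\mathbb{C}} : *\beta = i\beta\}$.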
We also have


(10.27) $\mathcal{O}^1(M) = \{\alpha + i{*}\alpha : \alpha \in \mathcal{H}_1\}$, $\overline{\mathcal{O}}^1(M) = \{\alpha - i{*}\alpha : \alpha \in \mathcal{H}_1\}$,


and
(10.28) $\dim_{\mathbb{C}} \mathcal{O}^1(M) = \dim_{\mathbb{C}} \overline{\mathcal{O}}^1(M) = g$.


Elements of $\mathcal{O}^1(M)$ are called Abelian differentials. As seen in §9 of Chapter
10, this space (denoted there by $\mathcal{O}(\kappa)$) plays an important role in the study of
Riemann surfaces, especially in concert with the Riemann-Roch theorem.
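The eigenvalue relations behind (10.26) and (10.27) can be checked pointwise. The sketch below works at a single point of flat $\mathbb{R}^2$, where (10.5) gives the star operator explicitly; the pair representation of 1-forms and the helper names are assumptions made for the example.

```python
# Complex 1-forms a dx + b dy at a point of flat R^2, stored as pairs (a, b).

def star(form):
    """C-linear extension of the Hodge star: *dx = dy, *dy = -dx."""
    a, b = form
    return (-b, a)

def scale(c, form):
    """Multiply a 1-form by the complex scalar c."""
    return (c * form[0], c * form[1])

def add(p, q):
    """Add two 1-forms componentwise."""
    return (p[0] + q[0], p[1] + q[1])

def holomorphic_part(alpha):
    """alpha + i * (star alpha), the first expression in (10.27)."""
    return add(alpha, scale(1j, star(alpha)))

def antiholomorphic_part(alpha):
    """alpha - i * (star alpha), the second expression in (10.27)."""
    return add(alpha, scale(-1j, star(alpha)))

if __name__ == "__main__":
    alpha = (2.0 + 0j, -3.0 + 0j)           # the real form 2 dx - 3 dy
    beta = holomorphic_part(alpha)
    gamma = antiholomorphic_part(alpha)
    print(star(beta) == scale(-1j, beta))   # checks star beta = -i beta
    print(star(gamma) == scale(1j, gamma))  # checks star gamma = +i gamma
```

Only the algebraic identity $** = -I$ is used, so the same computation applies on any oriented surface once an orthonormal coframe is chosen at the point.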


Exercises 


1. Suppose $M$ is an $n$-dimensional, oriented manifold, with metric tensor $g$. Let $g' = e^{2u} g$
be a new metric tensor. Use these two metrics to define Hodge star operators $*$ and $*'$,
respectively. Show that

$*' = e^{(n-2j)u} *$ on $\Lambda^j(M)$.
In particular, if $n = 2k$ is even, $*' = *$ on $\Lambda^k(M)$.

2. Express $\delta' u$ in terms of $\delta u$ and other operators, when $u \in \Lambda^j(M)$, where $\delta'$ is the
analogue of $\delta$ when $g$ is replaced by $g'$. Do the same for $d\delta' u$, $\delta' du$, and $\Delta' u$.

3. Show that if $n = 2$, $u \in \Lambda^0(M)$, then $\Delta u = 0$ if and only if $\Delta' u = 0$.

4. If $M$ is compact and $n = 2k$, show that $u \in \Lambda^k(M)$ is a harmonic form for $g$ if and
only if it is a harmonic form for $g'$.


11. General elliptic boundary problems 


An elliptic differential operator of order $m$ on a manifold $M$ is an operator that in
local coordinates has the form


(11.1) $P(x, D)u = \sum_{|\alpha| \le m} a_\alpha(x) D^\alpha u$,


and whose principal symbol


(11.2) $P_m(x, \xi) = \sum_{|\alpha| = m} a_\alpha(x)\, \xi^\alpha$


is invertible for nonzero $\xi \in \mathbb{R}^n$ ($n = \dim M$). Here $P(x, D)$ could be a $k \times k$
system, or it could map sections of a vector bundle $E_0$ to sections of $E_1$. We will
assume the coefficients of $P(x, D)$ are smooth. If $M$ is the interior of $\overline{M}$, a compact,
smooth manifold with boundary, we require the coefficients to be smooth on
$\overline{M}$, and we also want $P_m(x, \xi)$ to be invertible for $x \in \overline{M}$, $\xi \ne 0$. If $\partial M \ne \emptyset$,
there will be various boundary conditions to study.

First we study interior regularity for solutions to $P(x, D)u = f$. In Chap. 3 we
treated this for constant-coefficient elliptic operators P(D). We will exploit the 
technique (which was used in the proof of Lemma 9.3) of freezing the coefficients 
of P(x, D) and using estimates on the resulting constant-coefficient operators. 
Our interior regularity result is the following. 


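Theorem 11.1. If $P(x, D)$ is elliptic of order $m$ and $u \in \mathcal{D}'(M)$, $P(x, D)u
= f \in H^s_{\mathrm{loc}}(M)$, then $u \in H^{s+m}_{\mathrm{loc}}(M)$, and, for each $U \subset\subset V \subset\subset M$, $\sigma < s + m$,
there is an estimate

(11.3) $\|u\|_{H^{s+m}(U)} \le C\|P(x, D)u\|_{H^s(V)} + C\|u\|_{H^\sigma(V)}$.


For the proof, we can assume $u|_V$ belongs to some Sobolev space, say $u \in
H^\tau(V)$. We will first establish the estimate


(11.4) $\|u\|_{H^\tau(V)} \le C\|P(x, D)u\|_{H^{\tau - m}(V)} + C\|u\|_{H^{\tau - 1}(V)}$.


Once this is done, we will establish $u \in H^{\tau+1}(V)$ if $\tau - m + 1 \le s$, with an
analogous estimate, following in outline the program used in §1.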
If we pick $\chi \in C_0^\infty(V)$, then


(11.5) $P(x, D)(\chi u) = \chi(x) P(x, D)u + Q(x, D)u$,
where $Q(x, D) = [P(x, D), \chi]$ is a differential operator of order $m - 1$, so


$\|Q(x, D)u\|_{H^{\tau - m}(V)} \le C\|u\|_{H^{\tau - 1}(V)}$.


Hence, just as in the chain of reasoning involving (1.22), we can localize the task 
of proving (11.4). We can suppose u € H7(V) is compactly supported and that 
V is an open set in R”, and establish (11.4) in that case. 

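The next step will be to apply cut-offs with very small support, to effect the
freezing of coefficients. Let $\Lambda = \Lambda_\varepsilon$ be the lattice $\varepsilon\mathbb{Z}^n = \{\varepsilon j : j \in \mathbb{Z}^n\}$. Take
a partition of unity on $\mathbb{R}^n$ of the form $1 = \sum_{j \in \mathbb{Z}^n} \chi_j(x)$, with $\chi_j(x) = \chi_0(x - j)$,
$\chi_0(x) \in C_0^\infty(\mathbb{R}^n)$ supported in $-1 < x_\nu < 1$. Then define a partition of unity


(11.6) $1 = \sum_{\lambda \in \Lambda} \chi_\lambda(x)$,


with $\chi_\lambda(x) = \chi_0((x - \lambda)/\varepsilon)$, when $\Lambda = \Lambda_\varepsilon$. We will suppress the dependence
on $\varepsilon$ in the notation, though of course this dependence is very important.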
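One concrete way to realize the partition of unity (11.6) in a single variable is sketched below. The particular bump function and the normalization by the sum of its integer translates are choices made for this illustration; the text only requires some $\chi_0 \in C_0^\infty$ supported in $(-1, 1)$ whose integer translates sum to 1.

```python
import math

def bump(x):
    """Smooth bump supported in -1 < x < 1."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def chi0(x):
    """chi_0(x) = bump(x) / sum_j bump(x - j); its integer translates sum to 1."""
    n = math.floor(x)
    # Only the translates with |x - j| < 1 contribute, so a short window suffices.
    denom = sum(bump(x - j) for j in range(n - 1, n + 3))
    return bump(x) / denom

def chi(lam, eps, x):
    """chi_lambda(x) = chi_0((x - lambda) / eps) for a lattice point lambda in eps*Z."""
    return chi0((x - lam) / eps)

def partition_sum(eps, x):
    """Sum of chi_lambda(x) over the lattice points lambda = eps * j near x."""
    j0 = math.floor(x / eps)
    return sum(chi(eps * j, eps, x) for j in range(j0 - 2, j0 + 4))

if __name__ == "__main__":
    for eps in (1.0, 0.1, 0.01):
        print(eps, partition_sum(eps, 0.37))   # each sum should be close to 1
```

Each $\chi_\lambda$ is supported in a cube of side $2\varepsilon$ about $\lambda$, which is what makes the coefficient differences in the frozen-coefficient remainder of size $O(\varepsilon)$.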


Now, for each $\lambda \in \Lambda$, set


(11.7) $P_\lambda(D) = P(\lambda, D)$.
This is the constant-coefficient elliptic operator obtained by freezing the coefficients
of $P(x, D)$ at $x = \lambda$. If $P(x, D)u = f$, then


(11.8) $\chi_\lambda(x) P_\lambda(D)u = \chi_\lambda f - R_\lambda(x, D)u$,
where


(11.9) $R_\lambda(x, D) = \chi_\lambda(x)\big[P(x, D) - P_\lambda(D)\big] = \chi_\lambda(x) \sum_{|\alpha| \le m} \big[a_\alpha(x) - a_\alpha(\lambda)\big] D^\alpha$.

Therefore

(11.10) $P_\lambda(D)(\chi_\lambda u) = \chi_\lambda f - R_\lambda(x, D)u - Q_\lambda(x, D)u$,

where

(11.11) $Q_\lambda(x, D) = [\chi_\lambda, P_\lambda(D)]$ has order $m - 1$.

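Now the functions $P_\lambda(\xi)$ are all bounded away from zero on a set
(11.12) $\{\xi \in \mathbb{R}^n : |\xi| \ge K\}$.
Thus, taking a cut-off $\varphi(\xi) \in C_0^\infty(\mathbb{R}^n)$, equal to 1 on $|\xi| \le K$, we can set
(11.13) $E_\lambda(\xi) = (1 - \varphi(\xi))\, P_\lambda(\xi)^{-1}$.
Then, as seen in (9.4) of Chap. 3,
(11.14) $E_\lambda(\xi) \in S_1^{-m}(\mathbb{R}^n)$,


which is to say, there are estimates


(11.15) $|D_\xi^\alpha E_\lambda(\xi)| \le C_\alpha \langle\xi\rangle^{-m-|\alpha|}$.
We have
(11.16) $E_\lambda(D) P_\lambda(D) = I + \rho_\lambda(D)$,


with $\rho_\lambda(\xi) \in C_0^\infty(\mathbb{R}^n)$. Furthermore, $E_\lambda$ and $\rho_\lambda$ are bounded in their respective
spaces, for all $\lambda \in \Lambda_\varepsilon$, independently of $\varepsilon \in (0, 1]$. Applying $E_\lambda(D)$ to each side
of (11.10), and summing over $\lambda$, we have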
(11.17) $u = F_\varepsilon - \sum_{\lambda \in \Lambda} \big\{ E_\lambda(D) R_\lambda(x, D)u + E_\lambda(D) Q_\lambda(x, D)u + \rho_\lambda(D)(\chi_\lambda u) \big\}$,
where
(11.18) $F_\varepsilon = \sum_{\lambda \in \Lambda} E_\lambda(D)(\chi_\lambda f)$.
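The parametrix construction (11.13)-(11.16) can be made concrete at the symbol level. The sketch below uses the model symbol $P(\xi) = \xi^2$ in one variable with $K = 1$; the model operator, the cut-off built from a standard smooth step, and all names are assumptions made for the example.

```python
import math

K = 1.0  # illustrative radius: P(xi) = xi^2 is bounded below by 1 on |xi| >= K

def smooth_step(t):
    """Smooth function equal to 0 for t <= 0 and 1 for t >= 1."""
    if t <= 0.0:
        return 0.0
    if t >= 1.0:
        return 1.0
    a = math.exp(-1.0 / t)
    b = math.exp(-1.0 / (1.0 - t))
    return a / (a + b)

def phi(xi):
    """Cut-off equal to 1 on |xi| <= K and vanishing for |xi| >= K + 1."""
    return 1.0 - smooth_step(abs(xi) - K)

def P(xi):
    """Symbol of the model operator D^2 = -d^2/dx^2 (order m = 2)."""
    return xi * xi

def E(xi):
    """Parametrix symbol (1 - phi(xi)) / P(xi), as in (11.13)."""
    cutoff = 1.0 - phi(xi)
    return cutoff / P(xi) if cutoff != 0.0 else 0.0

def rho(xi):
    """Remainder symbol in E(xi) P(xi) = 1 + rho(xi); compare (11.16)."""
    return E(xi) * P(xi) - 1.0

if __name__ == "__main__":
    for xi in (0.0, 0.5, 1.5, 3.0, 10.0):
        weight = (1.0 + xi * xi) * abs(E(xi))   # <xi>^m |E(xi)| with m = 2
        print(xi, rho(xi), weight)
```

Here $\rho = -\varphi$ has compact support, and $\langle\xi\rangle^m |E(\xi)|$ stays bounded, which is the $\alpha = 0$ case of (11.15).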
To prove (11.4), we need (11.15) only for $\alpha = 0$, which implies
(11.19) $E_\lambda(D) : H^\sigma(\mathbb{R}^n) \to H^{\sigma + m}(\mathbb{R}^n)$,


for all $\sigma \in \mathbb{R}$, with norm bound independent of $\lambda$, $\varepsilon$, and $\sigma$. Since at this point $u$
has compact support in $V \subset \mathbb{R}^n$, all our Sobolev norms in the estimates (11.20)-(11.22)
below can be taken to be $H^\gamma(\mathbb{R}^n)$-norms, for various $\gamma$. Looking at the
first term on the right side of (11.17), we see that


(11.20) $\|F_\varepsilon\|_{H^\tau} \le C(\varepsilon, \tau)\|f\|_{H^{\tau - m}}$.


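In view of (11.11) and the compact support of $\rho_\lambda(\xi)$,


(11.21) $\big\| \sum_\lambda E_\lambda(D) Q_\lambda(x, D)u + \rho_\lambda(D)(\chi_\lambda u) \big\|_{H^\tau} \le C(\varepsilon, \tau)\|u\|_{H^{\tau - 1}}$,


and by (11.9), we have


(11.22) $\big\| \sum_\lambda E_\lambda(D) R_\lambda(x, D)u \big\|_{H^\tau} \le C(\tau)\varepsilon\|u\|_{H^\tau} + C(\varepsilon, \tau)\|u\|_{H^{\tau - 1}}$,


where $C(\tau)$ is independent of $\varepsilon$. Thus, when we estimate the $H^\tau$-norm of (11.17),
the term $C(\tau)\varepsilon\|u\|_{H^\tau}$ can be absorbed into the left side, for $\varepsilon > 0$ sufficiently
small. We obtain then the estimate (11.4).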

Passing from (11.4) to $H^{\tau+1}$-regularity of $u$, given $f \in H^{\tau + 1 - m}$, involves
an argument similar to (1.23)-(1.28). Recall we have $u \in H^\tau(\mathbb{R}^n)$, compactly
supported. With the difference operators $D_{j,h}$, defined by (1.23), we apply (11.4)
to $D_{j,h}u$, obtaining


(11.23) $\|D_{j,h}u\|_{H^\tau} \le C\|P(x, D) D_{j,h}u\|_{H^{\tau - m}} + C\|D_{j,h}u\|_{H^{\tau - 1}}$.


As in (1.24), we replace $P(x, D) D_{j,h}$ by $D_{j,h} P(x, D)$ plus the commutator
$[P(x, D), D_{j,h}]$, and use


(11.24) $[a_\alpha(x) D^\alpha, D_{j,h}]u = -(D_{j,h} a_\alpha)\, D^\alpha \tau_{j,h} u$,


where $\tau_{j,h}u(x) = u(x + he_j)$, as in (1.23). Hence
(11.25) $\|[D_{j,h}, P(x, D)]u\|_{H^{\tau - m}} \le C\|u\|_{H^\tau}$.
Thus (11.23) yields
(11.26) $\|D_{j,h}u\|_{H^\tau} \le C\|u\|_{H^\tau} + C\|f\|_{H^{\tau - m + 1}}$,
and hence taking $h \to 0$ gives $u \in H^{\tau + 1}$ and
(11.27) $\|u\|_{H^{\tau + 1}} \le C\|P(x, D)u\|_{H^{\tau - m + 1}} + C\|u\|_{H^\tau}$.


With this advance over (11.4), we have a proof of Theorem 11.1, by a straightforward
iteration.
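The commutator identity (11.24) rests on a discrete product rule for difference quotients, which can be verified directly. The sketch below assumes the normalization $D_h u(x) = (u(x + h) - u(x))/h$ in one variable; the definition (1.23) may include a constant factor, which does not affect the identity.

```python
import math

def tau(u, h):
    """Translation (tau_h u)(x) = u(x + h)."""
    return lambda x: u(x + h)

def diff_quotient(u, h):
    """Difference quotient (D_h u)(x) = (u(x + h) - u(x)) / h (assumed normalization)."""
    return lambda x: (u(x + h) - u(x)) / h

def commutator_defect(a, w, h, x):
    """D_h(a w) - a D_h w - (D_h a) tau_h w evaluated at x; this vanishes identically."""
    aw = lambda t: a(t) * w(t)
    lhs = diff_quotient(aw, h)(x)
    rhs = a(x) * diff_quotient(w, h)(x) + diff_quotient(a, h)(x) * tau(w, h)(x)
    return lhs - rhs

if __name__ == "__main__":
    # Sample coefficient a = sin and function w = exp; the defect is rounding error only.
    print(commutator_defect(math.sin, math.exp, 0.1, 0.4))
```

Taking $w = D^\alpha u$ gives $D_h(a D^\alpha u) - a D^\alpha D_h u = (D_h a) D^\alpha \tau_h u$, which is (11.24) up to the ordering of the commutator; since $D_h a$ is bounded uniformly in $h$ for smooth $a$, the bound (11.25) follows.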

We turn now to boundary conditions. In addition to having the elliptic operator
$P$ on $M$, we suppose there are differential operators $B_j$, $j = 1, \ldots, \ell$, of order
$m_j \le m - 1$, defined on a neighborhood of $\partial M$, and we consider


(11.28) $Pu = f$ on $M$, $B_j u = g_j$ on $\partial M$, $1 \le j \le \ell$.


When $\overline{M}$ is a compact, smooth manifold with boundary, we seek estimates of
the form


(11.29) $\|u\|^2_{H^{m+k}(M)} \le C\|Pu\|^2_{H^k(M)} + C\sum_j \|B_j u\|^2_{H^{m+k-m_j-1/2}(\partial M)} + C\|u\|^2_{H^{m+k-1}(M)}$


and corresponding regularity theorems. Such estimates are called coercive esti- 
mates. 

Applying a cut-off as in (11.5), we see that it suffices to establish the estimate
(11.29) for $u$ supported near $\partial M$, indeed, for $u$ supported in a boundary coordinate
patch.

We now introduce the hypothesis of regularity upon freezing coefficients.
Given $q \in \partial M$, pick a coordinate neighborhood $O$ of $q$, mapped diffeomorphically
onto a compact subset $O'$ of $\overline{\mathbb{R}}^n_+ = \{x \in \mathbb{R}^n : x_n \ge 0\}$. The operators $P$ and
$B_j$ are transformed to operators on functions on $O'$. Now freeze their coefficients
at $q$, obtaining constant-coefficient operators $P_q(D)$ and $B_{jq}(D)$. The hypothesis
of regularity upon freezing coefficients is that there are estimates


(11.30) $\|u\|^2_{H^{m+k}} \le C\|P_q(D)u\|^2_{H^k} + C\sum_j |B_{jq}(D)u|^2_{H^{m+k-m_j-1/2}} + C\|u\|^2_{H^{m+k-1}}$,

valid for smooth $u$ with bounded support in $\overline{\mathbb{R}}^n_+$, with constants $C$ uniform in
$q \in \partial M$. Here, $\|u\|_{H^k} = \|u\|_{H^k(\mathbb{R}^n_+)}$ and $|v|_{H^s} = \|v\|_{H^s(\mathbb{R}^{n-1})}$. The following
result reduces the study of (11.29) to the constant-coefficient case.


Proposition 11.2. Suppose $P$ is elliptic on $\overline{M}$ and the boundary problem
(11.28) satisfies the hypothesis of regularity upon freezing coefficients. Given
$u \in H^m(M)$, $k \in \mathbb{Z}^+$, if $Pu \in H^k(M)$ and $B_j u \in H^{m+k-m_j-1/2}(\partial M)$, then
$u \in H^{m+k}(M)$, and the estimate (11.29) holds.


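To prove this, let $\Lambda = \Lambda_\varepsilon$ be the lattice in $\mathbb{R}^n$ used before, except we restrict
attention to $\lambda \in \overline{\mathbb{R}}^n_+$. We use the partition of unity (11.6). For $P_\lambda(D)(\chi_\lambda u)$, we
still have (11.10), and similarly, if $\lambda \in \mathbb{R}^{n-1} \subset \mathbb{R}^n$,


(11.31) $B_{j\lambda}(D)(\chi_\lambda u) = \chi_\lambda g_j - R_{j\lambda}(x, D)u - Q_{j\lambda}(x, D)u$,
where

(11.32) $R_{j\lambda}(x, D) = \chi_\lambda(x)\big[B_j(x, D) - B_{j\lambda}(D)\big]$

and

(11.33) $Q_{j\lambda}(x, D) = [\chi_\lambda, B_{j\lambda}(D)]$ has order $m_j - 1$.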


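Thus, granted the hypothesis of Proposition 11.2, for each $\lambda \in \mathbb{R}^{n-1} \cap \Lambda_\varepsilon$, we have
an estimate


(11.34) $\|\chi_\lambda u\|^2_{H^{m+k}} \le C\big[ \|\chi_\lambda f\|^2_{H^k} + \|R_\lambda(x, D)u\|^2_{H^k} + \|Q_\lambda(x, D)u\|^2_{H^k} + \|\chi_\lambda u\|^2_{H^{m+k-1}} \big]$
$\qquad + C\sum_j \big[ |\chi_\lambda g_j|^2_{H^{\mu(j,k)}} + |R_{j\lambda}(x, D)u|^2_{H^{\mu(j,k)}} + |Q_{j\lambda}(x, D)u|^2_{H^{\mu(j,k)}} \big]$,


where, in the last three terms, $\mu(j, k) = m + k - m_j - 1/2$. If $\lambda \in \Lambda \setminus \mathbb{R}^{n-1}$, we can
estimate $\|\chi_\lambda u\|^2_{H^{m+k}}$ by the sum of the first three terms on the right. Summing
over $\lambda$, we get an estimate for $\|u\|^2_{H^{m+k}}$. Note that


(11.35) $\sum_\lambda \|R_\lambda(x, D)u\|^2_{H^k} \le C\varepsilon\|u\|^2_{H^{m+k}} + C(\varepsilon)\|u\|^2_{H^{m+k-1}}$,


as in (11.22). Using the trace theorem, we can also estimate the quantity
$\sum_{j,\lambda} |R_{j\lambda}(x, D)u|^2_{H^{\mu(j,k)}}$ by the right side of (11.35), and we can also use
the trace theorem to estimate $\sum_{j,\lambda} |Q_{j\lambda}(x, D)u|^2_{H^{\mu(j,k)}}$. We can absorb the
$C\varepsilon\|u\|^2_{H^{m+k}}$ into the left side, obtaining the estimate (11.29), given $u \in H^{m+k}$.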




To obtain the associated regularity theorem, we use the difference quotients
$D_{j,h}$, $1 \le j \le n - 1$, as in (1.23)-(1.32). Given $u \in H^{m+k}$ while $f \in H^{k+1}$,
$g_i \in H^{m+k-m_i+1/2}$, if we apply (11.29) to $D_{j,h}u$ (localized to have support
in a coordinate patch) and use (11.24) together with the analogous result
for $[D_{j,h}, B_i(x, D)]$, just as in (11.26) and (11.27), we get $D_j u \in H^{m+k}$, for
$1 \le j \le n - 1$, and


(11.36) $\|D_j u\|^2_{H^{m+k}} \le C\|Pu\|^2_{H^{k+1}} + C\sum_i |B_i u|^2_{H^{m+k-m_i+1/2}} + C\|u\|^2_{H^{m+k}}$,


for $1 \le j < n$. From here, as in (1.29)-(1.32), we proceed as follows. We need to
know that $D_n u \in H^{m+k}$, that is,


(11.37) $D^\alpha D_n u \in H^{k+1}$, $|\alpha| \le m - 1$.


Now if $D^\alpha D_n \ne D_n^m$, we can write $D^\alpha D_n u = D_j D^\beta u$, with $|\beta| \le m - 1$,
$1 \le j \le n - 1$, and conclude from (11.36) that this belongs to $H^{k+1}$ with an
appropriate bound. Finally, to estimate $D_n^m u$, we use the PDE $Pu = f$ to write


(11.38) $D_n^m u = a(x) f - \sum_{|\alpha| \le m} b_\alpha(x) D^\alpha u$,


where $D^\alpha \ne D_n^m$ in the last sum, and then estimate the $H^{k+1}$-norm of the right
side of (11.38) by $\|aPu\|_{H^{k+1}}$ plus the right side of (11.36). This completes the
proof of Proposition 11.2.

We now turn to the problem of establishing an estimate of the form (11.30), for
constant-coefficient operators, that is,


(11.39) $\|u\|^2_{H^{m+k}} \le C\|P(D)u\|^2_{H^k} + C\sum_j |B_j(D)u|^2_{H^{m+k-m_j-1/2}} + C\|u\|^2_{H^{m+k-1}}$.


We will take $u \in \mathcal{S}(\overline{\mathbb{R}}^n_+)$, that is, $u$ will be the restriction to $\mathbb{R}^n_+$ of an element
of $\mathcal{S}(\mathbb{R}^n)$. Also, we will assume that $u$ vanishes for $x_n \ge 1$. It is convenient to
relabel the coordinate variables; set $x = (x_1, \ldots, x_{n-1})$, $y = x_n$. We write $P(D)$
in the form


(11.40) $P(D) = \frac{\partial^m}{\partial y^m} + \sum_{j=0}^{m-1} A_j(D_x)\, \frac{\partial^j}{\partial y^j}$, order $A_j(D_x) = m - j$.

We convert $P(D)u = f$ to a first-order system; set $v = (v_1, \ldots, v_m)^t$, with


(11.41) $v_1 = \Lambda^{m-1} u$, $\ldots$, $v_j = \partial_y^{j-1} \Lambda^{m-j} u$, $\ldots$, $v_m = \partial_y^{m-1} u$.
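Then $P(D)u = f$ becomes the system
(11.42) $\frac{\partial v}{\partial y} = K(D_x)v + F$,


where $F = (0, \ldots, 0, f)^t$ and


(11.43) $K = \begin{pmatrix} 0 & \Lambda & & & \\ & 0 & \Lambda & & \\ & & \ddots & \ddots & \\ & & & 0 & \Lambda \\ C_0 & C_1 & C_2 & \cdots & C_{m-1} \end{pmatrix}$,

where
(11.44) $C_j = -A_j(D_x)\, \Lambda^{1-(m-j)}$.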

As in Chap. 4, we define $\Lambda : H^s \to H^{s-1}$ by


(11.45) $(\Lambda u)^\wedge(\xi) = \langle\xi\rangle\, \hat{u}(\xi)$.


Note that the matrix entries of $K$ are not differential operators, though they are
well-behaved Fourier multipliers:


(11.46) $K : H^\sigma(\mathbb{R}^{n-1}) \to H^{\sigma - 1}(\mathbb{R}^{n-1})$.


In fact, $K \in S_1^1(\mathbb{R}^{n-1})$, that is, estimates of the form (11.15) hold for $D^\alpha K(\xi)$,
with $-m$ replaced by 1. This fact will be explored further in Chap. 7. Now let us
note that $K(\xi) = K_1(\xi) + K_0(\xi)$, where $K_0$ is bounded and $K_1(\xi)$ is homogeneous
of degree 1; $K_1(\xi)$ has the form (11.43) with $\Lambda$ replaced by $|\xi|$ and $A_j$
replaced by the principal symbol $\tilde{A}_j(\xi)$, homogeneous of degree $m - j$.


Lemma 11.3. The operator $P(D)$ is elliptic if and only if, for all nonzero $\xi \in
\mathbb{R}^{n-1}$, $K_1(\xi)$ has no purely imaginary eigenvalues.


Proof. Indeed, $\det(i\eta - K_1(\xi)) = P_m(\xi, \eta)$ is the principal symbol of the operator
(11.40) if $P(D)$ is scalar. If $P(D)$ is a $k \times k$ system, the equivalence of
$P_m(\xi, \eta)$ having a nonzero eigenvector in $\mathbb{C}^k$ and of $i\eta - K_1(\xi)$ having a nonzero
eigenvector in $\mathbb{C}^{km}$ follows by the same reasoning as the reduction of $P(D)u = f$
to (11.42).
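Lemma 11.3 can be tried out on two second-order model operators in two variables: the Laplacian $\partial_y^2 + \partial_x^2$, for which $\tilde{A}_0(\xi) = -\xi^2$, and the wave operator $\partial_y^2 - \partial_x^2$, for which $\tilde{A}_0(\xi) = \xi^2$. The sketch below is an illustration with assumed names and assumes the convention $D_x = -i\,\partial_x$.

```python
import cmath

def k1_matrix(a0, a1, r):
    """K_1(xi) for m = 2: [[0, |xi|], [-a0/|xi|, -a1]], where r = |xi| > 0 and
    a0, a1 are the values at xi of the principal parts of A_0 and A_1."""
    return ((0.0, r), (-a0 / r, -a1))

def eigenvalues_2x2(mat):
    """Eigenvalues of a 2x2 matrix from its trace and determinant."""
    (a, b), (c, d) = mat
    tr, det = a + d, a * d - b * c
    root = cmath.sqrt(tr * tr - 4.0 * det)
    return ((tr + root) / 2.0, (tr - root) / 2.0)

def has_imaginary_eigenvalue(mat, tol=1e-12):
    """True when some eigenvalue has (numerically) zero real part."""
    return any(abs(z.real) <= tol for z in eigenvalues_2x2(mat))

if __name__ == "__main__":
    xi = 2.0
    laplace = k1_matrix(-xi * xi, 0.0, abs(xi))   # eigenvalues +|xi| and -|xi|
    wave = k1_matrix(xi * xi, 0.0, abs(xi))       # eigenvalues +i|xi| and -i|xi|
    print(has_imaginary_eigenvalue(laplace), has_imaginary_eigenvalue(wave))
```

The Laplacian passes the test of the lemma, while the wave operator, which is not elliptic, produces purely imaginary eigenvalues.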


We also rewrite the boundary conditions $B_j(D)u = g_j$ at $x_n = 0$. Let


(11.47) $B_j = B_j(D_x, D_y) = \sum_{k \le m_j} b_{jk}(D_x)\, \frac{\partial^k}{\partial y^k}$.

Then we have the boundary conditions


(11.48) $\sum_{k \le m_j} b_{jk}(D_x)\, \Lambda^{k - m_j}\, v_{k+1}(0) = \Lambda^{m - m_j - 1} g_j = h_j$, $1 \le j \le \ell$,


which we write as
(11.49) $B(D_x)v(0) = h$,


with $B(\xi) \in S_1^0(\mathbb{R}^{n-1})$.
The estimate (11.39) translates to the estimate


(11.50) $\|v\|^2_{H^{k+1}} \le C\|Lv\|^2_{H^k} + C|Bv(0)|^2_{H^{k+1/2}} + C\|v\|^2_{H^k}$,
where
(11.51) $Lv = \frac{\partial v}{\partial y} - K(D_x)v$,


and we assume $v \in \mathcal{S}(\overline{\mathbb{R}}^n_+)$, with $v(y) = v(y, \cdot) = 0$ for $y \ge 1$.

We want to decouple (11.42) into a forward evolution equation and a backward
evolution equation. Let $\gamma = \gamma(\xi)$ be a curve in the right half-plane of $\mathbb{C}$,
encircling all the eigenvalues of $K_1(\xi)$ with positive real part, and set


(11.52) $E_0(\xi) = \frac{1}{2\pi i} \int_\gamma \big(\zeta - K_1(\xi)\big)^{-1}\, d\zeta$.


Then $E_0(\xi)$ is smooth on $\mathbb{R}^{n-1} \setminus 0$, homogeneous of degree zero, and, for each
$\xi$, it is a projection onto the sum of the generalized eigenspaces of $K_1(\xi)$ corresponding
to eigenvalues of positive real part, while $I - E_0(\xi)$ similarly captures
the spectrum of $K_1(\xi)$ with negative real part. If we set


(11.53) $A_1(\xi) = (2E_0(\xi) - I)\, K_1(\xi)$,


then $A_1(\xi)$ is homogeneous of degree 1 in $\xi$ and its eigenvalues all have positive
real part, for $\xi \ne 0$. We want to construct a new inner product on $L^2(\mathbb{R}^{n-1})$ with
respect to which $-A_1(D_x)$ is "dissipative."
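The projection (11.52) can be approximated numerically by discretizing the contour integral. The sketch below does this for a $2 \times 2$ matrix with the trapezoid rule on a circle; the example matrix $K_1 = |\xi|\begin{pmatrix}0&1\\1&0\end{pmatrix}$ with $|\xi| = 2$ (the Laplacian in the first-order reduction with $m = 2$), the circle, and the node count are all illustrative assumptions.

```python
import cmath
import math

def inv2(m):
    """Inverse of a 2x2 complex matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return ((d / det, -b / det), (-c / det, a / det))

def matmul2(p, q):
    """Product of two 2x2 matrices."""
    return tuple(
        tuple(sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def spectral_projection(k1, center, radius, nodes=64):
    """Approximate (1 / 2 pi i) times the integral of (zeta - K_1)^(-1) over the circle
    |zeta - center| = radius, using the trapezoid rule in the angle, as in (11.52)."""
    acc = [[0j, 0j], [0j, 0j]]
    for n in range(nodes):
        w = radius * cmath.exp(2j * math.pi * n / nodes)   # w = zeta - center
        zeta = center + w
        res = inv2(((zeta - k1[0][0], -k1[0][1]), (-k1[1][0], zeta - k1[1][1])))
        for i in range(2):
            for j in range(2):
                # d zeta = i w d theta, so each node contributes resolvent * w / nodes
                acc[i][j] += res[i][j] * w / nodes
    return tuple(tuple(row) for row in acc)

def a1_matrix(k1, e0):
    """A_1 = (2 E_0 - I) K_1, as in (11.53)."""
    refl = ((2 * e0[0][0] - 1, 2 * e0[0][1]), (2 * e0[1][0], 2 * e0[1][1] - 1))
    return matmul2(refl, k1)

if __name__ == "__main__":
    k1 = ((0.0, 2.0), (2.0, 0.0))            # eigenvalues +2 and -2
    e0 = spectral_projection(k1, center=2.0, radius=1.0)
    print(e0)                                # close to [[1/2, 1/2], [1/2, 1/2]]
    print(a1_matrix(k1, e0))                 # close to 2 * identity
```

In this example $A_1$ is $|\xi|$ times the identity, so both eigenvalues of $K_1$ have been reflected into the right half-plane, as stated after (11.53).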


Lemma 11.4. Let $M_K^+$ denote the space of complex $K \times K$ matrices with spectrum
in Re $z > 0$, and let $\mathcal{P}_K^+$ be the space of positive-definite, complex $K \times K$
matrices. There is a smooth map


(11.54) $\Phi : M_K^+ \to \mathcal{P}_K^+$,


homogeneous of degree 0, such that
(11.55) $A \in M_K^+$, $P = \Phi(A) \Longrightarrow PA + A^*P \in \mathcal{P}_K^+$.


Proof. First we observe that if $A_0 \in M_K^+$ is fixed, there exists $P_0 \in \mathcal{P}_K^+$ such
that $P_0 A_0 + A_0^* P_0 \in \mathcal{P}_K^+$. To see this, use the Jordan normal form to make $A_0$
similar to $B_\varepsilon$, where $B_\varepsilon$ has the eigenvalues of $A_0$ on its diagonal, $\varepsilon$s and 0s right
above the diagonal, and 0s elsewhere. Pick $\varepsilon$ small compared to the real part of
each eigenvalue. Declaring the basis of $\mathbb{C}^K$ with respect to which $A_0$ takes the
form $B_\varepsilon$ to be orthonormal specifies a new Hermitian inner product on $\mathbb{C}^K$, of the
form $((u, v)) = (P_0 u, v)$, where $(u, v)$ is the standard inner product, and this $P_0$
works.

Thus, given $A \in M_K^+$, the set $\mathcal{P}(A)$ of $P \in \mathcal{P}_K^+$ such that $PA + A^*P \in \mathcal{P}_K^+$
is nonempty. One readily verifies that $\mathcal{P}(A)$ is an open convex set. Furthermore,
given $P \in \mathcal{P}_K^+$, the set of $A \in M_K^+$ such that $PA + A^*P \in \mathcal{P}_K^+$ is open. The
existence of $\Phi$ satisfying the conditions of the lemma now follows by a partition
of unity argument.
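The first step of the proof can be followed on a $2 \times 2$ Jordan-type example. The sketch below takes a real matrix $A_0 = \begin{pmatrix}\lambda & b\\ 0 & \lambda\end{pmatrix}$ with $\lambda > 0$, rescales the second basis vector so that the off-diagonal entry becomes $\varepsilon$, and checks that the resulting $P_0$ works while the standard inner product need not. The sample values $\lambda = 1$, $b = 10$, $\varepsilon = 1$ and the function names are assumptions made for the example.

```python
def rescaled_inner_product(lam, b, eps):
    """P_0 = diag(1, (b/eps)^2): the inner product making the basis e_1, (eps/b) e_2
    orthonormal; in that basis [[lam, b], [0, lam]] becomes [[lam, eps], [0, lam]].
    (lam is unused here but kept so the signature mirrors the matrix being treated.)"""
    return ((1.0, 0.0), (0.0, (b / eps) ** 2))

def lyapunov_form(p, a):
    """P A + A^T P for real 2x2 matrices with P symmetric."""
    pa = tuple(
        tuple(sum(p[i][k] * a[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )
    return tuple(tuple(pa[i][j] + pa[j][i] for j in range(2)) for i in range(2))

def is_positive_definite(q):
    """Sylvester's criterion for a real symmetric 2x2 matrix."""
    return q[0][0] > 0 and q[0][0] * q[1][1] - q[0][1] * q[1][0] > 0

if __name__ == "__main__":
    a0 = ((1.0, 10.0), (0.0, 1.0))
    identity = ((1.0, 0.0), (0.0, 1.0))
    p0 = rescaled_inner_product(1.0, 10.0, 1.0)
    print(is_positive_definite(lyapunov_form(identity, a0)))   # standard inner product
    print(is_positive_definite(lyapunov_form(p0, a0)))         # rescaled inner product
```

In this family the rescaled form is positive definite exactly when $\varepsilon < 2\lambda$, which is the quantitative meaning of "small compared to the real part of each eigenvalue" here.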


Corollary 11.5. Given $A_1(\xi)$ constructed by (11.53), there exists $P_0(\xi)$, smooth
on $\mathbb{R}^{n-1} \setminus 0$, homogeneous of degree 0, such that both $P_0(\xi)$ and $P_0(\xi)A_1(\xi) +
A_1(\xi)^* P_0(\xi)$ are positive-definite, for all $\xi \ne 0$. In fact, for some $a > 0$,


(11.56) $P_0(\xi)A_1(\xi) + A_1(\xi)^* P_0(\xi) \ge a|\xi| I$.


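With $(u, v)$ denoting the inner product in $L^2(\mathbb{R}^{n-1})$, we have


(11.57) $\frac{d}{dy}\big(P_0 \Lambda^{1/2} E_0 v, \Lambda^{1/2} E_0 v\big) = 2\,\mathrm{Re}\big(P_0 \Lambda^{1/2} E_0 \tfrac{\partial v}{\partial y}, \Lambda^{1/2} E_0 v\big)$
$\qquad = 2\,\mathrm{Re}\big(P_0 E_0 K \Lambda^{1/2} v, \Lambda^{1/2} E_0 v\big) + 2\,\mathrm{Re}\big(P_0 \Lambda^{1/2} E_0 F, \Lambda^{1/2} E_0 v\big)$,


given $Lv = F$. Now $E_0 = (2E_0 - I)E_0$ implies $P_0 E_0 K_1 = P_0 A_1 E_0$, so


(11.58) $2\,\mathrm{Re}\big(P_0 E_0 K \Lambda^{1/2} v, \Lambda^{1/2} E_0 v\big) = \big([P_0 A_1 + A_1^* P_0] E_0 \Lambda^{1/2} v, E_0 \Lambda^{1/2} v\big)$
$\qquad + 2\,\mathrm{Re}\big(P_0 \Lambda^{1/2} E_0 K_0 v, \Lambda^{1/2} E_0 v\big)$.
Thus integrating (11.57) over $0 \le y \le 1$ and using (11.56) yield


(11.59) $|E_0 v(1)|^2_{1/2} - |E_0 v(0)|^2_{1/2} \ge C\|E_0 v\|^2_{(0,1)} - C''\int_0^1 |F(y)|_0\, |E_0 v(y)|_1 \, dy - C'\|v\|^2_{(0,1/2)}$,


where, for simplicity of notation, we have set


$|w|_s = \|w\|_{H^s(\mathbb{R}^{n-1})}$,
and we define


(11.60) $\|v\|^2_{(0,s)} = \int_0^1 |v(y)|^2_s \, dy = \int_0^1 \|\Lambda^s v(y)\|^2_{L^2(\mathbb{R}^{n-1})} \, dy$.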

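More generally, it will be convenient to define the Sobolev-like spaces $H_{(k,s)}(\Omega)$,
for $\Omega = [0, 1] \times \mathbb{R}^{n-1}$, by


(11.61) $\|u\|^2_{(k,s)} = \sum_{j=0}^{k} \int_0^1 \|D_y^j \Lambda^{k-j+s} u(y)\|^2_{L^2(\mathbb{R}^{n-1})} \, dy$.


Note that $H_{(k,0)}(\Omega) = H^k(\Omega)$. We also note that the standard trace theorem
generalizes naturally to


(11.62) $\tau : H_{(k,s)}(\Omega) \to H^{k+s-1/2}(\mathbb{R}^{n-1})$, $\tau u(x') = u(0, x')$.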
Changing the constants $C$ and $C'$, we can replace $\|v\|^2_{(0,1/2)}$ in (11.59) by
$\|v\|^2_{(0,\sigma)}$ for any $\sigma < 1$ (e.g., $\sigma \le 0$). Also, we can write


$\int_0^1 |F(y)|_0\, |E_0 v(y)|_1 \, dy \le \frac{\varepsilon}{2}\|E_0 v\|^2_{(0,1)} + \frac{1}{2\varepsilon}\|F\|^2_{(0,0)}$,
and picking $\varepsilon/2 < C/2C''$, obtain from (11.59) the estimate
(11.63) $\|E_0 v\|^2_{(0,1)} + |E_0 v(0)|^2_{1/2} \le C\|Lv\|^2_{(0,0)} + C|E_0 v(1)|^2_{1/2} + C\|v\|^2_{(0,\sigma)}$.
Replacing $v$ by $\Lambda^s v$, we have the estimate

(11.64) $\|E_0 v\|^2_{(0,s+1)} + |E_0 v(0)|^2_{s+1/2} \le C\|Lv\|^2_{(0,s)} + C|E_0 v(1)|^2_{s+1/2} + C\|v\|^2_{(0,\sigma)}$,


for any $\sigma < s + 1$. Similarly, with $E_1 = I - E_0$,


(11.65) $\|E_1 v\|^2_{(0,s+1)} + |E_1 v(1)|^2_{s+1/2} \le C\|Lv\|^2_{(0,s)} + C|E_1 v(0)|^2_{s+1/2} + C\|v\|^2_{(0,\sigma)}$.


Summing the last two estimates, we have


(11.66) $\|v\|^2_{(0,s+1)} + |E_0 v(0)|^2_{s+1/2} + |E_1 v(1)|^2_{s+1/2}$
$\qquad \le C\|Lv\|^2_{(0,s)} + C|E_0 v(1)|^2_{s+1/2} + C|E_1 v(0)|^2_{s+1/2} + C\|v\|^2_{(0,\sigma)}$.
Since $\|v\|^2_{(1,s)} = \|D_y v\|^2_{(0,s)} + \|v\|^2_{(0,s+1)}$, and $\partial v/\partial y = Kv + F$, we hence
arrive at the estimate

(11.67) $\|v\|^2_{(1,s)} + |E_0 v(0)|^2_{s+1/2} + |E_1 v(1)|^2_{s+1/2}$
$\qquad \le C\|Lv\|^2_{(0,s)} + C|E_0 v(1)|^2_{s+1/2} + C|E_1 v(0)|^2_{s+1/2} + C\|v\|^2_{(0,\sigma)}$.

We can now give a natural condition for the estimate (11.50) to hold. In fact,
(11.50) is the $s = 0$ case of the following.


Proposition 11.6. For any $k \in \mathbb{Z}^+$, $s \in \mathbb{R}$, $\sigma < s$, there is an estimate


(11.68) $\|v\|^2_{(k,s)} \le C\|Lv\|^2_{(k-1,s)} + C|Bv(0)|^2_{k+s-1/2} + C\|v\|^2_{(0,\sigma)}$,
for all $v \in \mathcal{S}(\overline{\mathbb{R}}^n_+)$ vanishing for $y \ge 1$, provided that for all $s$ there is an estimate
(11.69) $|g|^2_s \le C|Bg|^2_s + C|E_0 g|^2_s + C|g|^2_{s-1}$,

for all $g \in \mathcal{S}(\mathbb{R}^{n-1})$.


Proof. First take $k = 1$. Since (11.67) holds for $v \in H_{(1,s)}(\Omega)$, substitute $E_0 v$
for $v$ in this estimate. Then $E_1 E_0 v(0) = 0$, and $L E_0 v = (\partial_y - K)E_0 v =
E_0 L v + [E_0, K_0]v$. Thus we obtain


$\|E_0 v\|^2_{(1,s)} + |E_0 v(0)|^2_{s+1/2} \le C\|Lv\|^2_{(0,s)} + C|E_0 v(1)|^2_{s+1/2} + C\|v\|^2_{(0,\sigma)}$,
and hence

(11.70) $|E_0 v(0)|^2_{s+1/2} \le C\|Lv\|^2_{(0,s)} + C|E_0 v(1)|^2_{s+1/2} + C\|v\|^2_{(0,\sigma)}$.
Now use (11.69), with $g = v(0)$ and $s$ replaced by $s + 1/2$, to obtain


(11.71) $|v(0)|^2_{s+1/2} \le C|Bv(0)|^2_{s+1/2} + C\|Lv\|^2_{(0,s)} + C|E_0 v(1)|^2_{s+1/2} + C\|v\|^2_{(0,\sigma)}$.
We can dominate the term $C|E_1 v(0)|^2_{s+1/2}$ on the right side of (11.67) by (11.71),
and if $v(1) = 0$, this yields the $k = 1$ case of (11.68).

Then, making use of $\partial v/\partial y = Kv + Lv$, one gets (11.68) by induction for
$k > 1$. This completes the proof of the proposition.


Now $B(D_x)$ and $E_0(D_x)$ are Fourier multipliers by functions $B(\xi)$ and $E_0(\xi)$.
The latter function is homogeneous of degree 0, while the former belongs to
$S_1^0(\mathbb{R}^{n-1})$. In fact, we can write $B(\xi) = b_0(\xi) + b_r(\xi)$, where $b_0(\xi)$ is homogeneous
of degree 0 and $|b_r(\xi)| \le C\langle\xi\rangle^{-1}$. By the characterization of $H^s(\mathbb{R}^{n-1})$
in terms of behavior of Fourier transforms, we have the following.
Lemma 11.7. Suppose (11.1) is a $k \times k$ system, so $K$, $B$, and $E_0$ act on functions
with values in $\mathbb{C}^\nu$, $\nu = km$. The estimate (11.69) holds if and only if, for each
$\xi \ne 0$, there is no nonzero $v \in \mathbb{C}^\nu$ such that


(11.72) $b_0(\xi)v = 0$ and $E_0(\xi)v = 0$.


Note that this is an “ellipticity condition” for some operators that are not dif- 
ferential operators. This is another point to which we will return in Chap. 7. 

We want to make the condition for regularity even more explicit by relating it
directly to the symbols of $P$ and $B_j$. We establish the following.


Proposition 11.8. For given nonzero $\xi \in \mathbb{R}^{n-1}$, the condition that there is no
nonzero $v \in \mathbb{C}^\nu$ satisfying (11.72) is equivalent to each of the following two
conditions:

(i) There is no nonzero bounded solution on $[0, \infty)$ of the ODE

(11.73)  $\frac{d\varphi}{dy} - K_1(\xi)\varphi = 0, \quad b_0(\xi)\varphi(0) = 0.$

(ii) There is no nonzero bounded solution on $[0, \infty)$ of the ODE

(11.74)  $\frac{d^m\Phi}{dy^m} + \sum_{j=0}^{m-1} \tilde{A}_j(\xi)\frac{d^j\Phi}{dy^j} = 0, \quad \tilde{B}_j(\xi, d/dy)\Phi(0) = 0, \quad 1 \le j \le \ell.$

Here $\tilde{A}_j(\xi)$ is the part of $A_j(\xi)$ of (11.40) homogeneous of degree $m - j$, and
$\tilde{B}_j(\xi, d/dy)$ comes from taking the part of (11.47) homogeneous of degree $m_j$,
and replacing $D_x$ by $\xi$.


Proof. The equivalence of the hypothesis of Lemma 11.7 to (i) comes because
the solution to (11.73) has the form

$\varphi(y) = e^{yK_1(\xi)}\varphi(0),$

and this is bounded for $y \in [0, \infty)$ if and only if $E_0(\xi)\varphi(0) = 0$. The equivalence
of (i) and (ii) arises because (11.74) becomes (11.73) when transformed to a
first-order system.
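
As an illustration of the boundedness criterion, here is a minimal sketch for a model case that is an assumption of this example and not taken from the text: the scalar equation $\Phi'' - r^2\Phi = 0$ with $r = |\xi| > 0$, written as a first-order system with matrix $K = \begin{pmatrix} 0 & 1 \\ r^2 & 0 \end{pmatrix}$. The names `exp_yK`, `E0`, and `sup_norm_on_grid` are hypothetical helpers. Here `E0` is the spectral projection of $K$ onto the eigenvalue $+r$, so initial data in its null space produce the decaying solution.

```python
import math

def exp_yK(y, r):
    """Closed form of exp(y*K) for the model matrix K = [[0, 1], [r**2, 0]], r > 0."""
    c, s = math.cosh(r * y), math.sinh(r * y)
    return [[c, s / r], [r * s, c]]

def apply(m, v):
    """Apply a 2x2 matrix to a vector of length 2."""
    return [m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1]]

def E0(r):
    """Spectral projection of K onto the eigenvalue +r, namely (K + r*I) / (2*r)."""
    return [[0.5, 0.5 / r], [0.5 * r, 0.5]]

def sup_norm_on_grid(phi0, r, ys):
    """Largest component of exp(y*K) phi0 over the sample points ys."""
    return max(max(abs(c) for c in apply(exp_yK(y, r), phi0)) for y in ys)

r = 2.0
ys = [0.5 * j for j in range(21)]                 # sample points in [0, 10]
stable = [1.0, -r]                                # E0 annihilates this vector
generic = [1.0, 0.0]                              # E0 does not annihilate this one
print(apply(E0(r), stable), sup_norm_on_grid(stable, r, ys))
print(apply(E0(r), generic), sup_norm_on_grid(generic, r, ys))
```

The first initial vector stays bounded on the grid, while the second grows like $e^{ry}$, in line with the statement that the solution is bounded exactly when the projection of the initial data vanishes.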


It is also useful to note that we can replace (ii) by:

(ii') There is no nonzero solution to (11.74) that is rapidly decreasing as
$y \to +\infty$,

and make a similar replacement of (i).

Since we want to consider boundary problems for which there will be a
reasonable existence result as well as a regularity result for solutions, it is
natural to consider a further restriction on the boundary condition. Suppose that
$\tilde{B}_j(\xi, d/dy)\Phi(0) \in \mathbb{C}^{\lambda_j}$, so

$b_0(\xi) : \mathbb{C}^\nu \to \mathbb{C}^\lambda, \quad \lambda = \lambda_1 + \cdots + \lambda_\ell.$

Proposition 11.9. For given nonzero $\xi \in \mathbb{R}^{n-1}$, the following three conditions
are equivalent.

(i) Given $\eta \in \mathbb{C}^\lambda$, there exists a unique bounded solution on $[0, \infty)$ to the ODE

(11.75)  $\frac{d\varphi}{dy} - K_1(\xi)\varphi = 0, \quad b_0(\xi)\varphi(0) = \eta.$

(ii) Given $\eta_j \in \mathbb{C}^{\lambda_j}$, there exists a unique bounded solution on $[0, \infty)$ to the
ODE

(11.76)  $\frac{d^m\Phi}{dy^m} + \sum_{j=0}^{m-1} \tilde{A}_j(\xi)\frac{d^j\Phi}{dy^j} = 0, \quad \tilde{B}_j(\xi, d/dy)\Phi(0) = \eta_j, \quad 1 \le j \le \ell.$

(iii) With $V(\xi)$ denoting the null space of $E_0(\xi)$ on $\mathbb{C}^\nu$,

(11.77)  $b_0(\xi) : V(\xi) \to \mathbb{C}^\lambda$ isomorphically.


Proof. The argument here is the same as in the proof of Proposition 11.8. We also
note that if these conditions hold, the unique solutions to (11.75) and (11.76) are
rapidly decreasing as $y \to +\infty$.


If the boundary problem $\{P(D), B_j(D), 1 \le j \le \ell\}$ satisfies the conditions of
Proposition 11.9, it is called a regular boundary problem. More generally, if the
variable-coefficient boundary problem (11.28) for an elliptic operator $P(x, D)$
produces frozen-coefficient problems that satisfy this condition, it is called a
regular boundary problem.

As a useful tool for establishing regularity, note that if

(11.78)  $V(\xi) = \ker E_0(\xi)$ has dimension $\lambda$ for each nonzero $\xi$,

then (11.77) holds if and only if no nonzero $v \in \mathbb{C}^\nu$ satisfies (11.72). Thus, given
(11.78), the conditions of Proposition 11.9 are equivalent to those of Proposition
11.8. Of course, for (11.77) to hold for all $\xi \ne 0$, it is necessary that (11.78)
be true.

We now give some examples of regular elliptic boundary problems. Our list 
will include those studied in §§1, 7, and 9, as well as others, not readily amenable 
to the methods developed there. 

We begin with operators $P(x, D)$ which are strongly elliptic, of order $m = 2\mu$.
By definition, this means

(11.79)  $\mathrm{Re}\, P_m(x, \xi) \ge C|\xi|^m,$

with $C > 0$. If $P$ is a $k \times k$ system, $\mathrm{Re}\, P_m(x, \xi)$ stands for the matrix-valued
function $(P_m(x, \xi) + P_m(x, \xi)^*)/2$. The Dirichlet boundary condition in this case
can be written as follows. Let $\partial/\partial\nu$ denote any vector field defined on a
neighborhood of $\partial M$ and everywhere transverse to $\partial M$. Then the boundary condition
is

(11.80)  $u = g_0, \quad \frac{\partial u}{\partial\nu} = g_1, \quad \ldots, \quad \Big(\frac{\partial}{\partial\nu}\Big)^{\mu-1} u = g_{\mu-1} \quad \text{on } \partial M.$

If $\mu = 1$, this reduces to $u = g_0$ on $\partial M$, as in §1.


Proposition 11.10. If $P$ is a strongly elliptic $k \times k$ system of order $2\mu$, then the
Dirichlet boundary condition is regular.

Proof. Since (11.78) holds in this case, it suffices to check the uniqueness,
namely, that any solution $\Phi$ to (11.74) that is rapidly decreasing as $y \to +\infty$
is 0. To see this, write

(11.81)  $\big(P_m(\xi, i\,d/dy)\Phi, \Phi\big)_{L^2(\mathbb{R}^+)} = \sum_j \big(L_j(\xi, i\,d/dy)\Phi, M_j(\xi, i\,d/dy)\Phi\big)_{L^2(\mathbb{R}^+)},$

where $L_j$ and $M_j$ are differential operators (with coefficients depending on $\xi$)
in $y$ of order $\le \mu$. Then, by Fourier analysis, if $\Phi(0) = \Phi'(0) = \cdots = \Phi^{(\mu-1)}(0) = 0$,
we have (11.81) equal to

(11.82)  $\int P_m(\xi, \eta)\,|\hat{\Phi}(\eta)|^2\, d\eta.$

Here $\hat{\Phi}(\eta)$ is defined by extending $\Phi(y)$ to be zero for $y < 0$. Since

$\mathrm{Re}\, P_m(\xi, \eta) \ge C(|\xi|^m + |\eta|^m),$

if $P_m(\xi, i\,d/dy)\Phi = 0$, this implies $\Phi = 0$, as desired, proving the proposition.


The Dirichlet problem is regular in many additional cases. For example, if
$P(x, D)$ is a scalar elliptic differential operator on $M$, then the Dirichlet problem
is always regular, provided $\dim M \ge 3$. See the exercises.

The next result contains the fact that the Neumann boundary problem for the
Laplace operator is regular. Let $M$ have a smooth Riemannian metric.


Proposition 11.11. If $X$ is a real vector field on $\partial M$ which is everywhere
transversal, then the boundary condition $Xu = g$ on $\partial M$ is regular for the
Laplace operator $\Delta$.


Proof. To freeze coefficients at a point $p \in \partial M$, pick normal coordinates centered
at $p$, with $\partial/\partial y$ coinciding at $p$ with the unit normal given by the metric tensor.
We have (11.78), with $\lambda = 1$. Checking uniqueness of (11.74) comes down to
looking at solutions $\Phi(y)$ to

(11.83)  $\frac{d^2\Phi}{dy^2} - Q(\xi)\Phi = 0, \quad \beta\Phi'(0) + iA(\xi)\Phi(0) = 0,$

which are bounded for $y \in [0, \infty)$. Here $Q(\xi)$ is a positive-definite quadratic form,
$\beta$ is a nonzero constant, and $A(\xi)$ a real linear form in $\xi$. For $\xi \ne 0$, any bounded
solution must be a multiple of $e^{-y\sqrt{Q(\xi)}}$, which has boundary data

(11.84)  $-\beta\sqrt{Q(\xi)} + iA(\xi) \ne 0.$

This proves the proposition.


When $X$ is orthogonal to $\partial M$, this is the Neumann problem; otherwise it is
called an oblique derivative problem. Note that if $\dim M = 2$, so $\xi \in \mathbb{R}^1$, then one
gets a regular elliptic boundary problem for any real vector field that is nowhere
vanishing on $\partial M$, since then (11.84) holds for all $\xi \ne 0$ as long as either $\beta \ne 0$
or $A \ne 0$. Compare Exercises 4-9 in §4 of Chap. 4. However, when $\dim M \ge 3$,
so $\xi \in \mathbb{R}^{n-1}$ with $n - 1 \ge 2$, if $\beta = 0$, then $A(\xi) = 0$ also for $\xi$ in a hyperplane,
so regularity fails then.
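
The dichotomy between $\dim M = 2$ and $\dim M \ge 3$ can be checked directly on the quantity in (11.84). The following is a minimal sketch under assumed model data: $Q(\xi) = |\xi|^2$ and $A(\xi) = a \cdot \xi$ for a fixed real vector $a$. The function name `oblique_symbol` and the sample vectors are hypothetical.

```python
import math

def oblique_symbol(beta, a, xi):
    """Value of -beta*sqrt(Q(xi)) + i*A(xi), with the model choices
    Q(xi) = |xi|^2 and A(xi) = a . xi (assumptions of this sketch)."""
    q = math.sqrt(sum(x * x for x in xi))
    return complex(-beta * q, sum(ai * xj for ai, xj in zip(a, xi)))

# Transversal field (beta != 0): the real part alone keeps the symbol nonzero.
print(oblique_symbol(1.0, (3.0, 0.0), (0.0, 1.0)))

# Tangential field (beta = 0) with n - 1 = 2: the symbol vanishes for xi orthogonal to a.
print(oblique_symbol(0.0, (3.0, 0.0), (0.0, 1.0)))

# Tangential field with n - 1 = 1: a nonzero A(xi) = a*xi cannot vanish for xi != 0.
print(oblique_symbol(0.0, (3.0,), (-2.0,)))
```

In the second call the chosen $\xi$ lies in the hyperplane $\{a \cdot \xi = 0\}$, which is the failure of regularity described above; with a single tangential variable no such nonzero $\xi$ exists.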

We start our next line of analysis with an obvious comment. Namely, the direct 
sum of two regular elliptic boundary problems of (the same) degree m on M is 
also regular. By the same token, if the frozen-coefficient problems all break up 
into direct sums of regular problems, then they are regular, and hence so is the 
variable-coefficient problem that gave rise to them (even though it may not break 
up into such a direct sum). This applies to the Hodge Laplacian $\Delta$ on $\Lambda^k(M)$,
with either relative or absolute boundary conditions. In each case, the
frozen-coefficient problems can clearly be seen to break up into direct sums of Dirichlet
and Neumann problems. This proves:


Proposition 11.12. If $\Delta$ is the Hodge Laplacian on $\Lambda^k(M)$, then both the relative
boundary problem

(11.85)  $\Delta u = f$ on $M$, $\quad j^*u = g_0, \quad j^*\delta u = g_1$ on $\partial M$,

and the absolute boundary problem

(11.86)  $\Delta u = f$ on $M$, $\quad u \rfloor \nu = g_0, \quad (du) \rfloor \nu = g_1$ on $\partial M$,

are regular elliptic boundary problems.


In (11.85), $g_j$ are forms on $\partial M$, of degree $k - j$. In (11.86), $g_j$ are sections of
the subbundles of $\Lambda^{k-1+j}(M)|_{\partial M}$ defined by $g_j \rfloor \nu = 0$.

Sometimes the fact that the Dirichlet problem is regular can be used as a tool to
determine whether another boundary problem is regular. To illustrate this, suppose
$P = P(x, D)$ is a strongly elliptic $k \times k$ system of order 2. Then the ODE in
(11.76) takes the form

(11.87)  $\frac{d^2\Phi}{dy^2} + \tilde{A}_1(\xi)\frac{d\Phi}{dy} + \tilde{A}_0(\xi)\Phi = 0.$

Let us consider a boundary problem of the form

(11.88)  $Pu = f, \quad B_0(x)u = g_0, \quad C_0(x)\frac{\partial u}{\partial y} + C_1(x, D_x)u = g_1.$

Here we use coordinates $(y, x)$ on a collar neighborhood of $\partial M$,

(11.89)  $B_0(x) : \mathbb{C}^k \to \mathbb{C}^{k_1}, \quad C_0(x) : \mathbb{C}^k \to \mathbb{C}^{k_2},$

and $C_1(x, D_x)$ is a first-order differential operator whose coefficients map $\mathbb{C}^k \to \mathbb{C}^{k_2}$.
The hypothesis (11.78) is equivalent to $k_1 + k_2 = k$. To complement (11.87)
and reproduce (11.76), we have

(11.90)  $B_0\Phi(0) = \eta_0, \quad C_0\Phi'(0) + C_1(\xi)\Phi(0) = \eta_1,$

where $B_0 : \mathbb{C}^k \to \mathbb{C}^{k_1}$, $C_0 : \mathbb{C}^k \to \mathbb{C}^{k_2}$, and $C_1(\xi) = \sum_j A_j\xi_j$, with $A_j : \mathbb{C}^k \to \mathbb{C}^{k_2}$.
These arise from freezing the coefficients of $B_0(x)$ and $C_1(x, D_x)$.
Now, for $x \in \partial M$, $\xi \in \mathbb{R}^{n-1} \setminus 0$, we define a map

(11.91)  $\mathcal{B}(x, \xi) : \mathbb{C}^k \to \mathbb{C}^k$

as follows. Given $\varphi \in \mathbb{C}^k$, let $\Phi_\varphi(y)$ be the unique bounded solution to (11.87)
such that $\Phi_\varphi(0) = \varphi$, and then set

(11.92)  $N(x, \xi)\varphi = \partial_y\Phi_\varphi(0),$

and define

(11.93)  $\mathcal{B}(x, \xi)\varphi = \big(B_0(x)\varphi,\ C_0(x)N(x, \xi)\varphi + C_1(x, \xi)\varphi\big).$


The following is an immediate consequence of Propositions 11.9 and 11.10 and 
their proofs. 


Proposition 11.13. If $P$ is a second-order, strongly elliptic $k \times k$ system, then
the boundary problem (11.88) is regular provided that, for all $x \in \partial M$,
$\xi \in \mathbb{R}^{n-1} \setminus 0$, the map $\mathcal{B}(x, \xi)$ in (11.91) is an isomorphism.


Note that the proof of Proposition 11.11 can be regarded as a special case of
this argument, with $k = 1$. Then $\mathcal{B}(x, \xi)$ (with $x$ suppressed) is given by (11.84).
It is appropriate to think of $\mathcal{B}(x, \xi)$ and $N(x, \xi)$ as defined on $T^*(\partial M) \setminus 0$. In
Chap. 7 we will see that $N(x, \xi)$ is the principal symbol of an important
pseudodifferential operator.

To close this section, we say a little more about regularity estimates. There
are advantages in using spaces like $H_{(k,s)}(\Omega)$ to formulate regularity results of a
more general nature than in Proposition 11.2, for regular elliptic boundary
problems. Thus, take a collar neighborhood $\Omega$ of $\partial M$, diffeomorphic to $[0,1) \times \partial M$,
sitting inside a larger collar neighborhood $\mathcal{C}$, diffeomorphic to $[0,2) \times \partial M$. We
use norms on $H_{(k,s)}(\Omega)$, given by

(11.94)  $\|u\|^2_{(0,s)} = \int_0^1 \|u(y, \cdot)\|^2_{H^s(\partial M)}\, dy,$

and more generally

(11.95)  $\|u\|^2_{(k,s)} = \sum_{j=0}^{k} \int_0^1 \|D_y^j u(y, \cdot)\|^2_{H^{k+s-j}(\partial M)}\, dy.$

Norms on $H_{(k,s)}(\mathcal{C})$ are analogously defined. These spaces depend on the choice
of collaring, but that will not cause a difficulty. Techniques used above are readily
extended to prove the following.


Proposition 11.14. If $P(x, D)$ has order $m$ and $\{P(x, D), B_j(x, D), 1 \le j \le \ell\}$
defines a regular elliptic boundary problem, then, given that

(11.96)  $u \in H_{(m,\sigma)}(\mathcal{C}),$

for some $\sigma \in \mathbb{R}$, and given

(11.97)  $P(x, D)u = f \in H_{(k,s)}(\mathcal{C}), \quad B_j(x, D)u \in H^{m+k-m_j-1/2+s}(\partial M),$

it follows that

(11.98)  $u \in H_{(m+k,s)}(\Omega),$

with a corresponding estimate.


Part of the usefulness of this extension of Proposition 11.2 arises from the
following fact.


Proposition 11.15. If $P$ is a differential operator of order $m$, for which $\partial M$ is
noncharacteristic, then, for some $\sigma \in \mathbb{R}$,

(11.99)  $u \in L^2(M), \ Pu = f \in L^2(M) \implies u \in H_{(m,\sigma)}(\Omega).$


Proof. Using an expansion like (11.40) for $P$, we have

(11.100)  $\partial_y^m u = f - \sum_{j=0}^{m-1} A_j(y, x, D_x)\,\partial_y^j u.$

If the hypotheses of (11.99) hold, then

$A_j(y, x, D_x)\,\partial_y^j u \in H^{-j}(I, H^{-m+j}(\partial M)),$

where $I = [0,1]$. A solution $v_j$ to $\partial_y^m v_j = A_j(y, x, D_x)\,\partial_y^j u$ hence belongs to
the space $H^{m-j}(I, H^{-m+j}(\partial M))$, so $u \in H_{(1,-m)}(\Omega)$. Iterating this argument
gives (11.99).

Thus Proposition 11.14 is applicable to such $u \in L^2(M)$. Note that the
boundary value $B_j(x, D)u|_{\partial M}$ is well defined when $u$ satisfies the conclusion
of (11.99).

We stated that part of the point of putting a further restriction on the boundary 
conditions, as in Proposition 11.9, to define a regular elliptic boundary problem, 
is to have an existence result as well as a regularity result. In fact, the following is 
true. 


Proposition 11.16. If $\{P(x, D), B_j(x, D), 1 \le j \le \ell\}$ defines a regular elliptic
boundary problem, with

(11.101)  $P(x, D) : C^\infty(M, E_0) \to C^\infty(M, E_1)$ elliptic of order $m$

and

(11.102)  $B_j(x, D) : C^\infty(M, E_0) \to C^\infty(\partial M, G_j)$ of order $m_j$,

then, for each $k \ge 0$, the map

(11.103)  $T : H^{m+k}(M, E_0) \to H^k(M, E_1) \oplus \bigoplus_{j=1}^{\ell} H^{m+k-m_j-1/2}(\partial M, G_j)$

defined by

(11.104)  $Tu = (P(x, D)u;\ B_1(x, D)u, \ldots, B_\ell(x, D)u)$

is Fredholm.


The estimate (11.29) clearly implies that $T$ has finite-dimensional kernel. Also,
by Proposition 6.7 of Appendix A, the estimate implies that $T$ has closed range.
It remains to show that the range of $T$ has finite codimension.

One way to do this is to construct a right Fredholm inverse of $T$, by the results
of §7 in Appendix A. Pseudodifferential operators, introduced in Chap. 7, form a
convenient tool to do this. At this stage it is convenient to make a weaker
construction, of something that might be called an "approximate Fredholm inverse"
of $T$. The operator we will construct will be called $S$:

(11.105)  $S : H^k(M, E_1) \oplus \bigoplus_{j=1}^{\ell} H^{m+k-m_j-1/2}(\partial M, G_j) \to H^{m+k}(M, E_0).$

The function $u = S(f; g_1, \ldots, g_\ell)$ is to be an "approximate solution" to

$P(x, D)u = f, \quad B_j(x, D)u = g_j.$


To begin, we ignore the boundary condition. Suppose $M \subset \Omega$, on which $P$ is
elliptic. Let $\tilde{f} \in H^k(\Omega, E_1)$ be an extension of $f$. Use a partition of unity to write
$\tilde{f}$ as a sum of terms $\tilde{f}_\nu$ with support in coordinate charts $V_\nu$ on $\Omega$. Then pick a
lattice $\Lambda = \Lambda_\varepsilon$, as in (11.6), and set

(11.106)  $v_\nu = \chi_\nu \sum_{\lambda \in \Lambda} E_\lambda(D)(\chi_\lambda \tilde{f}_\nu),$

where the sum is as in (11.18), and $\chi_\nu \in C_0^\infty(\Omega)$ is equal to 1 on $V_\nu$. Now set
$v = \sum_\nu v_\nu|_M$. Note that $v \in H^{m+k}(M, E_0)$. The arguments yielding such estimates as
(11.20)-(11.23) also give

(11.107)  $\|Pv - f\|_{H^k} \le \varepsilon\|f\|_{H^k} + C(\varepsilon)\|f\|_{H^{k-1}}.$

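Of course, $v$ depends on $\varepsilon$. Let $h_j = B_j(x, D)v|_{\partial M}$.
We want $u = v + w$, where $w$ is an approximate solution to

$P(x, D)w = 0, \quad B_j(x, D)w = g_j - h_j.$

Cover a collar neighborhood of $\partial M$ in $M$ with coordinate charts $V_\nu$, straightened
out to be regarded as regions in $\overline{\mathbb{R}}^n_+$. Write $e_j = g_j - h_j$ as a sum of terms $e_{j\nu}$
supported in $V_\nu \cap \partial M$, using a partition of unity. Again pick a lattice $\Lambda = \Lambda_\varepsilon$. If
$\lambda \in \Lambda \cap \mathbb{R}^{n-1} = \Lambda_0$, we take $w_{\lambda\nu}$ to be the Fourier transform (with respect to $\xi$)
of the solution to (11.76), with $\eta_j(\xi) = \hat{e}_{j\lambda\nu}(\xi)$, where $e_{j\lambda\nu} = \chi_\lambda e_{j\nu}$. Then set

$w = \sum_\nu \chi_\nu \sum_{\lambda \in \Lambda_0} w_{\lambda\nu}.$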

Parallel to (11.107), or to (11.35), we have, for $\varphi = (f; g_1, \ldots, g_\ell)$,

(11.108)  $\|TS\varphi - \varphi\|_{V_k} \le \varepsilon\|\varphi\|_{V_k} + C(\varepsilon)\|\varphi\|_{V_{k-1}},$

where $V_k$ is the range space of $T$ in (11.103), and one obtains $V_{k-1}$ by replacing
the index $k$ by $k - 1$ at each occurrence. Now this estimate implies that the norm
of $[I - TS] \in \mathcal{L}(V_k)/\mathcal{K}(V_k)$ is $\le \varepsilon$. As long as it is $< 1$, we have $[TS]$ invertible
in this quotient algebra, hence $TS$ is a Fredholm operator, with a two-sided
Fredholm inverse $F$. But then $SF$ is a right Fredholm inverse of $T$, and the proof of
Proposition 11.16 is complete.

Recall that in previous sections we have obtained existence results by
different means. Some of these methods will be pushed in the next section, leading to
an independent proof of the surjectivity, or "almost" surjectivity, of $T$ in many
important cases. In §12, the proof above that $T$ in (11.103) has range of finite
codimension will not be used.


Exercises 


1. If $P(x, D)$ is a strongly elliptic operator of order $2m$, show that

$\mathrm{Re}\,(P(x, D)u, u) \ge C\|u\|^2_{H^m(M)} - C'\|u\|^2_{L^2(M)}$

for $u \in C_0^\infty(M)$. (Hint: Use cut-offs as in the proof of Theorem 11.1 to reduce to
an estimate on the quantity Re (P,(D)u,u), where P,(D) is obtained by freezing 
coefficients. Analyze this inner product via Fourier analysis.) 

2. For strongly elliptic $P(x, D)$, show that, for $C$ sufficiently large,

(11.109)  $P(x, D) + C : H_0^m(M) \to H^{-m}(M)$, isomorphically.

3. Parallel arguments of §1 to show that

(11.110)  $P(x, D) + C : H_b^{2m+k}(M) \to H^k(M)$, isomorphically,

where $H_b^\ell(M) = H^\ell(M) \cap H_0^m(M)$. Deduce that

$P(x, D) : H_b^{2m+k}(M) \to H^k(M)$ is Fredholm,

of index zero.

4. As an alternative, show that (11.109) leads to (11.110) via Propositions 11.14-11.15.


In Exercises 5-7, let $P(x, D)$ be a scalar elliptic operator of order $m$, on
$\Omega = [0, 1] \times \partial M$. Let $P_m(x_0, \xi_0, \eta)$ be the principal symbol at $x_0 \in \partial M$,
$\xi_0 \in T^*_{x_0}\partial M \setminus 0$, $\eta \in \mathbb{R}$. Then none of the roots $\eta_1, \ldots, \eta_m$ of
$P_m(x_0, \xi_0, \eta) = 0$ are real. Let

$M^+(x_0, \xi_0, \eta) = \prod_{k=1}^{\ell} \big(\eta - \eta_k(x_0, \xi_0)\big),$

the product being over $k$ such that $\eta_k(x_0, \xi_0)$ have positive imaginary part. Let
$B_j(x_0, \xi_0, \eta)$ be the principal symbol of $B_j(x, D)$.

5. Show that the conditions for regularity of $(P, B_j)$ in Proposition 11.9 are equivalent to
the condition that the set of polynomials in $\eta$

(11.111)  $\{B_j(x_0, \xi_0, \eta) : 1 \le j \le \ell\}$

gives a basis of

(11.112)  $\mathbb{C}[\eta]/(M^+(x_0, \xi_0, \eta)),$

the quotient of the ring $\mathbb{C}[\eta]$ of polynomials in $\eta$, by the ideal generated by
$M^+(x_0, \xi_0, \eta)$.

(Hint: Show that a solution $\Phi$ to (11.76), obtained by freezing coefficients at $x_0$, is
bounded if and only if $M^+(x_0, \xi_0, D_y)\Phi = 0$.)

We say that $P(x, D)$ is properly elliptic provided the degree of $M^+(x_0, \xi_0, \eta)$ in $\eta$ is
independent of $(x_0, \xi_0) \in T^*\partial M \setminus 0$. Evidently, if (11.111) is to provide a basis for
all $(x_0, \xi_0) \in T^*\partial M \setminus 0$, then $P(x, D)$ must be properly elliptic, since the quotient
(11.112) is a vector space whose dimension is the degree of $M^+(x_0, \xi_0, \eta)$.

6. Show that any scalar elliptic $P(x, D)$ is properly elliptic if $\dim M \ge 3$.
(Hint: $\mathbb{R}^{k-1} \setminus 0$ is connected for $k \ge 3$.)
Show that $m = 2\ell$.

7. Show that the Dirichlet problem is regular for any properly elliptic scalar
operator $P(x, D)$ of order $m = 2\mu$. (Hint: Show that $\{1, \eta, \ldots, \eta^{\mu-1}\}$ gives a basis of
$\mathbb{C}[\eta]/(M^+(x_0, \xi_0, \eta))$ under these circumstances.)

8. Consider the following second-order elliptic operator on $\mathbb{R}^2$:

$L = \Big(\frac{\partial}{\partial\bar{z}}\Big)^2 = \frac{1}{4}\Big(\frac{\partial}{\partial x} + i\frac{\partial}{\partial y}\Big)^2.$

Show that $L$ is not properly elliptic. Verify that the Dirichlet problem on the disk for
$L$ is not regular by constructing an infinite-dimensional space of solutions to $Lu = 0$,
$u|_{S^1} = 0$.

9. Let $D = d + \delta$, acting on the space $\bigoplus_j \Lambda^j M$ of forms on $M$. Let $R_0u = \nu \wedge u$ and
$A_0u = \iota_\nu u$, as in (9.11). Show that the boundary problems $\{D, R_0\}$ and $\{D, A_0\}$ are
both regular. Take another look at Exercise 2 in the first set of exercises for §9.


12. Operator properties of regular boundary problems 


We want to extend the existence theory, obtained for the Dirichlet and Neumann 
problems for the Laplace operator in §§1 and 7 and for relative and absolute 
boundary problems for the Hodge Laplacian in §9, to further classes of ellip- 
tic boundary problems. We also study other properties of an elliptic operator 
$P = P(x, D)$, regarded as an unbounded operator on $L^2(M, E_0)$, with domain

(12.1)  $\mathcal{D}(P) = \{u \in H^m(M, E_0) : B_j(x, D)u = 0 \text{ on } \partial M,\ 1 \le j \le \ell\}.$

We begin with strongly elliptic, second-order $k \times k$ systems. Note that, in each
case studied in §§1, 7, and 9, we had (up to sign) the form

(12.2)  $P = D^*D + X,$

where

(12.3)  $D : C^\infty(M, E_0) \to C^\infty(M, E_1)$ has injective symbol,

that is, $\sigma_D(x, \xi) : E_{0x} \to E_{1x}$ is injective, for each $x \in M$, $\xi \ne 0$. If the bundles
$E_j$ are endowed with metrics and $M$ has a Riemannian metric, then $D^*$ is defined,
and $D^*D$ is elliptic. Set $L = D^*D$, so $P = L + X$.

An important tool in the analysis done in §§1, 7, and 9 was Green's formula,
which in this generality can be written as

(12.4)  $(Lu, v) = (Du, Dv) + \frac{1}{i}\int_{\partial M} \langle\sigma_{D^*}(x, \nu)Du, v\rangle\, dS,$

for sufficiently regular sections $u$, $v$ of $E_0$. The boundary integral vanishes for all
$v \in C^\infty(M, E_0)$ if and only if

(12.5)  $\sigma_{D^*}(x, \nu)Du = 0$ on $\partial M$.

The approach to the Neumann boundary problem in §7 started with the fact
that $\|du\|^2_{L^2} + \|u\|^2_{L^2}$ defines the square $H^1(M)$-norm, to establish Proposition
7.1. There exist first-order differential operators $D$ for which the estimate

(12.6)  $\|Du\|^2_{L^2} \ge C\|u\|^2_{H^1} - C'\|u\|^2_{L^2}, \quad u \in H^1(M),$

is true, but not straightforward, as $|Du(x)|$ does not pointwise dominate a
multiple of $|\nabla u(x)|$. There are also first-order elliptic differential operators for which
(12.6) is false. We give here a sufficient criterion for the validity of (12.6).


Proposition 12.1. If (12.5) is a regular elliptic boundary condition for $L = D^*D$,
then the estimate (12.6) holds.


Proof. It is convenient to give this a functional analytic formulation. Let $D_1$ be
the unbounded operator from $L^2(M, E_0)$ to $L^2(M, E_1)$ with domain

(12.7)  $\mathcal{D}(D_1) = \{u \in L^2(M, E_0) : Du \in L^2(M, E_1)\},$

and $D_1u = Du$ for such $u$; $D_1$ is the "maximal" extension of $D$; it is a closed,
densely defined operator. Clearly, $H^1(M, E_0) \subset \mathcal{D}(D_1)$. The estimate (12.6) is
equivalent to

(12.8)  $\mathcal{D}(D_1) = H^1(M, E_0).$

To establish this, we define an unbounded operator $\mathcal{L}$ on $L^2(M, E_0)$ by

(12.9)  $\mathcal{D}(\mathcal{L}) = \{u \in H^2(M, E_0) : \sigma_{D^*}(x, \nu)Du = 0 \text{ on } \partial M\}, \quad \mathcal{L}u = Lu = D^*Du \ \text{ for } u \in \mathcal{D}(\mathcal{L}).$

It is clear that

(12.10)  $\mathcal{D}(\mathcal{L}) \subset \mathcal{D}(D_1^*D_1).$

In fact, an element $u \in L^2(M, E_0)$ belongs to $\mathcal{D}(D_1^*D_1)$ if and only if

(12.11)  $Du \in L^2(M, E_1), \quad D^*Du \in L^2(M, E_0), \quad \text{and} \quad \sigma_{D^*}(x, \nu)Du = 0 \text{ on } \partial M.$

Note that Proposition 11.15 implies that the boundary condition makes sense for
$u \in \mathcal{D}(D_1^*D_1)$. It also implies that the regularity result of Proposition 11.14 is
applicable, so $\mathcal{D}(D_1^*D_1) \subset H^2(M, E_0)$. Hence

(12.12)  $D_1^*D_1 = \mathcal{L}.$

By von Neumann's theorem, $D_1^*D_1$ is automatically self-adjoint; see §8 in
Appendix A. Thus $\mathcal{L}$ is self-adjoint. Furthermore,

(12.13)  $\mathcal{D}(D_1) = \mathcal{D}(\mathcal{L}^{1/2});$

a proof of this is given in §1 of Chap. 8. By interpolation, we have

(12.14)  $\mathcal{D}(\mathcal{L}^{1/2}) \subset H^1(M, E_0),$

establishing $\mathcal{D}(D_1) \subset H^1(M, E_0)$ and hence (12.8).


An important example of this phenomenon is the operator that associates to a
vector field $X$ its deformation tensor, a tensor field of type (0, 2) defined by

(12.15)  $(\mathrm{Def}\, X)(Y, Z) = \frac{1}{2}\langle\nabla_Y X, Z\rangle + \frac{1}{2}\langle\nabla_Z X, Y\rangle;$

in coordinate notation,

(12.16)  $(\mathrm{Def}\, X)_{jk} = \frac{1}{2}(X_{j;k} + X_{k;j}).$

This was introduced in (3.35) of Chap. 2. We have

(12.17)  $\mathrm{Def} : C^\infty(M, T) \to C^\infty(M, S^2T^*).$
If w € T™* corresponds to u € T' via the metric tensor, then 
(12.18) ove(2,é)u= 5(€@ H+ H@§) = Oi. 


We also have 


((u, €)w + (w, €)v), 


NlRe 


(12.19) — oper (2, €)(0Ow) = 


and hence, for $L = \mathrm{Def}^*\mathrm{Def}$,

(12.20)  $\sigma_L(x, \xi)u = \frac{1}{2}\big(|\xi|^2u + \langle\xi, u\rangle\xi\big) = \frac{1}{2}|\xi|^2(I + P_\xi)u,$

where $P_\xi$ is the orthogonal projection parallel to $\xi$, if $T$ and $T^*$ are identified via
the metric tensor.
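
The formula $\sigma_L(x,\xi)u = \frac{1}{2}(|\xi|^2 u + \langle\xi,u\rangle\xi)$ shows that the symbol acts as multiplication by $|\xi|^2$ on multiples of $\xi$ and by $\frac{1}{2}|\xi|^2$ on vectors orthogonal to $\xi$, so it is positive definite for $\xi \ne 0$. A minimal numerical sketch of this, assuming flat Euclidean coordinates (the function names and sample vectors are hypothetical):

```python
def sigma_L(xi, u):
    """(1/2)(|xi|^2 u + <xi,u> xi): the symbol formula for Def*Def, in flat coordinates."""
    q = sum(x * x for x in xi)
    dot = sum(x * y for x, y in zip(xi, u))
    return [0.5 * (q * uj + dot * xj) for uj, xj in zip(u, xi)]

def quadratic_form(xi, u):
    """<sigma_L(xi) u, u>, which is bounded below by (1/2)|xi|^2 |u|^2."""
    return sum(a * b for a, b in zip(sigma_L(xi, u), u))

xi = (3.0, 4.0, 0.0)                         # |xi|^2 = 25
print(sigma_L(xi, (3.0, 4.0, 0.0)))          # u parallel to xi: eigenvalue |xi|^2
print(sigma_L(xi, (0.0, 0.0, 1.0)))          # u orthogonal to xi: eigenvalue |xi|^2 / 2
print(quadratic_form(xi, (1.0, 1.0, 1.0)))   # positive for any nonzero u
```

The two eigenvalues correspond to the decomposition $I + P_\xi = 2P_\xi + (I - P_\xi)$.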


Proposition 12.2. The boundary condition

(12.21)  $\sigma_{\mathrm{Def}^*}(x, \nu)\,\mathrm{Def}\, u = g$

is regular for $L = \mathrm{Def}^*\mathrm{Def}$.


Proof. We will apply Proposition 11.13. For a point $p \in \partial M$, choose local
coordinates so that the normal is $\partial/\partial x_n = \partial/\partial y$. Then the symbol of $D^*D$ is (up
to a factor of 1/2)

(12.22)  $(I + P_n)\eta^2 + (I + P_\xi)|\xi|^2.$

Here we are replacing $\xi \in \mathbb{R}^n$ in (12.20) by $(\xi, \eta)$, $\xi \in \mathbb{R}^{n-1}$. Thus, referring
to the notation of (12.20), $P_n$ stands for $P_{(0,1)}$, and $P_\xi$ here stands for $P_{(\xi,0)}$.
Consequently, the quantity $N(x, \xi)$ used in the proof of Proposition 11.13, and
defined by (11.92), is seen to be


(12.23)  $N(x, \xi) = \big[\alpha P_n + P_n^\perp(I + \beta P_\xi)\big]|\xi|, \quad \alpha = \frac{1}{\sqrt{2}}, \quad \beta = \sqrt{2} - 1.$


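Here $P_n^\perp = I - P_n$. Note that the range of $P_\xi$ is contained in that of $P_n^\perp$, and so
$P_n$, $P_n^\perp$, and $P_\xi$ in (12.23) all commute.


In the present case, $\mathcal{B}(x, \xi)$ has the form $C_0(x)N(x, \xi) + C_1(x, \xi)$. In fact, a
calculation gives

(12.24)  $2\mathcal{B}(x, \xi)\varphi = (I + P_n)N(x, \xi)\varphi + i\sum_{j=1}^{n-1} \varphi_n\xi_je_j,$

where $\{e_j : 1 \le j \le n - 1\}$ is the standard basis of $\mathbb{R}^{n-1}$. In matrix form,

(12.25)  $2\mathcal{B}(x, \xi) = \begin{pmatrix} 2\alpha|\xi| & 0 \\ i\xi & P_n^\perp(I + \beta P_\xi)|\xi| \end{pmatrix}.$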


It is clear that the determinant of the right side of (12.25) is $2\alpha(1 + \beta)|\xi|^n$, so the
asserted regularity follows by Proposition 11.13.


Therefore, Proposition 12.1 yields the following.

Corollary 12.3. If $M$ is a compact Riemannian manifold with boundary, then

(12.26)  $\|X\|_{H^1(M)} \le C\|\mathrm{Def}\, X\|_{L^2(M)} + C\|X\|_{L^2(M)},$

for all smooth vector fields $X$ on $M$.


This is called Korn’s inequality and is useful in elasticity theory. 
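
The lower-order term $C\|X\|_{L^2(M)}$ in this inequality cannot be dropped in general, since an infinitesimal rotation has vanishing deformation tensor. The following is a minimal sketch of this, assuming a flat domain in $\mathbb{R}^2$ (so covariant derivatives are ordinary partial derivatives) and using central differences; the helper `deformation_tensor` and the sample fields are hypothetical.

```python
def deformation_tensor(X, p, h=1e-5):
    """Central-difference approximation of (Def X)_{jk} = (d_k X_j + d_j X_k)/2
    at the point p, for a vector field X on flat R^n (an assumption of this sketch)."""
    n = len(p)

    def partial(k):
        # Approximate the derivative of every component of X in the k-th direction.
        plus, minus = list(p), list(p)
        plus[k] += h
        minus[k] -= h
        xp, xm = X(plus), X(minus)
        return [(xp[j] - xm[j]) / (2 * h) for j in range(n)]

    grad = [partial(k) for k in range(n)]        # grad[k][j] = d_k X_j
    return [[0.5 * (grad[k][j] + grad[j][k]) for k in range(n)] for j in range(n)]

rotation = lambda p: [-p[1], p[0]]   # infinitesimal rotation: Def X = 0 although X != 0
shear = lambda p: [p[1], 0.0]        # shear field: off-diagonal entries of Def X equal 1/2

print(deformation_tensor(rotation, [0.3, -0.7]))
print(deformation_tensor(shear, [0.3, -0.7]))
```

For the rotation field the right side of (12.26) reduces to $C\|X\|_{L^2}$, so an estimate without that term would force $X = 0$.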
We have the following Fredholm result. 


Proposition 12.4. If $P$ is given by (12.2) and if (12.5) is a regular boundary
condition for $D^*D$, hence for $P$, then for $k \ge 0$, the operator

(12.27)  $T : H^{k+2}(M, E_0) \to H^k(M, E_0) \oplus H^{k+1/2}(\partial M, E_0)$

given by

(12.28)  $Tu = \big(Pu,\ \sigma_{D^*}(x, \nu)Du|_{\partial M}\big)$

is Fredholm.


Proof. Let $H_B^{k+2} = \{u \in H^{k+2}(M, E_0) : B_1u = 0\}$, where
$B_1u = B_1(x, D)u = \sigma_{D^*}(x, \nu)Du|_{\partial M}$. From the proof of Proposition 12.1, we
know that

(12.29)  $\mathcal{L} + I : H_B^2 \to L^2(M, E_0)$ is bijective,

since $H_B^2 = \mathcal{D}(\mathcal{L})$. By elliptic regularity,

(12.30)  $L + I : H_B^{k+2} \to H^k(M, E_0)$ is bijective.

Now $P$ differs from $L + I$ by a compact operator $K : H_B^{k+2} \to H^k(M, E_0)$, so

(12.31)  $P : H_B^{k+2} \to H^k(M, E_0)$ is Fredholm.

The Fredholmness of $T$ is an easy consequence.


To return to the study of (12.2)-(12.4), we have the following solvability result.


Proposition 12.5. If $D$ satisfies (12.3) and $B_1$ is given by the left side of (12.5),
then, with $L = D^*D$,

(12.32)  $(L + 1) \oplus B_1 : H^{k+2}(M) \to H^k(M) \oplus H^{k+1/2}(\partial M),$

isomorphically, and, if $P = L + X$, $X$ of order 1,

(12.33)  $P \oplus B_1 : H^{k+2}(M) \to H^k(M) \oplus H^{k+1/2}(\partial M)$

is Fredholm, of index zero.


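We next look at existence results for oblique derivative problems for the
Laplace operator, namely,

(12.34)  $\Delta u = f$ on $M$, $\quad \Big(\frac{\partial}{\partial\nu} + X\Big)u = g$ on $\partial M$,

where $X$ is a first-order differential operator of the form $Xu = Yu + \varphi u$, with
$Y$ a real vector field tangent to $\partial M$, $\varphi \in C^\infty(\partial M)$, real. Here we will take the
Green identity

(12.35)  $(\Delta u, v) = (u, \Delta v) + \beta(u, v),$

with

(12.36)  $\beta(u, v) = \int_{\partial M} \Big(\frac{\partial u}{\partial\nu}\,\bar{v} - u\,\frac{\partial\bar{v}}{\partial\nu}\Big)\, dS,$

and rewrite this boundary term as

(12.37)  $\beta(u, v) = \int_{\partial M} \Big\{\Big(\frac{\partial u}{\partial\nu} + Xu\Big)\bar{v} - u\,\overline{\Big(\frac{\partial v}{\partial\nu} + X^tv\Big)}\Big\}\, dS,$

where $X^t$ is the formal adjoint of $X$, with respect to the $L^2(\partial M)$-inner product,
that is,

$X^tu = -Yu + (\varphi - \mathrm{div}\, Y)u,$

where the divergence is taken with respect to surface measure $dS$ on $\partial M$.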
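
The formula $X^tu = -Yu + (\varphi - \mathrm{div}\,Y)u$ can be checked numerically in the simplest setting. The sketch below assumes that $\partial M$ is the unit circle with arclength parameter $t$, that $Y = a(t)\,d/dt$ (so $\mathrm{div}\,Y = a'(t)$), and uses hypothetical sample functions $a$, $\varphi$, $u$, $v$; none of these choices come from the text.

```python
import math

def integrate_periodic(f, n=256):
    """Trapezoid rule on [0, 2*pi); highly accurate for smooth periodic integrands."""
    h = 2 * math.pi / n
    return h * sum(f(j * h) for j in range(n))

# Hypothetical data on the unit circle.
a = lambda t: 2.0 + math.sin(t)        # coefficient of the tangent field Y = a(t) d/dt
da = lambda t: math.cos(t)             # div Y = a'(t)
phi = lambda t: math.cos(t)            # zero-order coefficient of X
u, du = (lambda t: math.cos(t)), (lambda t: -math.sin(t))
v, dv = (lambda t: 1.0 + math.cos(2 * t)), (lambda t: -2.0 * math.sin(2 * t))

def X(w, dw):
    """Xw = Yw + phi*w."""
    return lambda t: a(t) * dw(t) + phi(t) * w(t)

def Xt(w, dw):
    """X^t w = -Yw + (phi - div Y) w, the formal adjoint on the circle."""
    return lambda t: -a(t) * dw(t) + (phi(t) - da(t)) * w(t)

lhs = integrate_periodic(lambda t: X(u, du)(t) * v(t))    # integral of (Xu) v
rhs = integrate_periodic(lambda t: u(t) * Xt(v, dv)(t))   # integral of u (X^t v)
print(lhs, rhs)
```

For these particular functions both integrals equal $\pi$, as one can confirm by hand from $Xu = -2\sin t + \cos 2t$ and $X^tv = 2(2 + \sin t)\sin 2t$.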
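We define two unbounded operators on $L^2(M)$, denoted $\mathcal{L}_1$, $\mathcal{L}_2$. These are
defined to be $-\Delta$ on their domains, which we specify to be

(12.38)  $\mathcal{D}(\mathcal{L}_1) = \Big\{u \in H^2(M) : \frac{\partial u}{\partial\nu} + Xu = 0 \text{ on } \partial M\Big\}, \quad \mathcal{D}(\mathcal{L}_2) = \Big\{u \in H^2(M) : \frac{\partial u}{\partial\nu} + X^tu = 0 \text{ on } \partial M\Big\}.$


Proposition 12.6. The operators $\mathcal{L}_j$ have the relation

(12.39)  $\mathcal{L}_1^* = \mathcal{L}_2,$

where $\mathcal{L}_1^*$ is the Hilbert space adjoint of $\mathcal{L}_1$. Furthermore, with
$Vu = \big((\partial u/\partial\nu) + Xu\big)|_{\partial M}$, we have

(12.40)  $-\Delta \oplus V : H^{k+2}(M) \to H^k(M) \oplus H^{k+1/2}(\partial M)$

is Fredholm, of index zero, and the annihilator of the image, a priori in
$H^k(M)^* \oplus H^{-k-1/2}(\partial M)$, is a finite-dimensional subspace of $C^\infty(M) \oplus C^\infty(\partial M)$ which
is independent of $k \ge 0$.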


Proof. To start, suppose $v \in \mathcal{D}(\mathcal{L}_1^*)$, that is, $v \in L^2(M)$, and the map
$\mathcal{D}(\mathcal{L}_1) \ni u \mapsto (\mathcal{L}_1u, v)$ satisfies an estimate

(12.41)  $|(\Delta u, v)| \le C(v)\|u\|_{L^2(M)}.$

By (12.35) and (12.36), this can happen if and only if $\Delta v \in L^2(M)$ (hence its
boundary data are well defined), and $\beta(u, v) = 0$ for all $u \in \mathcal{D}(\mathcal{L}_1)$, hence

(12.42)  $\int_{\partial M} u\,\overline{\Big(\frac{\partial v}{\partial\nu} + X^tv\Big)}\, dS = 0, \quad \text{for all } u \in \mathcal{D}(\mathcal{L}_1).$

Since there exist $u \in \mathcal{D}(\mathcal{L}_1)$ for which $u|_{\partial M}$ is an arbitrary element of $C^\infty(\partial M)$,
we see that $(\partial v/\partial\nu) + X^tv = 0$ on $\partial M$. Now this is a regular boundary problem
for $\Delta$, and Proposition 11.14 applies, to give $v \in \mathcal{D}(\mathcal{L}_2)$. Clearly, $\mathcal{L}_1^* \supset \mathcal{L}_2$, so
this proves (12.39). By the same reasoning, $\mathcal{L}_2^* = \mathcal{L}_1$.

Now consider the map

(12.43)  $\Delta : \mathcal{D}(\mathcal{L}_1) \to L^2(M).$

We know it has closed range, $R(\mathcal{L}_1)$. Let $V \subset L^2(M)$ be its orthogonal
complement. Then, by definition, $V \subset \mathcal{D}(\mathcal{L}_1^*)$ and $\mathcal{L}_1^* = 0$ on $V$. Since we know
$\mathcal{L}_1^* = \mathcal{L}_2$, the regularity estimates on $\mathcal{L}_2$ imply that

(12.44)  $R(\mathcal{L}_1)^\perp$ is a finite-dimensional subspace of $C^\infty(M)$.

From this we deduce that, for $k = 0$, the range of $-\Delta \oplus V$ in (12.40), which
we know to be closed, has orthogonal complement which is a finite-dimensional
subspace $W \subset C^\infty(M) \oplus C^\infty(\partial M)$. Then elliptic regularity implies that, for
any $k \in \mathbb{Z}^+$, the annihilator of $W$ in $H^k(M) \oplus H^{k+1/2}(\partial M)$, which we know is
in the range of $-\Delta \oplus V$ acting on $H^2(M)$, must be in the range of this operator
acting on $H^{k+2}(M)$. Consequently, the annihilator of the range of $-\Delta \oplus V$ in
(12.40) is exactly $W$.

As for the index in (12.40), note that, if $V_s = \partial/\partial\nu + sX$, $s \in [0,1]$, then
$-\Delta \oplus V_s$ is a continuous family of Fredholm operators, on which the index is
constant. At $s = 0$ we have the Neumann boundary condition, which has index
zero, so Proposition 12.6 is proved.


The method used above for the oblique derivative problem extends to many
other situations. For example, suppose $P(x, D)$ is a scalar elliptic operator, of
order $m$, and $B_1(x, D), \ldots, B_\ell(x, D)$ scalar operators defining boundary
conditions, each of distinct orders $m_j < m$, and each noncharacteristic on $\partial M$. As
indicated in Exercises 5-7 of §11, $P(x, D)$ must be "properly elliptic," of order
$m = 2\ell$, and there is an algebraic characterization of regularity. Let $P^t(x, D)$
denote the formal adjoint of $P(x, D)$.


Proposition 12.7. If $\{P(x, D), B_j(x, D), 1 \le j \le \ell\}$ is a (scalar) regular
elliptic boundary problem of the form above, then there are boundary operators
$B_j^t(x, D)$ such that $\{P^t(x, D), B_j^t(x, D), 1 \le j \le \ell\}$ is a regular elliptic
boundary problem, and such that, given $v \in L^2(M)$, $P^t(x, D)v \in L^2(M)$,

$(P(x, D)u, v) = (u, P^t(x, D)v),$

for all $u \in C^\infty(M)$ satisfying $B_j(x, D)u = 0$ on $\partial M$, $1 \le j \le \ell$, if and only if
$B_j^t(x, D)v = 0$ on $\partial M$, $1 \le j \le \ell$.


A proof of this can be found in [Sch], pp. 224-237. A related discussion is
given in [Ag], pp. 134-151. The reader can try it as an exercise. Once this result
is demonstrated, the arguments used above also establish the following.


Proposition 12.8. For the regular boundary problem $\{P(x, D), B_j(x, D), 1 \le j \le \ell\}$
above, if we define $\mathcal{P}$ and $\mathcal{P}^t$, closed unbounded operators on $L^2(M)$, to
be $P(x, D)$ and $P^t(x, D)$, respectively, on domains

(12.45)  $\mathcal{D}(\mathcal{P}) = \{u \in H^m(M) : B_j(x, D)u = 0 \text{ on } \partial M\}, \quad \mathcal{D}(\mathcal{P}^t) = \{u \in H^m(M) : B_j^t(x, D)u = 0 \text{ on } \partial M\},$

then

(12.46)  $\mathcal{P}^* = \mathcal{P}^t,$

where $\mathcal{P}^*$ is the Hilbert space adjoint of $\mathcal{P}$. Furthermore, with

$Tu = (P(x, D)u;\ B_1(x, D)u, \ldots, B_\ell(x, D)u),$

we have

(12.47)  $T : H^{k+m}(M) \to H^k(M) \oplus \bigoplus_{j=1}^{\ell} H^{k+m-m_j-1/2}(\partial M)$ Fredholm.


We leave it to the reader to consider extensions of these last results to systems,
or elliptic operators on sections of vector bundles. As the examples of relative
and absolute boundary problems for the Hodge Laplacian illustrate, one natural
variant for the noncharacteristic hypothesis on $B_j(x, D)$ made above is that, for
$x_0 \in \partial M$,

$\sigma_{B_j}(x_0, \nu) : E_{0x_0} \to G_{jx_0}$ is surjective,

where $E_0$, $G_j$ are the vector bundles used in (11.101) and (11.102).

We postpone until the beginning of Chap. 12 a treatment of natural boundary
conditions arising for "elliptic complexes" other than the deRham complex. As
we will see, in other cases one need not get regular boundary problems.


Exercises 


In Exercises 1-3, we study the oblique derivative problem

$\Delta u = f$ on $M$, $\quad \frac{\partial u}{\partial\nu} + Xu = g$ on $\partial M$,

where $Xu = Yu + \varphi u$, as in (12.34), and the associated map

$T = -\Delta \oplus V : H^{k+2}(M) \to H^k(M) \oplus H^{k+1/2}(\partial M)$

of (12.40). Assume $M$ is connected and $\partial M \ne \emptyset$. This problem was treated via Fourier
analysis in the case where $M$ is the disk in $\mathbb{R}^2$, in Exercises 4-9 of §4, Chap. 4.

1. If $\varphi = 0$, show that $\ker T$ is the one-dimensional space of constants.

2. If $\varphi \ge 0$ on $\partial M$, $\varphi$ not identically zero, show that $\ker T = 0$. Deduce that $T$ is
surjective in this case. (Hint: Use Zaremba's principle, from §2.)
Our convention here is that $\nu$ is the outward-pointing unit normal to $\partial M$.

3. Give examples where $\varphi$ changes sign and $\ker T$ has dimension 1. Can you make $\ker T$
have dimension greater than 1?

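4. In linear elasticity, one considers the elliptic operator $L$ on vector fields on $\Omega \subset \mathbb{R}^n$,
defined by

(12.48)  $Lu = \mu\Delta u + (\lambda + \mu)\,\mathrm{grad}\,\mathrm{div}\, u,$

with boundary condition

(12.49)  $\sum_j \nu_j\sigma_{jk} = 0$ on $\partial\Omega$, $\quad \sigma_{jk} = \lambda(\mathrm{div}\, u)\delta_{jk} + \mu\Big(\frac{\partial u_j}{\partial x_k} + \frac{\partial u_k}{\partial x_j}\Big),$

where $\nu_j$ are the components of the normal vector $\nu$. For what values of $\lambda, \mu \in \mathbb{R}$ is this
a regular elliptic boundary problem? Show that, for such values, one gets a self-adjoint
operator.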

5. Let $M$ be a compact Riemannian manifold with boundary. Consider the functional

$K(u) = \int_M f(x, \mathrm{Def}\, u(x))\, dV, \quad f(x, A) = 2\mu\,\mathrm{Tr}\, A^2 + \lambda(\mathrm{Tr}\, A)^2,$

arising in linear elasticity. Show that

$DK(u)w = 4\mu(\mathrm{Def}\, u, \mathrm{Def}\, w) + 2\lambda(\mathrm{div}\, u, \mathrm{div}\, w) = -(Lu, w) + \int_{\partial M} \langle Bu, w\rangle\, dS,$

where (compare formula (4.26) in Chap. 10 and (4.3)-(4.4) in Chap. 17)

$Lu = \mu\Delta u + (\lambda + \mu)\,\mathrm{grad}\,\mathrm{div}\, u + 2\mu\,\mathrm{Ric}\, u,$
$Bu = \lambda(\mathrm{div}\, u)\nu + 2\mu\,\sigma_{\mathrm{Def}^*}(x, \nu)\,\mathrm{Def}\, u.$

Show that this leads to the boundary condition (12.49).
6. Let $\Omega$ be a smooth, bounded region in $\mathbb{R}^n$, and let $P_1(\xi), \dots, P_k(\xi)$ be (scalar) polynomials, homogeneous of degree $m$ in $\xi$. Show that there is an estimate

(12.50) $\|u\|_{H^m(\Omega)}^2 \le C \sum_j \|P_j(D)u\|_{L^2(\Omega)}^2 + C \|u\|_{L^2(\Omega)}^2,$

for all $u \in H^m(\Omega)$, if and only if $P_1(\zeta), \dots, P_k(\zeta)$, as polynomials in $\zeta \in \mathbb{C}^n$, have no common zeros, except for $\zeta = 0$.

Remarks: Under the hypothesis of no common zeros for $\{P_j(\zeta)\}$ in $\mathbb{C}^n \setminus 0$, there exists $M$ such that, for each $\alpha$ with $|\alpha| = M$,

$$\xi^\alpha = \sum_j A_{j\alpha}(\xi) P_j(\xi),$$

for some polynomials $A_{j\alpha}(\xi)$, homogeneous of degree $M - m$. To prove that such an estimate holds when $\{P_j(\xi) : j \le k\} = \{\xi^\alpha : |\alpha| = m\}$, an inductive approach can be taken. This would yield a variant of (12.50). See Agmon [Ag] for further discussion.

7. As noted in the remarks after Proposition 11.11, for $P = \Delta$ on $M$, of dimension 2, the boundary problem $B_1 u = g$ on $\partial M$, with $B_1 u = Xu$, $X$ any nowhere-vanishing real vector field, possibly tangent to $\partial M$ at points, is regular. Then the noncharacteristic hypothesis of Proposition 12.7 fails. Can you extend Propositions 12.7 and 12.8 to treat this case?


A. Spaces of generalized functions on manifolds with 
boundary 


Let $M$ be a compact manifold with smooth boundary. We will define a one-parameter family of spaces of functions and “generalized functions” on $M$, analogous to the Sobolev spaces defined when $\partial M = \emptyset$. The spaces will be defined in terms of a Laplace operator $\Delta$ on $M$, and a boundary condition for the Laplace operator. We will explicitly discuss only the Dirichlet boundary condition, though the results given work equally well for other coercive boundary conditions yielding self-adjoint operators, such as the Neumann boundary condition.
Fixing on the Dirichlet boundary condition, let us recall from (1.7) the map

(A.1) $T : H^{-1}(M) \longrightarrow H_0^1(M),$

inverting the Laplace operator

(A.2) $\Delta : H_0^1(M) \longrightarrow H^{-1}(M).$

The restriction of $T$ to $L^2(M)$ is compact and self-adjoint, and we have an orthonormal basis of $L^2(M)$ consisting of eigenfunctions:

(A.3) $u_j \in H_0^1(M) \cap C^\infty(\overline{M}), \quad T u_j = -\mu_j u_j, \quad \Delta u_j = -\lambda_j u_j,$

where $\mu_j \searrow 0$, $0 < \lambda_j \nearrow \infty$.
For a given $v \in L^2(M)$, set

(A.4) $v = \sum_j \hat{v}(j)\, u_j, \quad \hat{v}(j) = (v, u_j).$


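Now, for $s \ge 0$, we define

(A.5) $\mathcal{D}_s = \Bigl\{ v \in L^2(M) : \sum_{j \ge 0} |\hat{v}(j)|^2 \lambda_j^s < \infty \Bigr\} = \Bigl\{ v \in L^2(M) : \sum_{j \ge 0} \hat{v}(j)\, \lambda_j^{s/2} u_j \in L^2(M) \Bigr\}.$

In view of (A.3), an equivalent characterization is
(A.6) $\mathcal{D}_s = (-T)^{s/2} L^2(M).$
Clearly, we have
(A.7) $\mathcal{D}_0 = L^2(M).$
Also, $\mathcal{D}_2 = T L^2(M)$, and by Theorem 1.3 we have
(A.8) $\mathcal{D}_2 = H^2(M) \cap H_0^1(M).$
Generally, $\mathcal{D}_{s+2} = T \mathcal{D}_s$, so Theorem 1.3 also gives, inductively,
(A.9) $\mathcal{D}_{2k} \subset H^{2k}(M), \quad k = 1, 2, 3, \dots.$
A result perhaps slightly less obvious than (A.7)–(A.9) is that
(A.10) $\mathcal{D}_1 = H_0^1(M).$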


To see this, note that $\mathcal{D}_s$ is the completion of the space $\mathcal{F}$ of finite linear combinations of the eigenfunctions $\{u_j\}$, with respect to the $\mathcal{D}_s$-norm, defined by

(A.11) $\|v\|_{\mathcal{D}_s}^2 = \sum_j |\hat{v}(j)|^2 \lambda_j^s.$

Now, if $v \in \mathcal{F}$, then

(A.12) $(dv, dv) = (v, -\Delta v) = \sum_j (v, u_j)(u_j, -\Delta v) = \sum_j |\hat{v}(j)|^2 \lambda_j,$

so

(A.13) $\|v\|_{\mathcal{D}_1}^2 = \|dv\|_{L^2(M)}^2,$

for $v \in \mathcal{F}$. In fact, $\mathcal{D}_s$ is the completion of $\mathcal{D}_\sigma$ in the $\mathcal{D}_s$-norm for any $\sigma > s$. We see that (A.13) holds for all $v \in \mathcal{D}_2$, and, with $\mathcal{D}_2$ characterized by (A.8), it is clear that the completion in the norm (A.13) is described by (A.10).
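As a numerical sanity check of (A.11) and (A.13), the following sketch works in a model case that is an assumption of this example, not part of the text: $M = [0, \pi]$ with the Dirichlet condition, so that $u_j(x) = \sqrt{2/\pi}\, \sin jx$ and $\lambda_j = j^2$. The test function $v(x) = x(\pi - x)$ and the names `v_hat` and `ds_norm_sq` are likewise chosen only for illustration.

```python
import math


def v_hat(j: int) -> float:
    """Coefficient (v, u_j) for v(x) = x(pi - x) and u_j(x) = sqrt(2/pi) sin(jx).

    Two integrations by parts give the closed form
    integral_0^pi x(pi - x) sin(jx) dx = 2(1 - (-1)^j) / j^3.
    """
    return math.sqrt(2.0 / math.pi) * 2.0 * (1.0 - (-1.0) ** j) / j**3


def ds_norm_sq(s: float, n_modes: int = 2000) -> float:
    """Truncation of (A.11): sum_j |v_hat(j)|^2 lambda_j^s, with lambda_j = j^2."""
    return sum(v_hat(j) ** 2 * float(j * j) ** s for j in range(1, n_modes + 1))


d0 = ds_norm_sq(0.0)            # compare with ||v||^2 in L^2(0, pi)
d1 = ds_norm_sq(1.0)            # compare with ||dv||^2 in L^2(0, pi), as in (A.13)
v_l2_sq = math.pi**5 / 30.0     # integral of x^2 (pi - x)^2 over [0, pi]
dv_l2_sq = math.pi**3 / 3.0     # integral of (pi - 2x)^2 over [0, pi]
print(d0, v_l2_sq)
print(d1, dv_l2_sq)
```

Since $|\hat{v}(j)|^2$ decays like $j^{-6}$ in this model, the sum in (A.11) converges exactly when $s < 5/2$, which locates this particular $v$ on the scale $\mathcal{D}_s$.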

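If the Neumann boundary condition were considered, we would replace $\lambda_j$ by $\langle \lambda_j \rangle$ to take care of $\lambda_0 = 0$. In such a case, we would have

$$\mathcal{D}_2 = \Bigl\{ u \in H^2(M) : \frac{\partial u}{\partial \nu} = 0 \ \text{on } \partial M \Bigr\}, \qquad \mathcal{D}_1 = H^1(M).$$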

Now, for $s < 0$, we define $\mathcal{D}_s$ to be the dual of $\mathcal{D}_{-s}$:
(A.14) $\mathcal{D}_s = \mathcal{D}_{-s}^*.$
In particular, for any $v \in \mathcal{D}_s$, and any $s \in \mathbb{R}$, $(v, u_j) = \hat{v}(j)$ is defined, and we see that the characterizations involving the sums in (A.5) continue to hold for all $s \in \mathbb{R}$. Also the norm (A.11) provides a Hilbert space structure on $\mathcal{D}_s$ for all $s \in \mathbb{R}$. By (A.10) we have (for Dirichlet boundary conditions)
(A.15) $\mathcal{D}_{-1} = H^{-1}(M).$
Also, we have the interpolation identity
(A.16) $[\mathcal{D}_s, \mathcal{D}_\sigma]_\theta = \mathcal{D}_{\theta\sigma + (1-\theta)s},$
for all $s, \sigma \in \mathbb{R}$, $\theta \in [0, 1]$, where the interpolation spaces are as defined in Chap. 4.

The isomorphism
(A.17) $\Delta : \mathcal{D}_{s+2} \longrightarrow \mathcal{D}_s$, with inverse $T : \mathcal{D}_s \longrightarrow \mathcal{D}_{s+2}$,
obviously valid for $s \ge 0$, extends by duality to an isomorphism $\Delta : \mathcal{D}_{-s} \to \mathcal{D}_{-s-2}$ for $s \ge 0$, so (A.17) also holds for $s \le -2$. By interpolation, it holds for all real $s$.


By interpolation, (A.9) implies

(A.18) $\mathcal{D}_s \subset H^s(M)$, for $s \ge 0$.

The natural map $\mathcal{D}_s \to H^s(M)$ is injective, for $s \ge 0$, but it is not generally onto, and the transpose $H^{-s}(M) \to \mathcal{D}_{-s}$ is not generally injective. However, the natural map

(A.19) $H^{-s}_{\mathrm{comp}}(M) \longrightarrow \mathcal{D}_{-s}$

is injective, where $H^{-s}_{\mathrm{comp}}(M)$ denotes the space of elements of $H^{-s}(N)$ ($N$ being the double of $M$) with support in the interior of $M$. In particular, for any interior point $p \in M$,

(A.20) $\delta_p \in \mathcal{D}_{-s}$, for $s > n/2$ ($n = \dim M$).

Note that as $p \to \partial M$, $\delta_p \to 0$ in any of these spaces. From the isomorphism in (A.17), we have

(A.21) $G_p = \Delta^{-1} \delta_p = T \delta_p,$

well defined, and

(A.22) $G_p \in \mathcal{D}_{-n/2+2-\varepsilon}$, for all $\varepsilon > 0$.

This object is equivalent to the Green function studied in this chapter.

We can write any $v \in \mathcal{D}_s$, even for $s < 0$, as a Fourier series with respect to the eigenfunctions $u_j$. In fact, defining $\hat{v}(j) = (v, u_j)$, as before, the series $\sum_j \hat{v}(j) u_j$ is convergent in the space $\mathcal{D}_s$ to $v$, provided $v \in \mathcal{D}_s$, so we are justified in writing

(A.23) $v = \sum_j \hat{v}(j)\, u_j, \quad v \in \mathcal{D}_s$, for any $s \in \mathbb{R}$.

Note that $-\Delta : \mathcal{D}_s \to \mathcal{D}_{s-2}$ is given by

(A.24) $-\Delta v = \sum_j \lambda_j \hat{v}(j)\, u_j,$

for any $s \in \mathbb{R}$. We can define

(A.25) $(-\Delta)^{\sigma + i\tau} : \mathcal{D}_s \longrightarrow \mathcal{D}_{s-2\sigma},$

for any $\sigma, \tau \in \mathbb{R}$, by

(A.26) $(-\Delta)^{\sigma + i\tau} v = \sum_j \lambda_j^{\sigma + i\tau}\, \hat{v}(j)\, u_j,$

where $v \in \mathcal{D}_s$ is given by (A.23). The maps (A.25) are all isomorphisms. Note that we can write the $\mathcal{D}_s$-inner product coming from (A.11) as

(A.27) $(v, w)_{\mathcal{D}_s} = (v, (-\Delta)^s w),$

where on the right side the pairing arises from the natural $\mathcal{D}_s : \mathcal{D}_{-s}$ duality.


B. The Mayer—Vietoris sequence in de Rham cohomology 


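Here we establish a useful complement to the long exact sequence (9.67) and illustrate some of its implications. Let $X$ be a smooth manifold, and suppose $X$ is the union of two open sets, $M_1$ and $M_2$. Let $U = M_1 \cap M_2$. The Mayer–Vietoris sequence has the form

(B.1) $\cdots \to H^{k-1}(U) \xrightarrow{\ \delta\ } H^k(X) \xrightarrow{\ \rho\ } H^k(M_1) \oplus H^k(M_2) \xrightarrow{\ \gamma\ } H^k(U) \xrightarrow{\ \delta\ } \cdots$

These maps are defined as follows. A closed form $\alpha \in \Lambda^k(X)$ restricts to a pair of closed forms on $M_1$ and $M_2$, yielding $\rho$ in a natural fashion. The map $\gamma$ also comes from restriction; if $\iota_\nu : U \hookrightarrow M_\nu$, a pair of closed forms $\alpha_\nu \in \Lambda^k(M_\nu)$ goes to $\iota_1^* \alpha_1 - \iota_2^* \alpha_2$, defining $\gamma$. Clearly, $\iota_1^*(\alpha|_{M_1}) = \iota_2^*(\alpha|_{M_2})$ if $\alpha \in \Lambda^k(X)$, so $\gamma \circ \rho = 0$.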

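To define the “coboundary map” $\delta$ on a class $[\alpha]$, with $\alpha \in \Lambda^k(U)$ closed, pick $\beta_\nu \in \Lambda^k(M_\nu)$ such that $\alpha = \beta_1 - \beta_2$. Thus $d\beta_1 = d\beta_2$ on $U$. Set

(B.2) $\delta[\alpha] = [\sigma]$ with $\sigma = d\beta_\nu$ on $M_\nu$.

To show that (B.2) is well defined, suppose $\beta_\nu \in \Lambda^k(M_\nu)$ and $\beta_1 - \beta_2 = d\omega$ on $U$. Let $\{\varphi_\nu\}$ be a smooth partition of unity supported on $\{M_\nu\}$, and consider $\psi = \varphi_1 \beta_1 + \varphi_2 \beta_2$, where $\varphi_\nu \beta_\nu$ is extended by 0 off $M_\nu$. Since $d\varphi_2 = -d\varphi_1$, we have $d\psi = \varphi_1\, d\beta_1 + \varphi_2\, d\beta_2 + d\varphi_1 \wedge (\beta_1 - \beta_2) = \sigma + d\varphi_1 \wedge (\beta_1 - \beta_2)$. Since $d\varphi_1$ is supported on $U$, and $d(d\varphi_1 \wedge \omega) = -d\varphi_1 \wedge d\omega$, we can write

$$\sigma = d\psi + d(d\varphi_1 \wedge \omega),$$

an exact form on $X$, so (B.2) makes $\delta$ well defined. Obviously, the restriction of $\sigma$ to each $M_\nu$ is always exact, so $\rho \circ \delta = 0$. Also, if $\alpha = \iota_1^* \alpha_1 - \iota_2^* \alpha_2$ on $U$, we can pick $\beta_\nu = \alpha_\nu$ to define $\delta[\alpha]$. Then $d\beta_\nu = d\alpha_\nu = 0$, so $\delta \circ \gamma = 0$.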

In fact, the sequence (B.1) is exact, that is,

(B.3) $\operatorname{im} \delta = \ker \rho, \quad \operatorname{im} \rho = \ker \gamma, \quad \operatorname{im} \gamma = \ker \delta.$

We leave the verification of this as an exercise, which can be done with arguments similar to those sketched in Exercises 11–13 in the exercises on cohomology after §9.

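If $M_\nu$ are the interiors of compact manifolds with smooth boundary, and $U = M_1 \cap M_2$ has smooth boundary, the argument above extends directly to produce an exact sequence

(B.4) $\cdots \to H^{k-1}(\overline{U}) \xrightarrow{\ \delta\ } H^k(X) \xrightarrow{\ \rho\ } H^k(\overline{M}_1) \oplus H^k(\overline{M}_2) \xrightarrow{\ \gamma\ } H^k(\overline{U}) \xrightarrow{\ \delta\ } \cdots.$

Furthermore, suppose that instead $X = \overline{M}_1 \cup \overline{M}_2$ and $\overline{M}_1 \cap \overline{M}_2 = Y$ is a smooth hypersurface in $X$. One also has an exact sequence

(B.5) $\cdots \to H^{k-1}(Y) \xrightarrow{\ \delta\ } H^k(X) \xrightarrow{\ \rho\ } H^k(\overline{M}_1) \oplus H^k(\overline{M}_2) \xrightarrow{\ \gamma\ } H^k(Y) \xrightarrow{\ \delta\ } \cdots$

To relate (B.4) and (B.5), let $U$ be a collar neighborhood of $Y$, and form (B.4) with $M_\nu$ replaced by $M_\nu \cup U$. There is a map $\pi : \overline{U} \to Y$, collapsing orbits of a vector field transversal to $Y$, and $\pi^*$ induces an isomorphism of cohomology groups, $\pi^* : H^*(\overline{U}) \approx H^*(Y)$.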

To illustrate the use of (B.5), suppose $X = S^n$, $Y = S^{n-1}$ is the equator, and $M_\nu$ are the upper and lower hemispheres, each diffeomorphic to the ball $B^n$. Then we have an exact sequence

(B.6) $\cdots \to H^{k-1}(B^n) \oplus H^{k-1}(B^n) \xrightarrow{\ \gamma\ } H^{k-1}(S^{n-1}) \xrightarrow{\ \delta\ } H^k(S^n) \xrightarrow{\ \rho\ } H^k(B^n) \oplus H^k(B^n) \to \cdots.$

As in (9.70), $H^k(B^n) = 0$ except for $k = 0$, when you get $\mathbb{R}$. Thus

(B.7) $\delta : H^{k-1}(S^{n-1}) \xrightarrow{\ \approx\ } H^k(S^n)$, for $k > 1$.

Granted that the computation $H^1(S^1) \approx \mathbb{R}$ is elementary, this implies $H^n(S^n) \approx \mathbb{R}$, for $n \ge 1$. Looking at the segment

$$0 \to H^0(S^n) \xrightarrow{\ \rho\ } H^0(B^n) \oplus H^0(B^n) \xrightarrow{\ \gamma\ } H^0(S^{n-1}) \xrightarrow{\ \delta\ } H^1(S^n) \xrightarrow{\ \rho\ } 0,$$

we see that if $n \ge 2$, then $\ker \gamma \approx \mathbb{R}$, so $\gamma$ is surjective, hence $\delta = 0$, so $H^1(S^n) = 0$, for $n \ge 2$. Also, if $0 < k < n$, we see by iterating (B.7) that $H^k(S^n) \approx H^1(S^{n-k+1})$, so $H^k(S^n) = 0$, for $0 < k < n$. Since obviously $H^0(S^n) = \mathbb{R}$ for $n \ge 1$, we have a fourth computation of $H^k(S^n)$, distinct from those sketched in Exercise 10 of §8 and in Exercises 10 and 14 of the set of exercises on cohomology after §9.
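The bookkeeping in this computation of $H^k(S^n)$ can be mirrored in a few lines of code. The sketch below is only an illustration of the recursion, with a hypothetical function name `betti_sphere`; it takes as given the facts quoted in the text (connectedness, $H^1(S^1) \approx \mathbb{R}$, $H^1(S^n) = 0$ for $n \ge 2$, and the isomorphism (B.7)), plus the vanishing of forms of degree greater than $n$.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def betti_sphere(k: int, n: int) -> int:
    """dim H^k(S^n) for n >= 1, assembled from the steps used in the text."""
    if n < 1 or k < 0:
        raise ValueError("need n >= 1 and k >= 0")
    if k > n:
        return 0          # no nonzero forms of degree > n
    if k == 0:
        return 1          # S^n is connected for n >= 1
    if n == 1:
        return 1          # here k = 1: H^1(S^1) = R
    if k == 1:
        return 0          # H^1(S^n) = 0 for n >= 2
    return betti_sphere(k - 1, n - 1)   # the isomorphism (B.7), valid for k > 1


for n in range(1, 5):
    print(n, [betti_sphere(k, n) for k in range(n + 1)])
```

The alternating sum of each printed row is $1 + (-1)^n$, the Euler characteristic of $S^n$.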

We note an application of (B.5) to the computation of Euler characteristics, namely

(B.8) $\chi(\overline{M}_1) + \chi(\overline{M}_2) = \chi(X) + \chi(Y).$

Note that this result contains some of the implications of Exercises 17 and 18 in the exercises on cohomology, in §9.

Using this, it is an exercise to show that if one two-dimensional surface $X_1$ is obtained from another $X_0$ by adding a handle, then $\chi(X_1) = \chi(X_0) - 2$. In particular, if $M^g$ is obtained from $S^2$ by adding $g$ handles, then $\chi(M^g) = 2 - 2g$. Thus, if $M^g$ is orientable, since $H^0(M^g) \approx H^2(M^g) \approx \mathbb{R}$, we have

(B.9) $H^1(M^g) \approx \mathbb{R}^{2g}.$
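The arithmetic behind (B.9) is short enough to spell out in code. This is a minimal sketch with hypothetical function names; it assumes only the two facts stated above, namely that each handle lowers $\chi$ by 2 and that $b_0 = b_2 = 1$ for a compact, connected, orientable surface.

```python
def euler_char_with_handles(g: int) -> int:
    """chi(M^g), starting from chi(S^2) = 2 and subtracting 2 per handle."""
    chi = 2
    for _ in range(g):
        chi -= 2
    return chi


def first_betti_number(g: int) -> int:
    """dim H^1(M^g), solved from chi = b0 - b1 + b2 with b0 = b2 = 1."""
    b0 = b2 = 1
    return b0 + b2 - euler_char_with_handles(g)


for g in range(4):
    print(g, euler_char_with_handles(g), first_betti_number(g))
```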
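It is useful to examine the beginning of the sequence (B.5):
(B.10) $0 \to H^0(X) \xrightarrow{\ \rho\ } H^0(\overline{M}_1) \oplus H^0(\overline{M}_2) \xrightarrow{\ \gamma\ } H^0(Y) \xrightarrow{\ \delta\ } H^1(X) \to \cdots$

Suppose $C$ is a smooth, closed curve in $S^2$. Apply (B.10) with $M_1 = \mathcal{C}$, a collar neighborhood of $C$, and $M_2 = \Omega$, the complement of $\overline{\mathcal{C}}$. Since $\partial \mathcal{C}$ is diffeomorphic to two copies of $C$, and since $H^1(S^2) = 0$, (B.10) becomes

(B.11) $0 \to \mathbb{R} \xrightarrow{\ \rho\ } \mathbb{R} \oplus H^0(\overline{\Omega}) \xrightarrow{\ \gamma\ } \mathbb{R} \oplus \mathbb{R} \xrightarrow{\ \delta\ } 0.$
Thus $\gamma$ is surjective while $\ker \gamma = \operatorname{im} \rho \approx \mathbb{R}$. This forces
(B.12) $H^0(\overline{\Omega}) = \mathbb{R} \oplus \mathbb{R}.$

In other words, $\Omega$ has exactly two connected components. This is the smooth case of the Jordan curve theorem. Jordan's theorem holds when $C$ is a homeomorphic image of $S^1$, but the trick of putting a collar about $C$ does not extend to this case.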
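The step from (B.11) to (B.12) is a dimension count: in a finite exact sequence of finite-dimensional vector spaces that starts and ends with 0, the alternating sum of the dimensions vanishes. The following sketch encodes that count; the helper name `missing_dimension` is hypothetical and the code is only an illustration of the arithmetic.

```python
def missing_dimension(dims):
    """Return the one unknown dimension in an exact sequence 0 -> V_0 -> ... -> V_m -> 0.

    `dims` lists dim V_0, ..., dim V_m with exactly one entry set to None.
    Exactness forces the alternating sum of the dimensions to vanish.
    """
    if dims.count(None) != 1:
        raise ValueError("exactly one entry must be None")
    idx = dims.index(None)
    known = sum((-1) ** i * d for i, d in enumerate(dims) if d is not None)
    return -known * (-1) ** idx


# (B.11): 0 -> R -> R (+) H^0(closure of Omega) -> R (+) R -> 0
middle = missing_dimension([1, None, 2])
components_of_omega = middle - 1      # remove the summand R coming from the collar
print(components_of_omega)
```

Replacing the last entry `2` by `1` models a one-dimensional third term, and the same count then returns a single component.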

More generally, if $X$ is a compact, connected, smooth, oriented manifold such that $H^1(X) = 0$, and if $Y$ is a smooth, compact, connected, oriented hypersurface, then letting $\mathcal{C}$ be a collar neighborhood of $Y$ and $\Omega = X \setminus \overline{\mathcal{C}}$, we again obtain the sequence (B.11) and hence the conclusion (B.12). The orientability ensures that $\partial \mathcal{C}$ is diffeomorphic to two copies of $Y$. This produces the following variant of (the smooth case of) the Jordan–Brouwer separation theorem.


Theorem B.1. If $X$ is a smooth manifold, $Y$ is a smooth submanifold of codimension 1, both are compact, connected, and oriented, and

$$H^1(X) = 0,$$

then $X \setminus Y$ has precisely two connected components.


If all these conditions hold, except that $Y$ is not orientable, then we replace $\mathbb{R} \oplus \mathbb{R}$ by $\mathbb{R}$ in (B.11) and conclude that $X \setminus Y$ is connected, in that case. As an example, the real projective space $\mathbb{RP}^2$ sits in $\mathbb{RP}^3$ in such a fashion.
Recall from §20 of Chap. 1 the elementary proof of Theorem B.1 when $X = \mathbb{R}^{n+1}$, in particular the argument using degree theory that if $Y$ is a compact, oriented surface in $\mathbb{R}^{n+1}$ (hence, in $S^{n+1}$), then its complement has at least two connected components. One can extend the degree-theory argument to the nonorientable case, as follows.

There is a notion of degree mod 2 of a map $F : Y \to S^n$, which is well defined whether or not $Y$ is orientable. For one approach, see [Mil]. This is also invariant under homotopy. Now, if in the proof of Theorem 20.11 of Chap. 1, one drops the hypothesis that the hypersurface $Y$ (denoted $X$ there) is orientable, it still follows that the mod 2 degree of $F_p$ must jump by $\pm 1$ when $p$ crosses $Y$, so $\mathbb{R}^{n+1} \setminus Y$ still must have at least two connected components. In view of the result noted after Theorem B.1, this situation cannot arise. This establishes the following.


Proposition B.2. If $Y$ is a compact hypersurface of $\mathbb{R}^{n+1}$ (or $S^{n+1}$), then $Y$ is orientable.


C. Topological invariance of de Rham cohomology 


If $X$ is a smooth compact manifold, the definition of the de Rham cohomology groups $H^k(X)$ depends explicitly on the differential structure of $X$. In light of this, it is of interest that the following topological invariance result holds.


Proposition C.1. Let $X$ and $Y$ be smooth, compact, $n$-dimensional manifolds, and let

(C.1) $f : X \longrightarrow Y$ be a homeomorphism.

Then $f$ induces isomorphisms of de Rham cohomology,
(C.2) $f^* : H^k(Y) \xrightarrow{\ \approx\ } H^k(X)$, for $0 \le k \le n$.


Part of the significance of this result lies in the fact that there are compact smooth manifolds that are homeomorphic but not diffeomorphic. Indeed, [Mil2] stunned the mathematical world by producing smooth manifolds homeomorphic but not diffeomorphic to $S^7$. Since then, there have been many results of this nature, particularly involving 4-manifolds and the use of the analysis of Yang–Mills equations to discern exotic differential structures.


Proof of Proposition C.1. First, embedding $Y$ smoothly in some Euclidean space, we can find a sequence of $C^\infty$ maps $\varphi_\nu : X \to Y$ such that $\varphi_\nu \to f$ uniformly as $\nu \to \infty$. Similarly, with $g = f^{-1} : Y \to X$, we can find $C^\infty$ maps $\psi_\nu : Y \to X$ such that $\psi_\nu \to g$ uniformly. It follows that

(C.3) $\psi_\nu \circ \varphi_\nu : X \longrightarrow X$ and $\varphi_\nu \circ \psi_\nu : Y \longrightarrow Y$

are smooth maps and $\psi_\nu \circ \varphi_\nu$ and $\varphi_\nu \circ \psi_\nu$ uniformly tend to the identity maps on $X$ and $Y$, respectively. Of course, we have induced maps

(C.4) $\varphi_\nu^* : H^k(Y) \to H^k(X), \quad \psi_\nu^* : H^k(X) \to H^k(Y),$

for $0 \le k \le n$, hence

(C.5) $\varphi_\nu^* \circ \psi_\nu^* = (\psi_\nu \circ \varphi_\nu)^* : H^k(X) \longrightarrow H^k(X), \quad \psi_\nu^* \circ \varphi_\nu^* = (\varphi_\nu \circ \psi_\nu)^* : H^k(Y) \longrightarrow H^k(Y).$

The key to the endgame is very simple. There exists $N$ such that for $\nu \ge N$, $\psi_\nu \circ \varphi_\nu$ and $\varphi_\nu \circ \psi_\nu$ in (C.3) are both smoothly homotopic to the identity maps on $X$ and $Y$, respectively, so the induced maps on cohomology are the identity maps. That is, the maps in (C.5) are the identity maps, for $\nu \ge N$. Hence, for $\nu \ge N$,

(C.6) $\varphi_\nu^* : H^k(Y) \xrightarrow{\ \approx\ } H^k(X), \quad \psi_\nu^* : H^k(X) \xrightarrow{\ \approx\ } H^k(Y),$

these maps being 2-sided inverses of each other.
We also note that, for $N$ large enough,

(C.7) $\mu, \nu \ge N \Longrightarrow \varphi_\mu, \varphi_\nu$ smoothly homotopic and $\psi_\mu, \psi_\nu$ smoothly homotopic $\Longrightarrow \varphi_\mu^* = \varphi_\nu^*$ and $\psi_\mu^* = \psi_\nu^*$, in (C.4),

so the isomorphisms (C.6) are uniquely determined by the map $f$.
There are other varieties of cohomology groups, such as singular cohomology groups

(C.8) $H^k_{\mathrm{sing}}(X, G),$

defined when $G$ is a commutative additive group. We refer to the texts [GH] and [Hat] for definitions and treatments, including numerous topological applications. A key connection with de Rham cohomology is given by the following result, known as de Rham's theorem.


Proposition C.2. If $X$ is a smooth compact manifold, there is a natural isomorphism

(C.9) $H^k(X) \approx H^k_{\mathrm{sing}}(X, \mathbb{R}).$


The classical treatment is given in [deR]. A simplified argument can be found in [Lee]. This argument makes essential use of the Mayer–Vietoris sequence, both for de Rham cohomology and its analogue for singular cohomology. A variant of Proposition C.2, using Čech cohomology in place of singular cohomology, is given in [BoT], [SiT], and [GuH].

The singular cohomology groups are designed to be topological invariants. As shown in [GH] and [Hat], in the setting of Proposition C.1, the maps $f$ and $g$ induce isomorphisms

(C.10) $f^* : H^k_{\mathrm{sing}}(Y, G) \longrightarrow H^k_{\mathrm{sing}}(X, G), \quad g^* : H^k_{\mathrm{sing}}(X, G) \longrightarrow H^k_{\mathrm{sing}}(Y, G),$

which are 2-sided inverses of each other. Furthermore, with the choice of $N$ such that

(C.11) $\nu \ge N \Longrightarrow \varphi_\nu$ homotopic to $f$ and $\psi_\nu$ homotopic to $g$,

we have

(C.12) $\varphi_\nu^* = f^*$ on $H^k_{\mathrm{sing}}(Y, G)$, and $\psi_\nu^* = g^*$ on $H^k_{\mathrm{sing}}(X, G)$.

Specializing to $G = \mathbb{R}$ and using the isomorphism (C.9), as specifically constructed in [deR] or [Lee], recovers (C.6).

As seen in this text, the use of Stokes' theorem is a convenient tool to establish basic results about de Rham cohomology, and Hodge theory provides a powerful analytical tool. On the other hand, singular cohomology has the advantage of applying to spaces $X$ with no smooth structure, and the use of groups $G$ other than $\mathbb{R}$ (such as $\mathbb{Z}$, or $\mathbb{Z}/(2)$) has many important applications (such as mod 2 degree theory), which one can read about in such expositions as [GH] and [Hat].


Linear Evolution Equations 


Introduction 


Here we study linear PDE for which one poses an initial-value problem, also called a “Cauchy problem,” say at time $t = t_0$. The emphasis is on the wave and heat equations:

(0.1) $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0, \qquad \frac{\partial u}{\partial t} - \Delta u = 0,$

though some other sorts of PDE, such as symmetric hyperbolic systems, are also discussed.

Sections 1 and 2 in particular treat (0.1), for $u = u(t, x)$, where $x$ is in a compact Riemannian manifold, or a noncompact but complete Riemannian manifold (perhaps with boundary), respectively. We make essential use of finite propagation speed for solutions to the wave equation to pass from the compact to the noncompact case. In §3 we treat Maxwell's equations, for the electromagnetic field, by converting this system to the wave equation, where $\Delta$ is the Hodge Laplacian, and the boundary conditions are of the form studied in §9 of Chap. 5.

Section 4 establishes the Cauchy–Kowalewsky theorem, for linear PDE with real analytic coefficients and real analytic initial data. We show that the solution $u(t, x)$ is given as a convergent power series $\sum_j u_j(x)\, t^j / j!$, whose coefficients $u_j(x)$ belong to certain Banach spaces of holomorphic functions. The argument here differs from the classical method of majorants. While it is straightforward, it does not generalize easily to nonlinear analytic PDE. We will give a treatment of the Cauchy–Kowalewsky theorem in the nonlinear case in Chap. 16.

In §5 we use energy estimates for general second-order, scalar, hyperbolic PDE, derived in Chap. 2, to establish the existence of solutions to the Cauchy problem. We also provide a parallel study of first-order, symmetric, hyperbolic systems. The technique we use involves approximating the coefficients (and initial data) by real analytic functions and using the Cauchy–Kowalewsky theorem. A different technique will be presented in §7 of Chap. 7.
Section 6 discusses geometrical optics, a technique for constructing approximate solutions to certain types of initial-value problems for hyperbolic equations. We continue this discussion in §7, illustrating the simplest situation where the eikonal equation of geometrical optics breaks down and caustics are formed. We study the geometry behind the formation of the simplest sort of caustics, and we study a class of oscillatory integrals, whose relation to solutions to the wave equation will follow from material developed in the next chapter.

In §8 we return to the heat equation on a smoothly bounded domain, with 
the Dirichlet boundary condition, and study boundary layer effects that arise for 
solutions with initial data that do not vanish at the boundary. Our analysis makes 
use of wave equation techniques and material from §6. 

In §9 we consider Schrödinger equations, $\partial u/\partial t = iP(D)u$, with $P(\xi) = -B\xi \cdot \xi$, $B = B^t \in M(n, \mathbb{R})$, starting with the 1D case, where an investigation of piecewise smooth initial data leads to the Fresnel integral.

There are two appendices at the end of this chapter. Appendix A establishes estimates for $\partial/\partial x_j$ acting on certain spaces of harmonic functions on the ball, of use in the proof of the linear Cauchy–Kowalewsky theorem in §4. Appendix B establishes the multidimensional case of the stationary phase method, whose one-dimensional case arose in §7. The stationary phase method has other uses; in Chap. 9, we will apply it to some problems in scattering theory.


1. The heat equation and the wave equation on bounded 
domains 


Let $M$ be a compact, Riemannian manifold with boundary (which might be empty). On $C^\infty(M)$ is defined the Laplace operator, as usual. We consider here existence and regularity of solutions to the heat equation, and the wave equation. The heat equation is

(1.1) $\frac{\partial u}{\partial t} = \Delta u,$

for $u = u(t, x)$, $t \in \mathbb{R}^+$, $x \in M$. Here, we use $\mathbb{R}^+$ to denote $[0, \infty)$. We set the initial condition

(1.2) $u(0, x) = f(x).$
If $\partial M \neq \emptyset$, we also pose a boundary condition. The Dirichlet condition is
(1.3) $u(t, x) = 0, \quad x \in \partial M.$

The same methods apply to the Neumann boundary problem, $\partial u/\partial \nu = 0$, for $x \in \partial M$, $t \in \mathbb{R}^+$, and a number of other boundary problems.


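Solutions to (1.1)–(1.3) can be constructed with the aid of the eigenfunctions of $\Delta$, which arose in (1.11)–(1.13) of Chap. 5. Recall the orthonormal basis $\{u_j\}$ of $L^2(M)$ satisfying
(1.4) $u_j \in H_0^1(M) \cap C^\infty(\overline{M}), \quad \Delta u_j = -\lambda_j u_j, \quad 0 < \lambda_j \nearrow \infty.$

Given $f \in L^2(M)$, we can write

(1.5) $f = \sum_j \hat{f}(j)\, u_j, \quad \hat{f}(j) = (f, u_j).$
Then set
(1.6) $u(t, x) = \sum_j \hat{f}(j)\, e^{-t\lambda_j} u_j(x), \quad t \ge 0.$

Recalling the spaces $\mathcal{D}_s$ defined in §A of Chap. 5, we see that
(1.7) $f \in \mathcal{D}_s \Longrightarrow u \in C(\mathbb{R}^+, \mathcal{D}_s), \quad \partial_t u \in C(\mathbb{R}^+, \mathcal{D}_{s-2}).$
It is clear that $\partial_t u = \Delta u$, for $t > 0$. If $f \in \mathcal{D}_s$ with $s > n/2$, then $u \in C([0, \infty) \times \overline{M})$, and $u(t, x)$ satisfies (1.2) and (1.3) in the ordinary sense.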
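The eigenfunction formula (1.6) can be evaluated directly in a model case. The sketch below assumes, purely for illustration, that $M = [0, \pi]$ with the Dirichlet condition, so $u_j(x) = \sqrt{2/\pi}\, \sin jx$ and $\lambda_j = j^2$, and it takes $f(x) = x(\pi - x)$; the names `f_hat` and `heat_solution` and the truncation at 400 modes are choices of this example.

```python
import math

SQRT_2_OVER_PI = math.sqrt(2.0 / math.pi)


def f_hat(j: int) -> float:
    """(f, u_j) for f(x) = x(pi - x); closed form via two integrations by parts."""
    return SQRT_2_OVER_PI * 2.0 * (1.0 - (-1.0) ** j) / j**3


def heat_solution(t: float, x: float, n_modes: int = 400) -> float:
    """Truncation of (1.6): sum_j f_hat(j) exp(-t lambda_j) u_j(x), lambda_j = j^2."""
    total = 0.0
    for j in range(1, n_modes + 1):
        lam = float(j * j)
        total += f_hat(j) * math.exp(-t * lam) * SQRT_2_OVER_PI * math.sin(j * x)
    return total


mid = math.pi / 2.0
print(heat_solution(0.0, mid), mid * mid)      # initial condition (1.2) at x = pi/2
print(heat_solution(1.0, 0.0))                 # boundary condition (1.3) at x = 0
print(heat_solution(5.0, mid), 8.0 / math.pi * math.exp(-5.0))   # lowest mode dominates
```

For large $t$ only the $j = 1$ term survives to working precision, which is the decay built into the factors $e^{-t\lambda_j}$.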
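The uniqueness of solutions to (1.1)–(1.3) within the class

(1.8) $C(\mathbb{R}^+, \mathcal{D}_s) \cap C^1(\mathbb{R}^+, \mathcal{D}_{s-2})$

is easy to obtain, either by showing that the coefficients in the eigenfunction expansion in terms of the $u_j$ must be given by (1.6), or from the simple energy estimate

(1.9) $\frac{d}{dt} \|u(t)\|_{\mathcal{D}_{s-2}}^2 = 2 \operatorname{Re} \Bigl( \frac{\partial u}{\partial t}, u(t) \Bigr)_{\mathcal{D}_{s-2}} = -2 \|u(t)\|_{\mathcal{D}_{s-1}}^2 \le 0,$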
s—2 Ot Des a 


for a solution to (1.1) belonging to (1.8). We denote the solution to (1.1)–(1.3) as
(1.10) $u(t, x) = e^{t\Delta} f(x).$
Let us note that, by (1.6),
(1.11) $u \in C^\infty((0, \infty), \mathcal{D}_\sigma)$, for all $\sigma \in \mathbb{R}$.
In particular, for the solution $u$ to (1.1)–(1.3), for any $f \in \mathcal{D}_s$,

(1.12) $u \in C^\infty((0, \infty) \times \overline{M}).$

There is a maximum principle for solutions to the heat equation (1.1) similar to that for the Laplace equation, discussed in §2 of Chap. 5, namely the following.


Proposition 1.1. If $u \in C([0, a) \times \overline{M}) \cap C^2((0, a) \times M)$ and $u$ solves (1.1) in $(0, a) \times M$, then

(1.13) $\sup_{[0,a) \times M} u(t, x) = \max \Bigl\{ \sup_{x \in M} u(0, x), \ \sup_{x \in \partial M,\ t \in [0,a)} u(t, x) \Bigr\}.$

In particular, if (1.2) and (1.3) hold, then

(1.14) $\sup_{[0,a) \times M} u(t, x) = \sup_M f(x).$


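Proof. It suffices to show that

$$u \ge 0 \ \text{on } \{0\} \times M \cup [0, a) \times \partial M \Longrightarrow u \ge 0 \ \text{on } [0, a) \times M.$$

In turn, if we set $u_\varepsilon(t, x) = u(t, x) + \varepsilon t$, it suffices to show that, for any $\varepsilon > 0$, the hypothesis above on $u$ implies $u_\varepsilon \ge 0$ on $[0, a) \times M$. Indeed, if this implication is false for some $u$, then, since $M$ is compact, there must be a smallest $t_0 \in (0, a)$ such that $u_\varepsilon(t_0, x_0) = 0$, for some $x_0 \in M$. We must have $\partial_t u_\varepsilon(t_0, x_0) \le 0$ and $\Delta u_\varepsilon(t_0, x_0) \ge 0$. However, $u_\varepsilon$ satisfies the equation $\partial_t u_\varepsilon = \Delta u_\varepsilon + \varepsilon$, so this yields a contradiction, proving the proposition.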


There are sharper versions of the maximum principle, analogous to the Hopf 
maximum principle for elliptic equations proved in Chap. 5. See [J] and [PW] for 
more on this. 

One corollary of (1.14) is that the map (1.10) extends uniquely from a map of $f \in \mathcal{D}_s \mapsto u \in C(\mathbb{R}^+, \mathcal{D}_s)$ (say for some $s > n/2$) to a mapping

(1.15) $f \in C_o(M) \mapsto u \in C([0, \infty) \times \overline{M}),$
where $C_o(M)$ is the space of continuous functions on $\overline{M}$ vanishing on $\partial M$, that is, the sup norm closure of $C_0^\infty(M)$.

Recall from §A of Chap. 5 that $\delta_p \in \mathcal{D}_{-n/2-\varepsilon}$ for all $\varepsilon > 0$. The “fundamental solution” to the heat equation is

(1.16) $H(t, x, p) = e^{t\Delta} \delta_p(x).$

By (1.12), $H(t, x, p)$ is smooth in $(t, x)$, for $t > 0$. Since $\delta_p$ is a limit in $\mathcal{D}_{-n/2-\varepsilon}$ of elements of $C_0^\infty(M)$ that are $\ge 0$, it follows that

(1.17) $H(t, x, p) \ge 0$, for $t \in (0, \infty)$, $x \in M$, $p \in M$.
In fact, there is a variant of the strong maximum principle, which strengthens (1.17) to $H(t, x, p) > 0$ for $t > 0$, $x, p \in M$. We refer to [J] and [PW] for details.
Next we look at the wave equation

(1.18) $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0,$

for $u = u(t, x)$, $t \in \mathbb{R}$, $x \in M$. The initial conditions are

(1.19) $u(0, x) = f(x), \quad u_t(0, x) = g(x),$

and if $\partial M$ is not empty, we impose the Dirichlet boundary condition
(1.20) $u(t, x) = 0, \quad x \in \partial M.$

If we write $u(t, x)$ as
(1.21) $u(t, x) = \sum_j a_j(t)\, u_j(x),$
with $u_j$ the eigenfunctions (1.4), then the coefficients $a_j(t)$ satisfy
(1.22) $a_j''(t) + \lambda_j a_j(t) = 0, \quad a_j(0) = \hat{f}(j), \quad a_j'(0) = \hat{g}(j),$
where $\hat{f}(j) = (f, u_j)$, $\hat{g}(j) = (g, u_j)$, and hence

(1.23) $a_j(t) = \hat{f}(j) \cos \lambda_j^{1/2} t + \hat{g}(j)\, \lambda_j^{-1/2} \sin \lambda_j^{1/2} t.$

If $\partial M = \emptyset$ (and $M$ is connected), then 0 is an eigenvalue of multiplicity one; $\lambda_0 = 0$. In that case, for $j = 0$, (1.23) is replaced by

$$a_0(t) = \hat{f}(0) + \hat{g}(0)\, t.$$

For simplicity in writing formulas, we will ignore that case.
Thus, assuming all $\lambda_j$ are nonzero, a solution to (1.18)–(1.20) is given by

(1.24) $u(t, x) = \sum_j \bigl[ \hat{f}(j) \cos \lambda_j^{1/2} t + \hat{g}(j)\, \lambda_j^{-1/2} \sin \lambda_j^{1/2} t \bigr] u_j(x).$

This is equivalent to the operator expression

(1.25) $u(t, x) = \cos t\sqrt{-\Delta}\, f + \frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g.$
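Formula (1.24) can be tested against a case where the answer is known by other means. The sketch below again assumes the model $M = [0, \pi]$ with the Dirichlet condition, where $\lambda_j^{1/2} = j$; the data $f(x) = x(\pi - x)$ and $g(x) = \sin x$, the dictionary representation of the coefficients, and the function names are all choices of this example. For $g = 0$ the series should agree with d'Alembert's formula applied to the odd $2\pi$-periodic extension of $f$, and for $f = 0$ it reduces to $\sin t \sin x$.

```python
import math


def u_j(j: int, x: float) -> float:
    """Normalized Dirichlet eigenfunction on [0, pi]."""
    return math.sqrt(2.0 / math.pi) * math.sin(j * x)


def wave_solution(t: float, x: float, f_hat: dict, g_hat: dict) -> float:
    """Evaluate (1.24) for coefficient tables f_hat, g_hat indexed by j >= 1."""
    total = 0.0
    for j in sorted(set(f_hat) | set(g_hat)):
        omega = float(j)                       # lambda_j^{1/2} = j in this model
        a_j = (f_hat.get(j, 0.0) * math.cos(omega * t)
               + g_hat.get(j, 0.0) * math.sin(omega * t) / omega)
        total += a_j * u_j(j, x)
    return total


# f(x) = x(pi - x): only odd modes, with (f, u_j) = sqrt(2/pi) * 4 / j^3.
f_hat = {j: math.sqrt(2.0 / math.pi) * 4.0 / j**3 for j in range(1, 400, 2)}
# g(x) = sin x: a single mode, with (g, u_1) = sqrt(pi/2).
g_hat = {1: math.sqrt(math.pi / 2.0)}

print(wave_solution(1.0, 1.0, f_hat, {}), math.pi - 2.0)        # d'Alembert value
print(wave_solution(1.0, 1.0, {}, g_hat), math.sin(1.0) ** 2)   # sin t sin x
```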



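We see that

(1.26) $f \in \mathcal{D}_s,\ g \in \mathcal{D}_{s-1} \Longrightarrow u \in C(\mathbb{R}, \mathcal{D}_s), \quad \partial_t u \in C(\mathbb{R}, \mathcal{D}_{s-1}),$

if $u$ is given by (1.24). If $s > n/2$, then $u \in C(\mathbb{R} \times \overline{M})$, and the boundary condition (1.20) is satisfied in the ordinary sense.
If we use the “energy norm,” whose square is

(1.27) $E_s(t) = \|u(t)\|_{\mathcal{D}_s}^2 + \|u_t(t)\|_{\mathcal{D}_{s-1}}^2,$
where $\|v\|_{\mathcal{D}_s} = \|(-\Delta)^{s/2} v\|_{L^2(M)}$, we see that if
(1.28) $u \in C^1(\mathbb{R}, \mathcal{D}_s) \cap C^2(\mathbb{R}, \mathcal{D}_{s-1}),$
then

(1.29) $\frac{dE_s}{dt} = 2 \operatorname{Re}\, (u_t(t), u(t))_{\mathcal{D}_s} + 2 \operatorname{Re}\, (u_{tt}(t), u_t(t))_{\mathcal{D}_{s-1}} = 2 \operatorname{Re}\, (u_t(t), (-\Delta)^s u(t)) + 2 \operatorname{Re}\, (u_t(t), \Delta(-\Delta)^{s-1} u(t)) = 0,$

provided $u$ solves the wave equation (1.18). Thus we have the energy identity
(1.30) $E_s(t) = E_s(0),$

for all $t \in \mathbb{R}$. In the case $\lambda_0 = 0$, (1.27) annihilates constants, so we don't quite get a norm.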
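The energy identity (1.30) holds mode by mode, which makes it easy to check numerically. In the sketch below the eigenvalues $\lambda_j = j^2$ and the handful of nonzero coefficients are arbitrary assumptions made for the example; each mode is advanced by (1.23), and (1.27) is evaluated through the spectral form of the $\mathcal{D}_s$-norm in (A.11).

```python
import math


def energy(s: float, t: float, f_hat: dict, g_hat: dict) -> float:
    """E_s(t) from (1.27), with each mode a_j(t) given by (1.23) and lambda_j = j^2."""
    total = 0.0
    for j in sorted(set(f_hat) | set(g_hat)):
        lam = float(j * j)
        omega = math.sqrt(lam)
        fj, gj = f_hat.get(j, 0.0), g_hat.get(j, 0.0)
        a = fj * math.cos(omega * t) + gj * math.sin(omega * t) / omega
        a_dot = -fj * omega * math.sin(omega * t) + gj * math.cos(omega * t)
        # ||u||^2 in D_s plus ||u_t||^2 in D_{s-1}, one mode at a time
        total += lam**s * a * a + lam ** (s - 1.0) * a_dot * a_dot
    return total


f_hat = {1: 1.0, 2: -0.5, 5: 0.25}   # hypothetical coefficients of f
g_hat = {1: 0.3, 3: 2.0}             # hypothetical coefficients of g

for s in (0.0, 1.0, 2.5):
    print(s, [energy(s, t, f_hat, g_hat) for t in (0.0, 0.3, 1.7, 10.0)])
```

Each printed row is constant up to rounding, since $\lambda_j^s a_j^2 + \lambda_j^{s-1} (a_j')^2 = \lambda_j^s \hat{f}(j)^2 + \lambda_j^{s-1} \hat{g}(j)^2$ for every $t$.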

We saw in Chap. 2 that solutions to the wave equation that are sufficiently smooth satisfy the finite propagation speed property. We now show that this holds for general solutions, with initial data $f, g$ as in (1.26). Thus we need to define the support of an element $f \in \mathcal{D}_s$. Consider

(1.31) $\mathcal{D}_\infty = \bigcap_j \mathcal{D}_j.$

We know that $\mathcal{D}_\infty \subset C^\infty(\overline{M})$, and we use the usual notion of support of an element of this space. If $K \subset \overline{M}$ is closed, $s \in \mathbb{R}$, we will say $f \in \mathcal{D}_s$ is “$\mathcal{D}$-supported” in $K$ if and only if

(1.32) $(v, f) = 0$, for all $v \in \mathcal{D}_\infty$ such that $\operatorname{supp} v \subset \overline{M} \setminus K$.

Soon we will just say $f$ is supported in $K$, but a distinct term will be useful until a few points are clarified. We show right away that this notion coincides with the familiar notion of support when $s \ge 0$.




Lemma 1.2. Let $K \subset \overline{M}$ be closed, $s \in [0, \infty)$, $v \in \mathcal{D}_s \subset L^2(M)$. Then $v$ is $\mathcal{D}$-supported in $K$ $\Longleftrightarrow$ $v$ is supported in $K$ in the usual sense, that is, $v(x) = 0$ for almost all $x \in M \setminus K$.


Proof. Let $w \in \mathcal{D}_\infty$ have support (in the usual sense) in a closed set $L \subset \overline{M} \setminus K$. If $v \in \mathcal{D}_0$ vanishes pointwise a.e. on $M \setminus K$, then certainly $(v, w) = \int v(x) w(x)\, dV = 0$. This establishes the implication $\Leftarrow$.

Suppose conversely that $(v, w) = 0$ for all $w \in \mathcal{D}_\infty$ that vanish pointwise on a neighborhood of $K$. In particular, $(v, w) = 0$ for all $w \in C_0^\infty(M \setminus K)$, so $v$ vanishes pointwise a.e. on the open set $U = M \setminus K \subset M$, hence on the closure of $U$ in $\overline{M} \setminus K$. This completes the proof.


It is useful to draw attention to one point related to the proof above, namely
(1.33) for $s < 0$, $C_0^\infty(M)$ is dense in $\mathcal{D}_s$.

To illustrate the notion of “$\mathcal{D}$-supported” for $s < 0$, we note that, given $p \in \partial M$, there is a nonzero $\nu_p \in \mathcal{D}_s$, for any $s < -n/2 - 1$, defined by $(u, \nu_p) = \partial u(p)/\partial \nu$, and $\nu_p$ is $\mathcal{D}$-supported on $\{p\}$.

We now state the result on finite propagation speed. 


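Proposition 1.3. If $K \subset \overline{M}$ is closed, and
(1.34) $K_d = \{x \in \overline{M} : \operatorname{dist}(x, K) \le d\},$
then if $f \in \mathcal{D}_s$, $g \in \mathcal{D}_{s-1}$ are $\mathcal{D}$-supported in $K$, it follows that

(1.35) $\cos t\sqrt{-\Delta}\, f$ and $\frac{\sin t\sqrt{-\Delta}}{\sqrt{-\Delta}}\, g$ are $\mathcal{D}$-supported in $K_d$,

for $|t| \le d$.


Proof. Let $v \in \mathcal{D}_\infty$ be supported in $\overline{M} \setminus K_d$. We have
(1.36) $(\cos t\sqrt{-\Delta}\, f, v) = (f, \cos t\sqrt{-\Delta}\, v).$

But the results of Chap. 2 apply to $\cos t\sqrt{-\Delta}\, v$, which is smooth, so the right side of (1.36) vanishes for $|t| \le d$. The same sort of analysis applies to $(-\Delta)^{-1/2} \sin t\sqrt{-\Delta}\, g$, to complete the proof.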


The next result should justify one's simply saying that $f \in \mathcal{D}_s$ is supported in a closed set $K$ when it is $\mathcal{D}$-supported in $K$.


Proposition 1.4. If $s \in \mathbb{R}$ and $f \in \mathcal{D}_s$ is $\mathcal{D}$-supported in a closed set $K \subset \overline{M}$, then for any neighborhood $K_d$ of $K$, there exists a sequence $f_j \in \mathcal{D}_\infty$, all supported in $K_d$, such that $f_j \to f$ in $\mathcal{D}_s$.
Proof. Pick $\varphi \in C_0^\infty(-d, d)$, $\int \varphi(t)\, dt = 1$, and consider
(1.37) $f_j = \int \varphi_j(t) \cos t\sqrt{-\Delta}\, f\, dt, \quad \varphi_j(t) = j \varphi(jt).$
Integration by parts shows that

$$\Delta^k f_j = \int \varphi_j^{(2k)}(t) \cos t\sqrt{-\Delta}\, f\, dt \in \mathcal{D}_s,$$

for all $k$, so $f_j \in \mathcal{D}_\infty$. That $f_j \to f$ in $\mathcal{D}_s$ is clear. Finally, by Proposition 1.3, each $f_j$ is $\mathcal{D}$-supported in $K_d$, and so by Lemma 1.2 each $f_j$ is supported in $K_d$.


Exercises 


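1. Let $f \in C(\mathbb{R}, \mathcal{D}_s)$. Show that the unique solution $u \in C(\mathbb{R}, \mathcal{D}_{s+1})$ to

$$\frac{\partial^2 u}{\partial t^2} - \Delta u = f, \quad u(0) = u_t(0) = 0$$
is given by
(1.38) $u(t) = \int_0^t \frac{\sin (t - \tau)\sqrt{-\Delta}}{\sqrt{-\Delta}}\, f(\tau)\, d\tau,$

suitably interpreted in case $0 \in \operatorname{Spec}(-\Delta)$. Show that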


(1.39) llu(t)llo.41 + |Deu(t) 


eee: [ l¢@llo. ar. 


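2. Let $u \in C(\mathbb{R}, L^2(M))$ satisfy

$$\frac{\partial^2 u}{\partial t^2} - \Delta u = f \ \text{on } \mathbb{R} \times M, \quad u = 0 \ \text{on } \mathbb{R} \times \partial M, \quad u = 0, \ \text{for } t < 0.$$
Assume $f \in C^k(\mathbb{R} \times \overline{M})$. Show that $u \in H^{k+1}([0, T] \times M)$ for any $T < \infty$.
(Hint: Apply a variant of the $s = 0$ case of Exercise 1 to $\partial_t^j u$, $0 \le j \le k$. Once you have $\partial_t^2 u = g \in C(\mathbb{R}, H_0^1(M)) \cap C^1(\mathbb{R}, L^2(M))$, apply the PDE to write

$$(\partial_t^2 + \Delta) u = 2g - f, \quad u = 0 \ \text{on } \mathbb{R} \times \partial M,$$

and use elliptic regularity. Continue this argument.)

3. Adapt the proof of Hopf's maximum principle, given in §2 of Chap. 5, to the case of the heat equation, proving a stronger version of Proposition 1.1. Establish a version that treats $u(t, x) = e^{t(\Delta - V)} f(x)$, given $V \in C^\infty(\overline{M})$ real-valued and $\ge 0$. Using $e^{-ct} u(t, x) = e^{t(\Delta - V - c)} f(x)$, remove the hypothesis that $V \ge 0$.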


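Exercises 4–10 deal with regularity of solutions to the PDE

(1.40) $\frac{\partial u}{\partial t} - \Delta u = f, \quad u\big|_{\mathbb{R}^+ \times \partial M} = 0.$

We assume that $u \in C(\mathbb{R}^+, \mathcal{D}_1)$. Let $I = [0, T]$.
4. Suppose that (1.40) holds and $u(0) = 0$. Show that

$$f \in L^2(I \times M) \Longrightarrow \Delta u \ \text{and}\ \partial_t u \in L^2(I \times M),$$

and $\|\Delta u\|_{L^2(I \times M)} \le \|f\|_{L^2(I \times M)}$. (Hint: If $u$ is sufficiently smooth, compute $(d/dt) \|(-\Delta)^{1/2} u\|_{L^2(M)}^2$ to show that

$$\|(-\Delta)^{1/2} u(T)\|_{L^2(M)}^2 + 2 \int_0^T \|\Delta u(t)\|_{L^2(M)}^2\, dt = -2 \int_0^T (f(t), \Delta u(t))\, dt \le \int_0^T \bigl[ \|f(t)\|_{L^2}^2 + \|\Delta u(t)\|_{L^2}^2 \bigr]\, dt.)$$

5. Omit the hypothesis that $u(0) = 0$. If $I' = [t_0, T]$ for some $t_0 > 0$, show that

$$f \in L^2(I \times M) \Longrightarrow \Delta u \ \text{and}\ \partial_t u \in L^2(I' \times M).$$

(Hint: Set $v = \varphi(t) u$, where $\varphi \in C^\infty(\mathbb{R})$, $\varphi(t) = 1$ for $t \ge t_0$, 0 for $t \le t_0/2$. Then $\partial_t v - \Delta v = \varphi(t) f + \varphi'(t) u$.)

6. Show that

$$\partial_t f \in L^2(I \times M) \Longrightarrow \partial_t^2 u \in L^2(I' \times M), \ \text{and}\ \partial_t u \in C(I', \mathcal{D}_1).$$
(Hint: If $\delta_h u(t, x) = h^{-1} [u(t + h, x) - u(t, x)]$, consider estimates for $\partial_t(\delta_h u)$, using the PDE $\partial_t(\delta_h u) - \Delta(\delta_h u) = \delta_h f$, and let $h \to 0$.)
7. Deduce that if $\partial_t^j f \in L^2(I \times M)$, for $0 \le j \le k$, then

$$\partial_t^{j+1} u \in L^2(I' \times M), \ \text{and}\ \partial_t^j u \in C(I', \mathcal{D}_1), \quad 0 \le j \le k.$$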


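8. Assume now that

(1.41) $\partial_t f \in L^2(I \times M)$, and $f \in L^2(I, H^2(M))$.

Show that $\Delta u \in L^2(I', H^2(M))$, and hence $u \in L^2(I', H^4(M))$.
(Hint: Note that $\Delta(\Delta u) = \partial_t^2 u - \partial_t f - \Delta f$, while $\Delta u\big|_{\mathbb{R}^+ \times \partial M} = -f\big|_{\mathbb{R}^+ \times \partial M}$. The term $\partial_t^2 u$ is controlled by Exercise 6. For fixed $t$, apply elliptic estimates.)


9. Now assume that

(1.42) $\partial_t^j f \in L^2(I, H^{2k-2j}(M)), \quad 0 \le j \le k.$
Show that
(1.43) $\partial_t^j u \in L^2(I', H^{2k+2-2j}(M)), \quad 0 \le j \le k + 1.$

(Hint: Reason inductively. Note that $\Delta^j u$ satisfies

$$\Delta(\Delta^j u) = \partial_t^{j+1} u - (\partial_t^j + \partial_t^{j-1} \Delta + \cdots + \partial_t \Delta^{j-1} + \Delta^j) f,$$

$$\Delta^j u\big|_{\mathbb{R}^+ \times \partial M} = -(\partial_t^{j-1} + \partial_t^{j-2} \Delta + \cdots + \Delta^{j-1}) f\big|_{\mathbb{R}^+ \times \partial M}.)$$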


10. Deduce in particular that
(1.44) $f \in C^\infty((0, T] \times \overline{M}) \Longrightarrow u \in C^\infty((0, T] \times \overline{M}).$
11. Parallel the results of Exercises 4–10 for solutions to $\partial u/\partial t - \Delta u = f$, given

(a) Neumann boundary condition, $\partial_\nu u\big|_{\mathbb{R}^+ \times \partial M} = 0$,

(b) Robin boundary condition, $(\partial_\nu u - a(x) u)\big|_{\mathbb{R}^+ \times \partial M} = 0$.


2. The heat equation and wave equation on unbounded domains


Here we look at the heat and wave equations on R x M when M is a noncompact 
Riemannian manifold. 

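First we assume that $M$ is complete and without boundary. We construct the
solution to the wave equation

(2.1)  $\dfrac{\partial^2 u}{\partial t^2} - \Delta u = 0$ on $\mathbb{R} \times M$, $\quad u(0,x) = f(x), \quad u_t(0,x) = g(x),$

first under the hypothesis that

(2.2)  $f \in H_0^1(M), \quad g \in L^2(M), \quad \operatorname{supp} f, g \subset K,$

where $K \subset M$ is compact. We produce the unique solution

(2.3)  $u \in C(\mathbb{R}, H^1(M)) \cap C^1(\mathbb{R}, L^2(M))$

having the property that

(2.4)  $\operatorname{supp} u(t)$ is compact in $M$, $\forall\, t \in \mathbb{R}$.

To do this, let $\mathcal{O}_j \subset M$ be compact subsets with smooth boundary, such that
$\mathcal{O}_1 \subset\subset \mathcal{O}_2 \subset\subset \cdots \subset\subset \mathcal{O}_j \subset\subset \cdots \nearrow M$. Given $\operatorname{supp} f, g \subset K$ and $s > 0$, pick
$N$ so large that $K_s \subset \mathcal{O}_N$, where $K_s = \{x \in M : \operatorname{dist}(x, K) \le s\}$. Now
let $\Delta_j$ be the Laplace operator on $\mathcal{O}_j$, with Dirichlet boundary condition, so that
$\cos t\sqrt{-\Delta_j}$ and $(-\Delta_j)^{-1/2} \sin t\sqrt{-\Delta_j}$ are defined on $L^2(\mathcal{O}_j)$, $H_0^1(\mathcal{O}_j)$, and so
forth, as in §1. By finite propagation speed, we see that

(2.5)  $u(t) = \cos t\sqrt{-\Delta_j}\, f + \dfrac{\sin t\sqrt{-\Delta_j}}{\sqrt{-\Delta_j}}\, g, \quad$ for $|t| \le s$, $j \ge N$,

has support in $\mathcal{O}_N$ and is independent of $j \ge N$. This specifies the solution to
(2.1), given (2.2).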
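The finite propagation speed used above can be illustrated in the simplest case. The following is a minimal sketch, assuming $M = \mathbb{R}$ with the flat metric, $g = 0$, and a bump function $f$ supported in $K = [-1, 1]$; the helper names are hypothetical. It uses d'Alembert's formula and checks on a sample grid that $\operatorname{supp} u(t)$ stays inside $K_{|t|} = [-1-|t|, 1+|t|]$.

```python
import math

def bump(x):
    """Smooth bump supported on [-1, 1]; plays the role of f with supp f in K = [-1, 1]."""
    if abs(x) >= 1.0:
        return 0.0
    return math.exp(-1.0 / (1.0 - x * x))

def wave_solution(t, x, f=bump):
    """d'Alembert solution of u_tt = u_xx on R with u(0) = f and u_t(0) = 0."""
    return 0.5 * (f(x + t) + f(x - t))

def support_radius(t, grid_step=0.01, window=10.0):
    """Largest |x| on a sample grid at which u(t, x) is nonzero."""
    n = int(window / grid_step)
    radius = 0.0
    for i in range(-n, n + 1):
        x = i * grid_step
        if wave_solution(t, x) != 0.0:
            radius = max(radius, abs(x))
    return radius

for t in (0.0, 0.5, 2.0):
    # supp u(t) should lie inside K_{|t|} = [-1 - |t|, 1 + |t|]
    print(t, support_radius(t), "<=", 1.0 + abs(t))
```

On a general complete manifold no explicit formula is available; the sketch only illustrates the support statement that makes (2.5) independent of $j \ge N$.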
We can define

(2.6)  $U(t)\{f, g\} = \{u(t), \partial_t u(t)\},$

obtaining a one-parameter family of maps

(2.7)  $U(t) : C_0^\infty(M) \oplus C_0^\infty(M) \longrightarrow C_0^\infty(M) \oplus C_0^\infty(M),$

satisfying the group property

(2.8)  $U(0) = I, \quad U(t_1 + t_2) = U(t_1)U(t_2).$

Also, if $f, g \in C_0^\infty(M)$, the proof of energy conservation given in Chap. 2 works:

(2.9)  $\|df\|^2_{L^2(M)} + \|g\|^2_{L^2(M)} = \|du(t)\|^2_{L^2(M)} + \|\partial_t u(t)\|^2_{L^2(M)},$

for each $t \in \mathbb{R}$. Let us set

(2.10)  $\mathcal{H}$ = completion of $C_0^\infty(M)$ in the norm $\|f\|_{\mathcal{H}} = \|df\|_{L^2(M)}$.

We have the following proposition.

Proposition 2.1. The family of maps $U(t)$ in (2.6) has a unique extension to a
unitary group

(2.11)  $U(t) : \mathcal{H} \oplus L^2(M) \longrightarrow \mathcal{H} \oplus L^2(M).$
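We move on to the heat equation

(2.12)  $\dfrac{\partial u}{\partial t} = \Delta u, \quad u(0,x) = f(x),$

first assuming that $f \in L^2(M)$ has support in a compact set $K$. As with the wave
equation, if $K \subset \mathcal{O}_j$, then $e^{t\Delta_j} f$ is defined by §1. Note that, in that case,

(2.13)  $e^{t\Delta_j} f = \dfrac{1}{\sqrt{4\pi t}} \int_{-\infty}^{\infty} e^{-s^2/4t} \cos s\sqrt{-\Delta_j}\, f \, ds.$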
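On each eigenfunction of $-\Delta_j$ with eigenvalue $\lambda^2$, the identity (2.13) reduces to the scalar formula $e^{-t\lambda^2} = (4\pi t)^{-1/2}\int e^{-s^2/4t}\cos(s\lambda)\,ds$. The sketch below checks this scalar formula numerically with a plain trapezoid rule; the function name, the truncation of the integral, and the step count are illustrative assumptions, not part of the text.

```python
import math

def heat_multiplier_via_cosine(t, lam, steps=20000):
    """Trapezoid approximation of (4 pi t)^(-1/2) * int e^{-s^2/4t} cos(s*lam) ds."""
    half_width = 12.0 * math.sqrt(2.0 * t)   # the Gaussian weight is negligible beyond this
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        s = -half_width + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(-s * s / (4.0 * t)) * math.cos(s * lam)
    return total * h / math.sqrt(4.0 * math.pi * t)

t, lam = 0.5, 3.0
print(heat_multiplier_via_cosine(t, lam), math.exp(-t * lam * lam))
```

The two printed numbers should agree to high accuracy, which is the spectral content of (2.13).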


This suggests considering

(2.14)  $H(t)f(x) = \dfrac{1}{\sqrt{4\pi t}} \int_{-\infty}^{\infty} e^{-s^2/4t}\, W(s)f(x)\, ds,$

where $W(s)f(x) = v(s,x)$ and $v$ solves (2.1), with $g = 0$. Thus, if $f$ is supported on $K$,

(2.15)  $W(s)f(x) = \cos s\sqrt{-\Delta_j}\, f(x) \quad$ if $K_{|s|} \subset \mathcal{O}_j$.

Then

(2.16)  $H(t)f(x) = e^{t\Delta_j} f(x) + \dfrac{1}{\sqrt{4\pi t}} \int_{T_j} e^{-s^2/4t} \big[ W(s)f(x) - \cos s\sqrt{-\Delta_j}\, f(x) \big]\, ds,$

where, assuming $K \subset \mathcal{O}_j$, we set

(2.17)  $T_j = \{s \in \mathbb{R} : \operatorname{dist}(K, \partial\mathcal{O}_j) \le |s|\}.$

Since $\cos s\sqrt{-\Delta_j}$ and (by (2.15)) $W(s)$ have $L^2$-operator norms $\le 1$, we have

(2.18)  $H(t)f = \lim_{j \to \infty} e^{t\Delta_j} f$ in $L^2(M)$,

given $f \in L^2(M)$ with compact support. Here, $e^{t\Delta_j} f(x)$ is set equal to zero on
$M \setminus \mathcal{O}_j$. Thus $H(t)$ extends uniquely to an operator on $L^2(M)$, of norm $\le 1$, and
we have

(2.19)  $H(t)f = \lim_{j \to \infty} e^{t\Delta_j} P_j f$ in $L^2(M)$, $\forall\, f \in L^2(M)$,

where $P_j f(x) = \chi_{\mathcal{O}_j}(x) f(x)$. Material in Chap. 8, §2, will show that $H(t)$ is a
semigroup, whose infinitesimal generator is the unique self-adjoint extension of
$\Delta$ from $C_0^\infty(M)$, when $M$ is a complete Riemannian manifold.

We will show that, for $t > 0$, the operator $H(t)$ has a smooth integral kernel:

(2.20)  $H(t)f(x) = \int_M h(t,x,y)\, f(y)\, dV(y).$


Furthermore, under certain hypotheses on $M$, $h(t,x,y)$ will be shown to decrease
rapidly as $\operatorname{dist}(x,y) \to \infty$, for fixed $t > 0$. Let $U_j$ be open sets in $M$, containing
points $x_j$, and suppose $\rho = \operatorname{dist}(U_1, U_2) = \inf\{\operatorname{dist}(y_1, y_2) : y_j \in U_j\}$. Assume
$f$ is supported in $U_1$. Then finite propagation speed implies that

(2.21)  $H(t)f(x) = \dfrac{1}{\sqrt{4\pi t}} \int_{|s| \ge \rho} e^{-s^2/4t}\, W(s)f(x)\, ds, \quad$ for $x \in U_2$.

Thus, if $R_j f(x) = \chi_{U_j}(x) f(x)$, we have

(2.22)  $\|R_2 H(t) R_1\|_{\mathcal{L}(L^2)} \le \dfrac{1}{\sqrt{4\pi t}} \int_{|s| \ge \rho} e^{-s^2/4t}\, ds \le e^{-\rho^2/4t},$

since, for $T > 0$,

(2.23)  $\int_T^\infty e^{-s^2/4}\, ds = e^{-T^2/4} \int_0^\infty e^{-Ts/2 - s^2/4}\, ds \le \sqrt{\pi}\, e^{-T^2/4}.$
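The tail bound (2.23) and the resulting estimate (2.22) can be checked against closed forms. The sketch below relies on the identities $\int_T^\infty e^{-s^2/4}\,ds = \sqrt{\pi}\,\operatorname{erfc}(T/2)$ and $(4\pi t)^{-1/2}\int_{|s|\ge\rho} e^{-s^2/4t}\,ds = \operatorname{erfc}(\rho/2\sqrt{t})$, which follow by the substitutions $s = 2x$ and $s = 2\sqrt{t}\,x$; the function names are hypothetical.

```python
import math

def gaussian_tail(T):
    """Exact value of int_T^inf e^{-s^2/4} ds, namely sqrt(pi) * erfc(T/2)."""
    return math.sqrt(math.pi) * math.erfc(T / 2.0)

def tail_bound(T):
    """Right side of (2.23): sqrt(pi) * e^{-T^2/4}."""
    return math.sqrt(math.pi) * math.exp(-T * T / 4.0)

def off_diagonal_integral(t, rho):
    """Middle term of (2.22), which equals erfc(rho / (2 sqrt t))."""
    return math.erfc(rho / (2.0 * math.sqrt(t)))

for T in (0.0, 0.5, 1.0, 3.0, 10.0):
    print(T, gaussian_tail(T), "<=", tail_bound(T))

for t, rho in ((0.1, 1.0), (1.0, 1.0), (5.0, 0.3)):
    print(t, rho, off_diagonal_integral(t, rho), "<=", math.exp(-rho * rho / (4.0 * t)))
```

Both comparisons are instances of the elementary inequality $\operatorname{erfc}(x) \le e^{-x^2}$ for $x \ge 0$.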


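To estimate derivatives, we can use the equation $\partial_s^2 W(s) = \Delta W(s)$ and integrate
by parts, to write

(2.24)  $\Delta^k H(t)f(x) = \dfrac{1}{\sqrt{4\pi t}} \int_{|s| \ge \rho} \big(\partial_s^{2k} e^{-s^2/4t}\big)\, W(s)f(x)\, ds,$

given $x \in U_2$, $\operatorname{supp} f \subset U_1$. Now there are estimates of the form

(2.25)  $\big|\partial_s^{2k} e^{-s^2/4t}\big| \le C_k\, t^{-k} (1 + t^{-1}s^2)^k\, e^{-s^2/4t}.$

Hence

(2.26)  $\|R_2 \Delta^k H(t) R_1\|_{\mathcal{L}(L^2)} \le C_k\, t^{-k} \int_{\rho/\sqrt{t}}^\infty (1 + s^2)^k e^{-s^2/4}\, ds \le C_k'\, t^{-k} (1 + t^{-1}\rho^2)^k\, e^{-\rho^2/4t},$

the last inequality following by an appropriate variant of (2.23). Pick $k > n/4$,
where $n = \dim M$. There is a Sobolev estimate of the form

(2.27)  $|f(x_2)| \le C(U_2) \big[ \|\Delta^k f\|_{L^2(U_2)} + \|f\|_{L^2(U_2)} \big],$

so we have

(2.28)  $\|h(t, x_2, \cdot)\|_{L^2(U_1)} \le C' C(U_2)\, \big(1 + t^{-k}(1 + t^{-1}\rho^2)^k\big)\, e^{-\rho^2/4t}.$

By symmetry and another application of the argument above, we have

(2.29)  $|h(t, x_2, x_1)| \le C' C(U_1) C(U_2)\, \big(1 + t^{-k}(1 + t^{-1}\rho^2)^k\big)^2\, e^{-\rho^2/4t}.$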


Similarly, one can estimate higher derivatives. We have the following. 


Proposition 2.2. If $M$ is a complete Riemannian manifold of dimension $n$, the
operator $H(t)$ given by (2.14) and (2.19) has integral kernel $h(t,x,y)$, smooth
on $(0,\infty) \times M \times M$, and satisfying an estimate

(2.30)  $0 \le h(t,x,y) \le C\, K(x,\delta) K(y,\delta)\, \big(1 + t^{-k}(1 + t^{-1}\rho^2)^k\big)^2\, e^{-\rho^2/4t},$

where $\operatorname{dist}(x,y) = \rho + 2\delta$, $K(x,\delta) = C(U)$, for $U$ the ball of radius $\delta$, centered
at $x$, and $k > n/4$.

The positivity in (2.30) follows from the positivity of the heat kernels
$h_j(t,x,y)$ of $e^{t\Delta_j}$ (set equal to zero if $x$ or $y$ is in $M \setminus \mathcal{O}_j$). In fact, using
the maximum principle for the heat equation on $\mathbb{R}^+ \times \mathcal{O}_j$, we obtain

(2.31)  $0 \le h_j(t,x,y) \nearrow h(t,x,y)$ as $j \to \infty$.

In some cases, such as when $M$ is a homogeneous space, perhaps with a
compactly supported perturbation in the metric, one has a uniform estimate
$K(x,\delta) \le c(\delta)$, independent of $x \in M$. Then (2.30) implies the very rapid
decay of $h(t,x,y)$ as $\operatorname{dist}(x,y) \to \infty$. Estimates of the form (2.30) will be of
occasional use later, for example, in Chap. 8, §3. Somewhat sharper estimates are
proved in [CGT]. Other approaches to heat kernel estimates can be found in [CLY]
and [Dav].

The results above can be extended to the case of $M$, a complete Riemannian
manifold with (smooth) boundary. On $\partial M$, one could place one of a number of
boundary conditions, such as Dirichlet or Neumann. Of course, it is no longer true
that the solution operator $U(t)$ as in (2.6) preserves $C_0^\infty(M) \oplus C_0^\infty(M)$, but we
do have such results as

(2.32)  $U(t) : H^1_{0,\mathrm{comp}}(M) \oplus L^2_{\mathrm{comp}}(M) \longrightarrow H^1_{0,\mathrm{comp}}(M) \oplus L^2_{\mathrm{comp}}(M),$

in the case of the Dirichlet boundary condition, where $H^1_{0,\mathrm{comp}}(M)$ consists of
functions $u \in H_0^1(M)$ such that $u$ is supported on a compact subset of $M$. We
leave further details on such extension of the results above to the reader.

We now discuss when the heat kernel $h(t,x,y)$ satisfies

(2.33)  $\int_M h(t,x,y)\, dV(x) = 1,$

for all $t > 0$, $y \in M$. This has probabilistic significance. If $M$ is compact (without
boundary), (2.33) is clear. If $M$ has boundary, and one uses the Dirichlet boundary
condition, then (2.33) fails, but it continues to hold if the Neumann boundary
condition is used.
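These statements can be made concrete in one dimension. The sketch below assumes the Euclidean kernel $(4\pi t)^{-1/2}e^{-(x-y)^2/4t}$ on $M = \mathbb{R}$ and the method-of-images Dirichlet kernel on the half-line $[0,\infty)$, neither of which is treated explicitly in the text; all function names are illustrative. The total mass is 1 on the whole line, while on the Dirichlet half-line it equals $\operatorname{erf}(y/2\sqrt{t}) < 1$.

```python
import math

def gauss(t, z):
    """Euclidean heat kernel on R as a function of z = x - y."""
    return math.exp(-z * z / (4.0 * t)) / math.sqrt(4.0 * math.pi * t)

def trapezoid(func, a, b, steps=20000):
    """Composite trapezoid rule on [a, b]."""
    h = (b - a) / steps
    total = 0.5 * (func(a) + func(b))
    for i in range(1, steps):
        total += func(a + i * h)
    return total * h

def mass_whole_line(t, y):
    reach = 12.0 * math.sqrt(2.0 * t)   # kernel is negligible beyond this distance
    return trapezoid(lambda x: gauss(t, x - y), y - reach, y + reach)

def mass_dirichlet_half_line(t, y):
    reach = 12.0 * math.sqrt(2.0 * t)
    # Method of images: subtract the kernel reflected across x = 0.
    return trapezoid(lambda x: gauss(t, x - y) - gauss(t, x + y), 0.0, y + reach)

t, y = 0.3, 0.7
print(mass_whole_line(t, y))                                     # close to 1
print(mass_dirichlet_half_line(t, y), math.erf(y / (2.0 * math.sqrt(t))))
```

The mass lost in the Dirichlet case corresponds, in the probabilistic picture, to Brownian paths absorbed at the boundary before time $t$.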

If $M$ is a complete Riemannian manifold (without boundary), then we always
have

(2.34)  $\int_M h(t,x,y)\, dV(x) \le 1,$

as a consequence of (2.31), but (2.33) may fail in some cases; some examples are
given in [Az]. In "nice" cases, such as when $M$ has bounded geometry, (2.33)
does not fail, as we will now show.

Note that (2.33) holds if and only if

(2.35)  $\int_M e^{t\Delta} f(x)\, dV(x) = \int_M f(x)\, dV(x), \quad$ for all $f \in C_0^\infty(M)$,

given that $M$ is complete. Our approach to specifying a class of $M$ for which
(2.35) holds will use the identity (2.14). Given $f \in L^2(M)$, the integral on the
right side of (2.14) is convergent in $L^2(M)$. As long as $M$ is complete, we have

(2.36)  $\int_M \cos s\sqrt{-\Delta}\, f(x)\, dV(x) = \int_M f(x)\, dV(x),$

for all $s \in \mathbb{R}$, $f \in C_0^\infty(M)$. Consequently, (2.35) holds, for a given $t > 0$,
$f \in C_0^\infty(M)$, whenever it can be shown that, for some $\beta < 1/4t$,

(2.37)  $\|\cos s\sqrt{-\Delta}\, f\|_{L^1(M)} \le C e^{\beta s^2}.$

Three ingredients go into the estimate of this $L^1$-norm. Two are Cauchy's
inequality plus the fact that the $L^2$-operator norm of $\cos s\sqrt{-\Delta}$ is $\le 1$, yielding

(2.38)  $\|\cos s\sqrt{-\Delta}\, f\|_{L^1(B_{s+\sigma}(p))} \le \|\cos s\sqrt{-\Delta}\, f\|_{L^2(M)} \big(\operatorname{vol} B_{s+\sigma}(p)\big)^{1/2} \le \|f\|_{L^2(M)} \big(\operatorname{vol} B_{s+\sigma}(p)\big)^{1/2},$

where $B_{s+\sigma}(p) = \{x \in M : \operatorname{dist}(x,p) \le s + \sigma\}$. The third ingredient is finite
propagation speed; if $f$ is supported on $B_\sigma(p)$, then $\cos s\sqrt{-\Delta}\, f$ is supported on
$B_{s+\sigma}(p)$, so the left side of (2.38) is all of $\|\cos s\sqrt{-\Delta}\, f\|_{L^1(M)}$. Consequently,
given $t > 0$, (2.35) holds provided that, for some $\beta < 1/4t$, we have a volume
estimate:

(2.39)  $\operatorname{vol} B_{s+\sigma}(p) \le C(\sigma)\, e^{\beta s^2}, \quad \forall\, s > 0,\ \sigma > 0.$

In other words, if (2.39) holds, then (2.35) holds for all $t < 1/4\beta$. Then (2.35)
extends to all $f \in L^1(M)$, for $t < 1/4\beta$. Consequently, it holds for all $t > 0$, and
so does (2.33), as long as (2.39) holds, for some $\beta$. Note that (2.39) follows from
the estimate $\operatorname{vol} B_s(p) \le C e^{\beta s^2/2}$, since $(s+\sigma)^2 \le 2s^2 + 2\sigma^2$. Relabeling $\beta$, we summarize what has been
shown.


Proposition 2.3. If $M$ is a complete Riemannian manifold satisfying, for some
$\beta < \infty$, the volume estimate

(2.40)  $\operatorname{vol} B_s(p) \le C_p\, e^{\beta s^2},$

then (2.33) and (2.35) hold.


Exercises 


1. Let $V(t)$ denote the solution operator to the following variant of (2.1):

$\dfrac{\partial^2 u}{\partial t^2} - (\Delta - 1)u = 0, \quad u(0) = f, \quad u_t(0) = g.$

Assume $M$ is complete. Show that $V(t)$ preserves $C_0^\infty(M) \oplus C_0^\infty(M)$ and has a unique
extension to a unitary group on $H_0^1(M) \oplus L^2(M)$, where $H_0^1(M)$ is, as usual, the
completion of $C_0^\infty(M)$ in the norm defined by $\|f\|^2_{H^1} = \|df\|^2_{L^2(M)} + \|f\|^2_{L^2(M)}$.

2. Verify the estimates of the $s$-derivatives of $e^{-s^2/4t}$, given in (2.25).
3. Maxwell’s equations 


Maxwell's equations for the propagation of electromagnetic waves in a vacuum
are written as follows, as seen in §11 of Chap. 2:

(3.1)  $\dfrac{\partial E}{\partial t} = \operatorname{curl} B, \quad \dfrac{\partial B}{\partial t} = -\operatorname{curl} E,$

and

(3.2)  $\operatorname{div} E = 0, \quad \operatorname{div} B = 0.$

Here, $E$ is the electric field and $B$ is the magnetic field, both vector fields in a
region of $\mathbb{R}^3$, and both varying with time $t$. Units are chosen so the speed of light
$c$ is 1. If the region $M$ in $\mathbb{R}^3$ is bounded by a "perfect conductor," one sets the
boundary conditions

(3.3)  $\nu \times E = 0, \quad \nu \cdot B = 0$ on $\partial M$.

We will investigate the initial-value problem for (3.1)-(3.3), where $E(0,x)$ and
$B(0,x)$ are specified, subject to the condition (3.3).

We will transform (3.1)-(3.3) into a system of equations for
1-forms on $M$ rather than vector fields on $M$, using the metric tensor to identify
these, and then make contact with material developed in §9 of Chap. 5. If we
let $\tilde E$, $\tilde B$ be 1-forms on $M$ corresponding to the vector fields $E$ and $B$, then the
equations above become, respectively,

(3.4)  $\dfrac{\partial \tilde E}{\partial t} = *d\tilde B, \quad \dfrac{\partial \tilde B}{\partial t} = -*d\tilde E,$

(3.5)  $\delta\tilde E = 0, \quad \delta\tilde B = 0,$

and

(3.6)  $\nu \wedge \tilde E = 0, \quad \iota_\nu \tilde B = 0$ on $\partial M$.

Here, $\tilde E = \tilde E(t,x)$, $\tilde B = \tilde B(t,x)$, and of course $d$ and $\delta$ involve only differentiation
in the $x$-variables. The identity $\operatorname{div} E = -\delta\tilde E$ was demonstrated in Chap. 2;
see (10.25) of Chap. 2. Note that we are using forms to describe the electromagnetic
field in a completely different way than that used in §11 of Chap. 2. Here we
are considering functions of $t$ taking values in spaces of forms on a 3-fold, rather
than forms on a 4-fold.

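We can define the energy of the field $(\tilde E, \tilde B)$ to be

(3.7)  $\mathcal{E}(t) = \|\tilde E(t)\|^2_{L^2(M)} + \|\tilde B(t)\|^2_{L^2(M)}.$

The following result expresses conservation of energy and of course also gives a
uniqueness result.

Proposition 3.1. If $\tilde E, \tilde B \in H^1([T_1,T_2] \times M)$ satisfy (3.4) and the first part of
(3.6), then

(3.8)  $\dfrac{d\mathcal{E}}{dt} = 0, \quad$ for $t \in (T_1, T_2)$,

so

(3.9)  $\mathcal{E}(t_1) = \mathcal{E}(t_2), \quad$ for $t_j \in (T_1, T_2)$.

Proof. We have, by (3.4),

$\dfrac{d\mathcal{E}}{dt} = 2(\partial_t\tilde E, \tilde E) + 2(\partial_t\tilde B, \tilde B) = 2(*d\tilde B, \tilde E) - 2(*d\tilde E, \tilde B).$

Now

$(*d\tilde B, \tilde E) = (\delta{*}\tilde B, \tilde E) = (*\tilde B, d\tilde E) = (\tilde B, *d\tilde E),$

where the second identity uses the hypothesis that $\nu \wedge \tilde E = 0$ on $\partial M$. Thus (3.8)
is proved.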


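In order to establish the existence of solutions to (3.4)-(3.6), we will produce
a second-order wave equation satisfied by $(\tilde E, \tilde B)$, which can be solved by the
methods of §1. Note that if we set

(3.10)  $v = \dfrac{\partial\tilde E}{\partial t} - *d\tilde B, \quad w = \dfrac{\partial\tilde B}{\partial t} + *d\tilde E,$

then a short computation gives

(3.11)  $\dfrac{\partial v}{\partial t} + *dw = \dfrac{\partial^2\tilde E}{\partial t^2} + \delta d\tilde E, \qquad \dfrac{\partial w}{\partial t} - *dv = \dfrac{\partial^2\tilde B}{\partial t^2} + \delta d\tilde B.$

If (3.5) holds, then we can replace $\delta d\tilde E$ and $\delta d\tilde B$ in (3.11) by $-\Delta\tilde E$ and $-\Delta\tilde B$,
respectively. Now, if (3.4) holds, then $v = w = 0$, and hence (3.11) implies

(3.12)  $\dfrac{\partial^2\tilde E}{\partial t^2} - \Delta\tilde E = 0, \quad \dfrac{\partial^2\tilde B}{\partial t^2} - \Delta\tilde B = 0.$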
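The short computation behind (3.11) can be spelled out as follows. This is a sketch under two assumptions that should be checked against the conventions of Chap. 5: on a 3-dimensional Riemannian manifold $** = 1$ on forms of every degree, and $\delta = *d*$ on 2-forms (so that $\delta d = *d{*}d$ on 1-forms). It also uses that $\partial_t$ commutes with $d$ and $*$, since these act only in the $x$-variables.

```latex
\begin{align*}
\partial_t v + *dw
  &= \partial_t^2 \tilde E - *d\,\partial_t \tilde B
     + *d\,\partial_t \tilde B + *d{*}d\tilde E
   = \partial_t^2 \tilde E + \delta d\tilde E, \\
\partial_t w - *dv
  &= \partial_t^2 \tilde B + *d\,\partial_t \tilde E
     - *d\,\partial_t \tilde E + *d{*}d\tilde B
   = \partial_t^2 \tilde B + \delta d\tilde B .
\end{align*}
```

In each line the mixed terms $\pm *d\,\partial_t(\cdot)$ cancel, which is why the first-order system squares to a pair of decoupled second-order equations.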


The appropriate boundary conditions for $\tilde E$ and $\tilde B$ are

(3.13)  $\nu \wedge \tilde E = 0, \quad \nu \wedge \delta\tilde E = 0$ on $\partial M$ (relative boundary conditions),

and

(3.14)  $\iota_\nu \tilde B = 0, \quad \iota_\nu d\tilde B = 0$ on $\partial M$ (absolute boundary conditions),

where the last boundary condition is derived from (3.4) and (3.13), together with
the fact that

$\nu \wedge *d\tilde B = 0 \iff \iota_\nu d\tilde B = 0.$

Now the existence of solutions to the initial value problem

(3.15)  $\tilde E(0,x) = \tilde E_0(x), \quad \tilde E_t(0,x) = \tilde E_1(x), \qquad \tilde B(0,x) = \tilde B_0(x), \quad \tilde B_t(0,x) = \tilde B_1(x),$

follows from the methods of §1, given the material on the Hodge Laplacian with
relative or absolute boundary conditions in §9 of Chap. 5.

It remains to show that solving (3.12)-(3.15) produces solutions to the initial-value
problem for (3.4)-(3.6). We have the following result.

Proposition 3.2. Let $(\tilde E, \tilde B)$ solve (3.12)-(3.15), and suppose the initial data in
(3.15) satisfy

(3.16)  $\delta\tilde E_0 = 0, \quad \delta\tilde B_0 = 0$

and

(3.17)  $\tilde E_1 = *d\tilde B_0, \quad \tilde B_1 = -*d\tilde E_0.$

Then $(\tilde E, \tilde B)$ satisfies Maxwell's equations (3.4) and (3.5).

Proof. To see that solving the wave equations, (3.12)-(3.14), preserves the property
of being annihilated by $\delta$, note that the eigenfunctions of $\Delta$ with either of
these boundary conditions can be arranged to belong to one of the terms in the
Hodge decomposition; see Exercise 5 of §9, Chap. 5. Thus (3.16) yields (3.5). It
remains to prove (3.4). For this, define $v$, $w$ by (3.10), so (3.17) implies

(3.18)  $v(0,x) = 0, \quad w(0,x) = 0.$

On the other hand, (3.12) plus $\delta\tilde E = \delta\tilde B = 0$ implies that (3.11) vanishes, that is,

(3.19)  $\dfrac{\partial v}{\partial t} = -*dw, \quad \dfrac{\partial w}{\partial t} = *dv.$

Furthermore, the boundary conditions (3.13) and (3.14) imply

(3.20)  $\nu \wedge v = 0$ and $\iota_\nu w = 0$ on $\partial M$.

Consequently, Proposition 3.1 applies to the pair $(v, -w)$, so $v$ and $w$ are identically
zero. This finishes the proof.


Exercises 


1. Suppose $(E, B)$ solve Maxwell's equations (3.1)-(3.2) and the boundary condition

$\nu \times E = 0, \quad$ for $(t,x) \in \mathbb{R} \times \partial M$.

Suppose that $\nu \cdot B = 0$ on $\partial M$ at $t = 0$. Show that $\nu \cdot B = 0$ on $\partial M$ for all $t$.
(Hint: Compare $(E, B)$ to the solution discussed in Proposition 3.2.)
What can you say if you drop the hypothesis that $\nu \cdot B = 0$ on $\partial M$ at $t = 0$?


4. The Cauchy—Kowalewsky theorem 


The Cauchy–Kowalewsky theorem, in the linear case, asserts the local existence
of a real analytic solution to the "Cauchy problem"

(4.1)  $\dfrac{\partial^m u}{\partial t^m} + \sum_{j=0}^{m-1} \sum_{|\alpha| \le m-j} A_{j\alpha}(t,x)\, \dfrac{\partial^\alpha}{\partial x^\alpha} \dfrac{\partial^j u}{\partial t^j} = f(t,x),$
       $u(t_0,x) = g_0(x), \ \dots,\ \partial_t^{m-1} u(t_0,x) = g_{m-1}(x),$

given that $A_{j\alpha}(t,x)$ and $f(t,x)$ are real analytic on a neighborhood of $(t_0, x_0)$ in
$\mathbb{R}^{n+1}$ and $g_0, \dots, g_{m-1}$ are real analytic on a neighborhood of $x_0$ in $\mathbb{R}^n$. There is
no loss of generality in taking $t_0 = 0$, $x_0 = 0$.

Any system of the form (4.1) can be converted to a first-order system

(4.2)  $\dfrac{\partial u}{\partial t} = L(t,x)\partial_x u + L_0(t,x)u + f, \quad u(0,x) = g(x),$

where $L(t,x)\partial_x = \sum_{j=1}^n L_j(t,x)\, \partial/\partial x_j$. Here we assume that $L_j(t,x)$ are real
analytic, $K \times K$, matrix-valued functions, and $f$ and $g$ are real analytic, with
values in $\mathbb{C}^K$. Note that if (4.2) holds, then

(4.3)  $\partial_t^{j+1} u = \sum_{\ell=0}^{j} \binom{j}{\ell} \Big[ (\partial_t^{j-\ell} L)\, \partial_x \partial_t^\ell u + (\partial_t^{j-\ell} L_0)\, \partial_t^\ell u \Big] + \partial_t^j f.$

In particular, we inductively have $\partial_t^{j+1} u(0,x)$ uniquely determined. Thus (4.2)
has at most one real analytic, local solution $u$.

On the other hand, if we can use (4.3) to get sufficiently good estimates on
$\partial_t^{j+1} u(0,x) = u_{j+1}(x)$ that the power series

(4.4)  $u(t,x) = \sum_{j=0}^{\infty} \dfrac{1}{j!}\, u_j(x)\, t^j$

is shown to converge, for $t$ in some neighborhood of 0, then (4.4) furnishes the
solution to (4.2). To be more precise, we set $u_0(x) = g(x)$ and define $u_{j+1}(x)$
inductively by

(4.5)  $u_{j+1}(x) = \sum_{\ell=0}^{j} \sum_{\nu} \binom{j}{\ell}\, \partial_t^{j-\ell} L_\nu(0,x) \cdot \partial_\nu u_\ell(x) + \partial_t^j f(0,x).$

We sum over $0 \le \nu \le n$ and make the convention that $\partial_\nu = \partial/\partial x_\nu$ for $\nu \ge 1$,
while $\partial_0 u = u$. Our goal will be to get estimates on $u_{j+1}(x)$ guaranteeing the
local convergence of (4.4).
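The recursion (4.5) can be carried out by hand in the simplest scalar case. The sketch below assumes the model problem $u_t = u_x$ (so $L = 1$, $L_0 = 0$, $f = 0$, and (4.5) reduces to $u_{j+1} = \partial_x u_j$) with data $g(x) = 1/(1-x)$, represented by a truncated power series; these choices and all function names are illustrative only. The exact solution is $u(t,x) = 1/(1 - x - t)$, so at $x = 0$ the series (4.4) is the geometric series in $t$.

```python
from math import factorial

def d_dx(coeffs):
    """Derivative of a polynomial given by ascending integer coefficients."""
    return [k * coeffs[k] for k in range(1, len(coeffs))] or [0]

def taylor_coefficients_in_t(g, order):
    """u_0 = g and u_{j+1} = d/dx u_j: recursion (4.5) for the model problem u_t = u_x."""
    u = [list(g)]
    for _ in range(order):
        u.append(d_dx(u[-1]))
    return u

N = 30
g = [1] * (N + 1)                        # truncated power series of 1 / (1 - x)
u = taylor_coefficients_in_t(g, N)

# u_j(0) / j! is the coefficient of t^j in (4.4) at x = 0.
ratios = [u[j][0] // factorial(j) for j in range(N + 1)]
partial_sum = sum(u[j][0] * 0.5 ** j / factorial(j) for j in range(N + 1))

print(ratios[:5])      # coefficients of the geometric series
print(partial_sum)     # approximates 1 / (1 - 0.5)
```

Here $u_j(0) = j!$ exactly, which is the kind of factorial growth that estimates such as (4.12) are designed to control.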

As illustrated in results on vector fields with real analytic coefficients (say on
an open set $U \subset \mathbb{R}^n$) in Chap. 1, it is often useful to extend the real analytic
coefficients and other data to holomorphic functions, defined on a neighborhood
of $U$ in $\mathbb{C}^n$. Here we will similarly extend $L(t,x)$, $f(t,x)$, and $g(x)$ as functions
holomorphic in $x$, in a neighborhood of $0 \in \mathbb{C}^n$. We keep $t$ real, for now. Without
loss of generality, we can suppose that $L(t,z)$, $f(t,z)$, and $g(z)$ are all holomorphic
for $z$ in a neighborhood of the closed unit ball $\overline{B} \subset \mathbb{C}^n$, with real analytic
dependence on $t$, for $|t| \le 1$.

We will use the Banach spaces $\mathfrak{H}_j$ of functions $f$, holomorphic on $B$, having
the property that

(4.6)  $N_j(f) = \sup_{z \in B} \delta(z)^j\, |f(z)|$

is finite, where $\delta(z) = 1 - |z|$ is the distance of $z$ from $\partial B$. We will inductively
obtain estimates for $N_j(u_j)$. From (4.5), we have

(4.7)  $N_{j+1}(u_{j+1}) \le \sum_{\ell=0}^{j} \sum_{\nu} \binom{j}{\ell}\, \|\partial_t^{j-\ell} L_\nu(0)\|_{L^\infty(B)}\, N_{j+1}(\partial_\nu u_\ell) + N_{j+1}(\partial_t^j f).$

A key estimate is that, for a certain constant $\gamma$, depending only on $n$, we have

(4.8)  $N_{j+1}(\partial_{z_k} u_\ell) \le \gamma (j+1)\, N_j(u_\ell).$

In order not to interrupt the flow of the argument, we establish this in Appendix A;
see (A.8). Since

(4.9)  $N_j(v) \le N_\ell(v), \quad$ for $\ell \le j$,

we have

(4.10)  $N_{j+1}(u_{j+1}) \le \gamma (j+1) \sum_{\ell=0}^{j} \sum_{\nu} \binom{j}{\ell}\, \|\partial_t^{j-\ell} L_\nu(0)\|_{L^\infty(B)}\, N_\ell(u_\ell) + N_{j+1}(\partial_t^j f).$

Given the hypothesis on $L$, we can assume there are estimates of the form

(4.11)  $\sum_{\nu} \|\partial_t^m L_\nu(0)\|_{L^\infty(B)} \le C_1 \lambda^m\, m!,$

for certain constants $C_1$ and $\lambda$. Now, our inductive hypothesis on $u_\ell$ is that there
exist constants $C_2$ and $\mu$ such that

(4.12)  $N_\ell(u_\ell) \le C_2\, \mu^\ell\, \ell!, \quad 0 \le \ell \le j.$

The $\ell = 0$ case follows from our hypothesis on $g(x)$. We can also assume that,
for all $j$,

(4.13)  $N_{j+1}(\partial_t^j f) \le C_2\, \mu^j\, (j+1)!.$

Substitution of these estimates into (4.10) yields

(4.14)  $N_{j+1}(u_{j+1}) \le \gamma C_1 C_2\, (j+1)! \sum_{\ell=0}^{j} \lambda^{j-\ell} \mu^\ell + C_2\, \mu^j\, (j+1)!.$

We are permitted to assume that $\mu \ge 2\lambda$ and $\mu \ge 2\gamma C_1 + 1$. Then $\sum_{\ell=0}^{j} \lambda^{j-\ell} \mu^\ell \le 2\mu^j$, so we have

(4.15)  $N_{j+1}(u_{j+1}) \le C_2\, (j+1)!\, (2\gamma C_1)\, \mu^j + C_2\, \mu^j\, (j+1)! \le C_2\, \mu^{j+1}\, (j+1)!.$

This completes the induction; in other words

(4.16)  $N_j(u_j) \le C_2\, \mu^j\, j!, \quad$ for all $j$.
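The two elementary steps used in passing from (4.14) to (4.15) can be written out as follows; this is a routine verification under the stated assumptions $\mu \ge 2\lambda$ and $\mu \ge 2\gamma C_1 + 1$, with $\lambda, \mu > 0$.

```latex
\begin{align*}
\sum_{\ell=0}^{j} \lambda^{j-\ell}\mu^{\ell}
  &= \mu^{j} \sum_{i=0}^{j} \Big(\frac{\lambda}{\mu}\Big)^{i}
   \le \mu^{j} \sum_{i=0}^{\infty} 2^{-i} = 2\mu^{j}, \\
\gamma C_1 C_2 (j+1)!\, \big(2\mu^{j}\big) + C_2\, \mu^{j} (j+1)!
  &= C_2\, \mu^{j} (j+1)!\, \big(2\gamma C_1 + 1\big)
   \le C_2\, \mu^{j+1} (j+1)! .
\end{align*}
```

The first line is where the gap between $\lambda$ and $\mu$ is used: if $\lambda = \mu$, the sum equals $(j+1)\mu^j$ and the factor $j+1$ can no longer be absorbed into a fixed constant.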


We hence have the following proposition. 


Proposition 4.1. Given the real analyticity hypotheses on (4.1), there is a unique
real analytic solution $u(t,x)$ on a neighborhood of $(t_0, x_0)$ in $\mathbb{R}^{n+1}$. The size
of the region on which $u(t,x)$ is defined and analytic depends on the size of the
regions to which the coefficients and data of (4.1) have holomorphic extensions,
in a fashion determined by (4.11), (4.12), and (4.16).


Another approach to the use of estimates of the form (4.8) to prove the linear 
Cauchy—Kowalewsky theorem can be found in [Ho]. 

We restate the Cauchy–Kowalewsky theorem in a coordinate-invariant fashion.
Let $S$ be a smooth hypersurface in an open set $\mathcal{O} \subset \mathbb{R}^n$. We say that $S$ is noncharacteristic
for a differential operator $P = p(x, D)$ of order $m$ if, for each $x \in S$,
$\sigma_P(x, \nu) = p_m(x, \nu)$ is invertible, where $\nu$ is a nonvanishing normal to $S$ at $x$.
Now assume that $p(x, D)$ has real analytic coefficients and $S$ is a real analytic
hypersurface. Let $Y$ be a real analytic vector field transverse to $S$. We consider
the following Cauchy problem:

(4.17)  $p(x, D)u = f, \quad u\big|_S = g_0, \quad Yu\big|_S = g_1, \ \dots,\ Y^{m-1}u\big|_S = g_{m-1}.$

Then, on a neighborhood of any given $x_0 \in S$, we can make a real analytic change
of variable such that, for some real analytic invertible $A(x)$, $Q = A(x)^{-1} p(x, D)$
has the form of the operator in (4.1) and $S$ is given by $t = 0$. We do not
claim that $Y = \partial/\partial t$, but clearly $\partial_t^j u\big|_S$ can be determined inductively from
$u\big|_S, \dots, Y^j u\big|_S$ and vice versa. Then, with new $f$ and $g_j$, (4.17) acquires the
form (4.1), so we have:

Proposition 4.2. If $p(x, D)$ is a differential operator of order $m$ with real analytic
coefficients on $\mathcal{O}$, $S$ is a real analytic hypersurface in $\mathcal{O}$, $Y$ is a real analytic
vector field transverse to $S$, and $f$ and $g_j$ are real analytic, then there exists a
unique real analytic solution to (4.17), on some neighborhood of $S$.
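The noncharacteristic hypothesis is easy to test in a scalar example. The sketch below assumes the operator $\partial_t^2 - c^2\partial_x^2$ in the plane, whose principal symbol (with the convention $D = -i\partial$, an assumption about normalization) is $p_2(\tau, \xi) = -\tau^2 + c^2\xi^2$; for a scalar operator, invertibility of $p_m(x,\nu)$ just means that it is nonzero. The function names are hypothetical.

```python
def principal_symbol_wave(tau, xi, c=1.0):
    """Principal symbol of d_t^2 - c^2 d_x^2 under the convention D = -i d."""
    return -(tau * tau) + c * c * xi * xi

def is_noncharacteristic(normal, c=1.0, tol=1e-12):
    """A line with conormal (tau, xi) is noncharacteristic iff the symbol is nonzero there."""
    tau, xi = normal
    return abs(principal_symbol_wave(tau, xi, c)) > tol

print(is_noncharacteristic((1.0, 0.0)))    # the line t = 0
print(is_noncharacteristic((0.0, 1.0)))    # the line x = 0
print(is_noncharacteristic((1.0, -1.0)))   # the line t = x, a characteristic for c = 1
```

On the characteristic line $t = x$ the equation no longer determines the highest normal derivative from the Cauchy data, so the reduction to the form (4.1) breaks down there.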


Given the linear Cauchy—Kowalewsky theorem, we proceed to a uniqueness 
result of Holmgren. 


Proposition 4.3. Let $P = p(x, D)$ be a differential operator of order $m$, with
real analytic coefficients on an open set $\mathcal{O} \subset \mathbb{R}^n$, and let $S \subset \mathcal{O}$ be a smooth,
noncharacteristic hypersurface. Suppose that $u \in H^m(\mathcal{O})$ solves

(4.18)  $p(x, D)u = 0$ on $\mathcal{O}$, $\quad u\big|_S = 0, \quad Yu\big|_S = 0, \ \dots,\ Y^{m-1}u\big|_S = 0,$

where $Y$ is a smooth vector field transverse to $S$. Then $u = 0$ on a neighborhood
of $S$.


Proof. We can assume that $\mathcal{O} \setminus S$ has two connected components, $\mathcal{O}^+$ and $\mathcal{O}^-$.
Alter $u$ to produce $v(x)$, equal to $u(x)$ for $x \in \mathcal{O}^+$ and to 0 for $x \in \mathcal{O}^-$. Then the
hypothesis (4.18) implies

(4.19)  $v \in H^m(\mathcal{O}), \quad p(x, D)v = 0$ on $\mathcal{O}$.

Pick $x_0 \in S$. If $S$ is noncharacteristic at $x_0$, then there exists a real analytic
hypersurface $\Sigma_0$, tangent to $S$ at $x_0$. Cutting down $\mathcal{O}$ if necessary, we can make
a real analytic change of variable so that $Q = A(x)^{-1} p(x, D)$ has the form (4.1),
for some invertible, real analytic $A(x)$, and $\Sigma_0$ is given by $\{t = 0\}$, as illustrated
in Fig. 4.1. (Say $t = x_n$.) Picking $\Sigma_0$ appropriately, we can arrange that $S$ is given
by $t = \varphi(x') \ge |x'|^2$, where $x' = (x_1, \dots, x_{n-1})$. The adjoint operator $Q^*$ also
has real analytic coefficients on $\mathcal{O}$. Let $\Sigma_\tau = \mathcal{O} \cap \{t = \tau\}$.

Now, according to the Cauchy–Kowalewsky theorem, together with the estimates
on the size of domains of existence discussed above, we have the following.
There exists $\delta > 0$ such that, for any $\tau \in (-\delta, \delta)$ and any polynomial $a(x)$ on
$\mathbb{R}^n$, the Cauchy problem

(4.20)  $Q^* w = a, \quad w = \partial_t w = \cdots = \partial_t^{m-1} w = 0$ on $\Sigma_\tau$,

has a solution $w$, real analytic on $\{x \in \mathcal{O} : |x - x_0| \le \delta + \sqrt{\delta}\}$. Thus, if we pick
$\tau \in (0, \delta)$ and let $\Omega_\tau$ be the set bounded by $\Sigma_\tau$ and $S$ (so $\Omega_\tau \subset \mathcal{O}^+$),

(4.21)  $(u, a)_{L^2(\Omega_\tau)} = (v, Q^* w)_{L^2(\Omega_\tau)} = (Qv, w)_{L^2(\Omega_\tau)} = 0.$

Since, by the Stone–Weierstrass theorem, the set of polynomials is dense in
$C(\overline{\Omega}_\tau)$, this implies $u = 0$ on $\Omega_\tau$. Similarly, one establishes that $u = 0$ near
$x_0$ in $\mathcal{O}^-$, and the proposition is proved.


Exercises 


1. Show that the conclusion (4.16), leading to the Cauchy–Kowalewsky theorem, still
holds if the hypothesis (4.11) on $\partial_t^m L_\nu(0,x)$ is weakened to

(4.11a)  $\sum_{\nu} N_m\big(\partial_t^m L_\nu(0)\big) \le C_1 \lambda^m\, m!.$

2. In the estimation of (4.14), we took $\mu \ge 2\lambda$. More generally, work out the analogue of
(4.15) when $\lambda = \mu/K$, $K > 1$. What happens if you try to take $\lambda = \mu$? How does this
affect your ability to generalize (4.2)-(4.16) to the quasilinear case:

$\dfrac{\partial u}{\partial t} = \sum_j L_j(t,x,u)\, \dfrac{\partial u}{\partial x_j} + L_0(t,x,u)?$

For a proof of the Cauchy–Kowalewsky theorem for nonlinear PDE, see §4 of Chap. 16.

FIGURE 4.1 The Noncharacteristic Surface S

3. If $P$, $\mathcal{O}$, $S$, and $Y$ are as in Proposition 4.3, show that whenever $u \in \mathcal{D}'(\mathcal{O})$ satisfies
$Pu = 0$ on $\mathcal{O}$, then $Y^j u\big|_S$ is a well-defined element of $\mathcal{D}'(S)$, for all $j$. Extend
Proposition 4.3 to the case of all $u \in \mathcal{D}'(\mathcal{O})$ satisfying (4.18).


5. Hyperbolic systems 


We will use energy estimates and Sobolev space theory to establish the existence
of solutions to linear hyperbolic equations of a more general form than considered
in §1. To begin, let us examine second-order hyperbolic equations, of the form

(5.1)  $Lu = \Box u + Xu = f, \quad u\big|_{S_0} = g_0, \quad Yu\big|_{S_0} = g_1,$

where $\Box$ is the wave operator on a Lorentz manifold $\Omega$, assumed to be foliated by
compact, spacelike hypersurfaces $S_\tau$, the operator $X$ is a first-order differential
operator, and $Y$ is a vector field transverse to $S_\tau$. The operator $\Box = \partial_t^2 - \Delta$ on
$\mathbb{R} \times M$, with $S_\tau = \{(t,x) : t = \tau\}$, dealt with in §1, is a special case, provided
$\partial M = \emptyset$.

Energy estimates for (5.1) were established in §8 of Chap. 2. In particular, if $\mathcal{O}$
is the region in $\Omega$ swept out by $S_\tau$, $0 \le \tau \le T$, then, by (8.19) of Chap. 2,

(5.2)  $\|u\|^2_{H^1(\mathcal{O})} \le C\|Lu\|^2_{L^2(\mathcal{O})} + C\|g_0\|^2_{H^1(S_0)} + C\|g_1\|^2_{L^2(S_0)}.$

The argument of Chap. 2 applies as long as $u \in H^2(\mathcal{O})$. If $L$ has formal adjoint
$L^* = \Box + X_1$, we similarly have

(5.3)  $\|v\|^2_{H^1(\mathcal{O})} \le C\|L^* v\|^2_{L^2(\mathcal{O})}, \quad$ for $v \in V_T(\mathcal{O})$,

where

(5.4)  $V_T(\mathcal{O}) = \{w \in C^\infty(\overline{\mathcal{O}}) : w = dw = 0$ on $S_T\}.$

Now, to solve (5.1), when $g_0 = g_1 = 0$, given $f \in L^2(\mathcal{O})$, it suffices to obtain $u$
such that

(5.5)  $(u, L^* v) = (f, v), \quad$ for all $v \in V_T(\mathcal{O})$.

However, by (5.3), given $f \in L^2(\mathcal{O})$, we have

(5.6)  $|(f, v)| \le C\|f\|_{L^2(\mathcal{O})} \cdot \|L^* v\|_{L^2(\mathcal{O})},$

so by the Riesz representation theorem, the existence of $u \in L^2(\mathcal{O})$ such that
(5.5) holds is guaranteed. In fact, more generally, given $f \in H^1(\mathcal{O})^*$, we have

(5.7)  $|(f, v)| \le C\|f\|_{H^1(\mathcal{O})^*}\, \|L^* v\|_{L^2(\mathcal{O})},$

so we have a solution $u \in L^2(\mathcal{O})$ to (5.5) for all $f \in H^1(\mathcal{O})^*$.

Note that if $u \in L^2(\mathcal{O})$ and $Lu \in L^2(\mathcal{O})$, then $u\big|_{S_0}$ and $Yu\big|_{S_0}$ always exist,
in $H^{-2}(S_0)$. If $u$ satisfies (5.5), these Cauchy data vanish. Also, in this case, if
we set $f = 0$ and $u = 0$ on $\bigcup_{\tau < 0} S_\tau = \mathcal{O}^\flat$, we have $Lu = f$ on $\mathcal{O} \cup \mathcal{O}^\flat = \mathcal{O}^\#$.

Moving to the nonhomogeneous initial-value problem (5.1), if one has $g_0 \in
H^{3/2}(S_0)$ and $g_1 \in H^{1/2}(S_0)$, one can construct $U \in H^2(\mathcal{O})$ with such Cauchy
data and subtract this off. Thus the argument above yields a solution $u \in L^2(\mathcal{O})$
to (5.1), given

(5.8)  $f \in L^2(\mathcal{O}), \quad g_0 \in H^{3/2}(S_0), \quad g_1 \in H^{1/2}(S_0).$

This existence result is not at all satisfactory and will be improved below.

We can extend (5.2) to higher-order a priori estimates for sufficiently smooth
solutions to (5.1), as follows. Suppose $u \in H^{k+1}(\mathcal{O})$, which is more than adequate
to imply that $f \in H^{k-1}(\mathcal{O})$, $g_0 \in H^k(S_0)$, and $g_1 \in H^{k-1}(S_0)$. For simplicity,
take $\Omega = \mathbb{R} \times \mathbb{T}^n$, $S_\tau = \{\tau\} \times \mathbb{T}^n$, so we have natural coordinate systems making
$D^\alpha$ meaningful. Then define

(5.9)  $u_\alpha = D^\alpha u.$

We produce a system of PDE satisfied by $(u_\alpha : |\alpha| \le k-1)$, as follows. There
exist first-order differential operators $X_{\alpha\beta}$ on $\Omega$ such that

(5.10)  $L D^\alpha = D^\alpha L + \sum_{|\beta| \le |\alpha|} X_{\alpha\beta} D^\beta.$

Then

(5.11)  $L u_\alpha - \sum_{|\beta| \le |\alpha|} X_{\alpha\beta} u_\beta = D^\alpha f.$

We can also determine $u_\alpha\big|_{S_0}$ and $Y u_\alpha\big|_{S_0}$, in terms of derivatives of $f$, $g_0$, and $g_1$,
and we have

(5.12)  $u_\alpha\big|_{S_0} = g_{0\alpha} \in H^{k-|\alpha|}(S_0) \subset H^1(S_0), \qquad Y u_\alpha\big|_{S_0} = g_{1\alpha} \in H^{k-1-|\alpha|}(S_0) \subset L^2(S_0).$

Now the energy estimate (5.2) applies to the system (5.11)-(5.12), so we have

(5.13)  $\sum_{|\alpha| \le k-1} \|u_\alpha\|^2_{H^1(\mathcal{O})} \le C \sum_{\alpha} \Big[ \|D^\alpha f\|^2_{L^2(\mathcal{O})} + \|g_{0\alpha}\|^2_{H^1(S_0)} + \|g_{1\alpha}\|^2_{L^2(S_0)} \Big],$

and hence

(5.14)  $\|u\|^2_{H^k(\mathcal{O})} \le C\|Lu\|^2_{H^{k-1}(\mathcal{O})} + C\|g_0\|^2_{H^k(S_0)} + C\|g_1\|^2_{H^{k-1}(S_0)}.$


We want to show that, given $f \in H^{k-1}(\mathcal{O})$, $g_0 \in H^k(S_0)$, $g_1 \in H^{k-1}(S_0)$, with
$k \ge 1$, there exists a unique solution $u$ to (5.1) and that $u \in H^k(\mathcal{O})$. We will
establish this by obtaining $u$ as a limit of solutions to approximating hyperbolic
equations, having analytic coefficients and data, for which a solution is guaranteed
by the Cauchy–Kowalewsky theorem. A different sort of existence argument can
be found in Chap. 7, §7.

Let us assume $S_\tau$ is given by $t = \tau$ in $\Omega = \mathbb{R} \times \mathbb{T}^n$, and $Y = \partial/\partial t$. Now we
can approximate the coefficients of $L$ in $C^\infty(\mathbb{R} \times \mathbb{T}^n)$ by functions that are real
analytic on $\mathbb{R} \times \mathbb{T}^n$. We can think of these functions as being defined on $\mathbb{R} \times \mathbb{R}^n$,
and $\mathbb{Z}^n$-periodic, and can arrange that the coefficients have entire holomorphic
extensions to $\mathbb{C} \times \mathbb{C}^n$. Denote the resulting operators by $L_\nu$. Given $k \in \mathbb{Z}^+$, let
us assume that

(5.15)  $f \in H^{k-1}(\Omega), \quad g_0 \in H^k(S_0), \quad g_1 \in H^{k-1}(S_0).$

We approximate these functions, in the appropriate norms, by real analytic functions
$f_\nu$, $g_{0\nu}$, $g_{1\nu}$, having entire holomorphic extensions in the sense mentioned
above. Consider the initial-value problems

(5.16)  $L_\nu u_\nu = f_\nu, \quad u_\nu\big|_{S_0} = g_{0\nu}, \quad \partial_t u_\nu\big|_{S_0} = g_{1\nu}.$

The Cauchy–Kowalewsky theorem applies to (5.16), and results of §4 imply that,
for each $\nu$, there is a unique solution $u_\nu(t,x)$ that is real analytic on all of $\mathbb{R} \times \mathbb{T}^n$.
The energy estimates of the form (5.14) hold uniformly in $\nu$, for any given $\mathcal{O} =
[-T, T] \times \mathbb{T}^n$. In other words, given (5.15), $\{u_\nu\}$ is bounded in $H^k(\mathcal{O})$. Thus
there is a subsequence $u_{\nu_j} \to u$ weakly in $H^k(\mathcal{O})$. It is clear that such a limit $u$
solves (5.1). We have the following result.

Proposition 5.1. Given $f, g_0, g_1$ satisfying (5.15), there is a unique solution
$u \in H^k(\Omega)$ to (5.1).
The final point to discuss in this result is uniqueness. If $k \ge 2$, this is immediate
from the energy estimate (5.2). In fact, we can derive a more general uniqueness
result by a duality argument. Namely, suppose

(5.17)  $u \in \mathcal{D}'(\Omega), \quad u = 0$ on $\bigcup_{\tau < 0} S_\tau$, $\quad Lu = 0$ on $\Omega$.

Given $f \in C_0^\infty(\Omega)$, we can apply the existence part of the proposition, with $L$
replaced by $L^*$, and with time reversed, to produce, for arbitrarily large $k > 0$,

(5.18)  $v \in H^k(\Omega), \quad v = 0$ for $t \gg 0$, $\quad L^* v = f.$

Pick $k$ so large that $u \in H^{2-k}$, on a neighborhood of the support of $f$. Then

(5.19)  $(u, f) = (u, L^* v) = (Lu, v) = 0,$

which implies $u = 0$. This finishes the proof of Proposition 5.1 and also establishes
the uniqueness of the solution $u$ in (5.8), which can consequently be seen
to belong to $H^1(\mathcal{O})$.

We now look at a class of first-order $N \times N$ systems, of the form

(5.20)  $\dfrac{\partial u}{\partial t} + \sum_j A_j(t,x)\, \dfrac{\partial u}{\partial x_j} + B(t,x)u = f(t,x), \quad u(0,x) = g(x).$

Let us suppose the various functions, $f(t,x)$, and so on, are defined on $\Omega =
\mathbb{R} \times \mathbb{T}^n$. The system (5.20) is said to be symmetric hyperbolic provided each
$N \times N$ matrix $A_j$ satisfies

(5.21)  $A_j(t,x)^* = A_j(t,x).$

We will derive energy estimates for solutions to (5.20) in a fashion similar to that
used in §8 of Chap. 2. Suppose $\mathcal{O} \subset \mathbb{R} \times \mathbb{T}^n$ is bounded by two surfaces, $\Sigma_1$ and
$\Sigma_2$, as illustrated in Fig. 5.1. If we denote the left side of (5.20) by $Lu$, then, by
the Gauss–Green formula, in the form established in (9.17) of Chap. 2,

(5.22)  $(Lu, u) - (u, L^* u) = \dfrac{1}{i} \int_{\partial\mathcal{O}} \langle \sigma_L(t,x,\nu)u, u \rangle\, dS,$

where the inner products on the left are inner products in $L^2(\mathcal{O})$, and $\nu$ is the
inward normal to $\partial\mathcal{O}$, as illustrated in Fig. 5.1. Note that

(5.23)  $L^* u = -Lu + Cu, \quad Cu = -\sum_j \dfrac{\partial A_j}{\partial x_j}\, u + B^* u + Bu,$

provided (5.21) holds. Thus we have

(5.24)  $2\operatorname{Re}(Lu, u) - (u, Cu) = \dfrac{1}{i} \int_{\partial\mathcal{O}} \langle \sigma_L(t,x,\nu)u, u \rangle\, dS.$
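The algebra behind (5.23) and (5.24) can be written out as follows. This is a sketch that takes $L = \partial_t + \sum_j A_j \partial_{x_j} + B$ as in (5.20), uses the symmetry hypothesis $A_j^* = A_j$, and computes the formal adjoint by integrating by parts against compactly supported test functions.

```latex
\begin{align*}
L^* u &= -\partial_t u - \sum_j \partial_{x_j}\big(A_j^* u\big) + B^* u
       = -\partial_t u - \sum_j A_j\, \partial_{x_j} u
         - \sum_j \frac{\partial A_j}{\partial x_j}\, u + B^* u \\
      &= -Lu + \Big( -\sum_j \frac{\partial A_j}{\partial x_j} + B^* + B \Big) u
       = -Lu + Cu, \\[4pt]
(Lu, u) - (u, L^* u) &= (Lu, u) + (u, Lu) - (u, Cu)
       = 2\operatorname{Re}(Lu, u) - (u, Cu).
\end{align*}
```

Since $C$ is a zero-order operator, the term $(u, Cu)$ is bounded by a multiple of $\|u\|^2_{L^2(\mathcal{O})}$, which is what makes (5.24) usable as an energy identity.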


Note that if $\nu = (\nu_0, \nu_1, \dots, \nu_n) \in T^*(\mathbb{R} \times \mathbb{T}^n)$, then

(5.25)  $\dfrac{1}{i}\, \sigma_L(t,x,\nu) = \nu_0 I + \sum_{j=1}^{n} A_j(t,x)\, \nu_j.$

FIGURE 5.1 Spacelike Bounded Regions

FIGURE 5.2 Spacelike Sweeping

Thus $(1/i)\sigma_L(t,x,\nu)$ is positive-definite on $\Sigma_1$ and negative-definite on $\Sigma_2$ if
these surfaces are close enough to horizontal, that is, if $\nu$ is close enough to
$(1,0,\dots,0)$ on $\Sigma_1$ and to $(-1,0,\dots,0)$ on $\Sigma_2$. If this definiteness condition
holds on $\Sigma_j$, we say $\Sigma_j$ is spacelike, for the operator $L$. Compare the notion of
a spacelike surface for $\Box$, given in Chap. 2. Also, we say $\zeta \in T_z^*(\Omega) \setminus 0$ is
(forward or backward) timelike if $(1/i)\sigma_L(z,\zeta) = A(z,\zeta)$ is (positive- or
negative-) definite.
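The timelike condition can be tested directly in a small example. The sketch below assumes the $2 \times 2$ system obtained from the one-dimensional wave equation $u_{tt} = u_{xx}$ by setting $U = (u_t, u_x)$, which gives $\partial_t U + A\,\partial_x U = 0$ with the symmetric matrix $A = -\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$; this example and the function names are illustrative and do not come from the text. By (5.25), $(1/i)\sigma_L(\nu) = \nu_0 I + \nu_1 A$, with eigenvalues $\nu_0 \pm \nu_1$.

```python
import math

def symbol_matrix(nu0, nu1):
    """(1/i) sigma_L = nu0 * I + nu1 * A for A = -[[0, 1], [1, 0]]."""
    return [[nu0, -nu1], [-nu1, nu0]]

def eigenvalues_symmetric_2x2(m):
    """Eigenvalues (low, high) of a real symmetric 2x2 matrix."""
    a, b, d = m[0][0], m[0][1], m[1][1]
    mean = 0.5 * (a + d)
    radius = math.hypot(0.5 * (a - d), b)
    return mean - radius, mean + radius

def classify(nu0, nu1):
    """Classify the covector (nu0, nu1) according to definiteness of the symbol."""
    low, high = eigenvalues_symmetric_2x2(symbol_matrix(nu0, nu1))
    if low > 0:
        return "forward timelike"
    if high < 0:
        return "backward timelike"
    return "not timelike"

print(classify(1.0, 0.0))    # normal to a horizontal surface t = const
print(classify(-1.0, 0.5))
print(classify(1.0, 1.0))    # borderline: on the light cone
```

For this system the symbol is definite exactly when $|\nu_0| > |\nu_1|$, which recovers the usual light-cone description of spacelike surfaces for the wave operator.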

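Suppose $\Sigma_1$ and $\Sigma_2$, bounding $\mathcal{O}$ as above, are both spacelike. Also, suppose
that $\mathcal{O}$ is swept out by spacelike surfaces. To be precise, suppose that there is a
smooth function $\varphi$ on a neighborhood of $\overline{\mathcal{O}}$ such that $d\varphi$ is timelike, and set

(5.26)  $\mathcal{O}(s) = \mathcal{O} \cap \{\varphi \le s\}, \quad \Sigma_2(s) = \mathcal{O} \cap \{\varphi = s\}.$

We suppose $\mathcal{O}$ is swept out by $\Sigma_2(s)$, $s_0 \le s \le s_1$, as illustrated in Fig. 5.2, with
$\Sigma_2 = \Sigma_2(s_1)$. Also set

(5.27)  $\Sigma_1^b(s) = \Sigma_1 \cap \{\varphi \le s\}.$

As in (5.22), we have (with $\nu_2 = d\varphi/|d\varphi|$):

(5.28)  $\int_{\Sigma_2(s)} \langle A(z,\nu_2)u, u \rangle\, dS = \int_{\Sigma_1^b(s)} \langle A(z,\nu)u, u \rangle\, dS - 2\operatorname{Re}(Lu, u) + (u, Cu)$
        $\le \int_{\Sigma_1^b(s)} \langle A(z,\nu)u, u \rangle\, dS + K \int_{\mathcal{O}(s)} \big[ |Lu|^2 + |u|^2 \big]\, dV.$

Now, parallel to (8.13) of Chap. 2, we set

(5.29)  $E(s) = \int_{\mathcal{O}(s)} |u|^2\, dV$

and estimate the rate of change of $E(s)$. Clearly,

(5.30)  $\dfrac{dE}{ds} \le C \int_{\Sigma_2(s)} \langle A(z,\nu_2)u, u \rangle\, dS,$

so, by (5.28), we have an estimate of the form

(5.31)  $\dfrac{dE}{ds} \le C E(s) + F(s),$

where

(5.32)  $F(s) = C \int_{\Sigma_1} |u|^2\, dS + C \int_{\mathcal{O}(s)} |Lu|^2\, dV.$

This differential inequality yields

(5.33)  $E(s) \le \int_{s_0}^{s} e^{C(s-r)} F(r)\, dr.$

Consequently,

(5.34)  $\int_{\mathcal{O}(s)} |u|^2\, dV \le C(s - s_0) \int_{\Sigma_1} |u|^2\, dS + C \int_{\mathcal{O}(s)} |Lu|^2\, dV.$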
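The passage from the differential inequality (5.31) to (5.33) is the standard integrating-factor (Gronwall) argument. The sketch below assumes $E(s_0) = 0$, which holds if $\mathcal{O}(s_0)$ has measure zero; that normalization is an assumption made here rather than a statement in the text.

```latex
\begin{align*}
\frac{d}{ds}\Big( e^{-Cs} E(s) \Big)
  &= e^{-Cs}\Big( \frac{dE}{ds} - C E(s) \Big) \le e^{-Cs} F(s), \\
e^{-Cs} E(s) - e^{-Cs_0} E(s_0)
  &\le \int_{s_0}^{s} e^{-Cr} F(r)\, dr, \\
E(s) &\le \int_{s_0}^{s} e^{C(s-r)} F(r)\, dr
  \qquad \text{when } E(s_0) = 0 .
\end{align*}
```

Since $F(r)$ is nondecreasing in $r$ by (5.32), the last integral is at most $(s - s_0)\, e^{C(s - s_0)} F(s)$, which leads to a bound of the type (5.34) with constants depending on $s - s_0$.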


From here, the existence and uniqueness of solutions to (5.20), as well as the finite 
propagation speed, follow by arguments parallel to those used for L = + X. 
We leave the formulation of such results to the reader. 
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Exercises 
1. Supplement (5.2) with 
[lel 7225p) ae IY ullz2¢s,) < C\|Lull72(0) a5 Cllgollzz1 (so) ab Cllgullz2 (59); 
when L has the form (5.1). More generally, supplement (5.14) with 
[eel er (Spy FY Ullzre—2¢5.5) < Cl|LullFx-1(0) +Cllgoll31*(s9) + CllgrllF*-1(59)- 


(Hint: Look at (8.20), in Chap. 2.) 

2. Show that the part of Maxwell’s equations given by (3.1) forms a symmetric hyperbolic 
system. 

3. Supplement (5.34) with

$$\|u\|^2_{L^2(\Sigma_2)} \le C\|Lu\|^2_{L^2(\mathcal{O})} + C\|u\|^2_{L^2(\Sigma_1)},$$

when $L$ has the form (5.20).

4. Making use of (5.22)-(5.34), formulate and prove an existence and uniqueness result 
for the symmetric hyperbolic system (5.20), parallel to Proposition 5.1. Also give a 
precise formulation of finite propagation speed for solutions to such a system. 

5. Generalize the study of symmetric hyperbolic systems (5.20) to include

(5.35) $Lu = A_0(t,x)\,\frac{\partial u}{\partial t} + \sum_{j=1}^{n} A_j(t,x)\,\frac{\partial u}{\partial x_j} + B(t,x)u,$

where $A_j$ satisfy (5.21), and in addition $A_0(t, x)$ is positive-definite.


6. Geometrical optics 


In this section we look at solutions to the wave equation

(6.1) $\frac{\partial^2 u}{\partial t^2} - \Delta u = 0,$


on R x M, where M is a Riemannian manifold, having either initial data with a 
simple jump across a smooth surface, of the form 


(6.2) $u(0, x) = a(x)\,H(\varphi(x)),$
or highly oscillatory initial data:
(6.3) $u(0, x) = a(x)\,F(\lambda\varphi(x)).$


Here, $H(s)$ is the Heaviside function, with $H(s) = 1$ for $s > 0$ and $H(s) = 0$ for $s < 0$,
while $F \in C^\infty(\mathbb{R})$ is bounded, together with all its derivatives, as well as an
infinite sequence of antiderivatives. We imagine that $\lambda$ is large. We assume
$a \in C_0^\infty(M)$ and $\nabla\varphi \ne 0$ on a neighborhood $U$ of supp $a$. For simplicity, we
complete the set of initial conditions with

(6.4) $u_t(0, x) = 0,$

though the methods developed below extend to more general cases. We will show
that, at least for $|t| < T$, with $T$ small enough, $u(t, x)$ has an asymptotic behavior

(6.5) $u(t, x) \sim \sum_{j \ge 0} u_j(t, x)$


in a sense that will be made precise below, where, in case (6.2),

(6.6) $u_j(t, x) = \sum_{\pm} a_j^{\pm}(t, x)\, h_j(\varphi^{\pm}(t, x)),$

for certain functions $h_j \in C^\infty(\mathbb{R} \setminus 0)$ whose $j$th derivative jumps at 0, and, in
case (6.3),

(6.7) $u_j(t, x) = u_j(t, x, \lambda) = \sum_{\pm} \lambda^{-j} a_j^{\pm}(t, x)\, F_j(\lambda\varphi^{\pm}(t, x)),$

for certain $F_j \in C^\infty(\mathbb{R})$. In both cases, $a_j^{\pm}, \varphi^{\pm} \in C^\infty((-T, T) \times M)$, with

(6.8) $\varphi^{\pm}(0, x) = \varphi(x),$

and $a_0^+(0, x) + a_0^-(0, x) = a(x)$. The functions $\varphi^{\pm}$ are called “phase functions,”
and the functions $a_j^{\pm}$ are called “amplitudes.” We take $h_0 = H$ and $F_0 = F$.

The asymptotic relation (6.5) will imply in particular that $u - \sum_{j \le N} u_j$ is,
for large $N$, relatively smooth, in case (6.2), and also relatively “small” in case
(6.3), as $\lambda \to \infty$. We give the details of the construction in the case (6.3) before
sketching a similar treatment of the case (6.2).

In order to compute the action of $\partial_t^2 - \Delta$ on (the sum over $0 \le j \le N$ of) the
right side of (6.5), when $u_j(t, x)$ has the form (6.7), we recall that


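(6.9)
$$\begin{aligned}
\Delta(uv) &= (\Delta u)v + 2\nabla u \cdot \nabla v + u(\Delta v), \\
\Delta(F(u)) &= F'(u)\Delta u + F''(u)|\nabla u|^2.
\end{aligned}$$

Here, we use the dot product to denote the inner product with respect to the
Riemannian metric; $\nabla u \cdot \nabla v = g^{jk}(\partial_j u)(\partial_k v)$. Thus, if $u_j$ has the form (6.7),
we obtain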
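(6.10)
$$\begin{aligned}
(\partial_t^2 - \Delta)u_j = \sum_{\pm} \Bigl[\, &\lambda^{-j} F_j(\lambda\varphi^{\pm})\,(\Box a_j^{\pm}) \\
&+ \lambda^{1-j} F_j'(\lambda\varphi^{\pm})\bigl(2\varphi_t^{\pm}\,\partial_t a_j^{\pm} - 2\nabla_x\varphi^{\pm}\cdot\nabla_x a_j^{\pm} + a_j^{\pm}\,\Box\varphi^{\pm}\bigr) \\
&+ \lambda^{2-j} F_j''(\lambda\varphi^{\pm})\, a_j^{\pm}\bigl(|\varphi_t^{\pm}|^2 - |\nabla_x\varphi^{\pm}|^2\bigr) \Bigr],
\end{aligned}$$

where $\Box = \partial_t^2 - \Delta$.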


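In particular, the coefficients of $\lambda^{\mu} = \lambda^{1-j}$ in $(\partial_t^2 - \Delta)\sum_{j \ge 0} u_j(t, x)$ are of the
following form:

(6.11) $\mu = 2$: $\quad \sum_{\pm} F_0''(\lambda\varphi^{\pm})\, a_0^{\pm}\bigl(|\varphi_t^{\pm}|^2 - |\nabla_x\varphi^{\pm}|^2\bigr),$

(6.12) $\mu = 1$:
$$\sum_{\pm} \Bigl[ F_1''(\lambda\varphi^{\pm})\, a_1^{\pm}\bigl(|\varphi_t^{\pm}|^2 - |\nabla_x\varphi^{\pm}|^2\bigr) + F_0'(\lambda\varphi^{\pm})\bigl(2\varphi_t^{\pm}\,\partial_t a_0^{\pm} - 2\nabla_x\varphi^{\pm}\cdot\nabla_x a_0^{\pm} + a_0^{\pm}\,\Box\varphi^{\pm}\bigr) \Bigr],$$

(6.13) $\mu \le 0$:
$$\begin{aligned}
\sum_{\pm} \Bigl[\, &F_{j+1}''(\lambda\varphi^{\pm})\, a_{j+1}^{\pm}\bigl(|\varphi_t^{\pm}|^2 - |\nabla_x\varphi^{\pm}|^2\bigr) \\
&+ F_j'(\lambda\varphi^{\pm})\bigl(2\varphi_t^{\pm}\,\partial_t a_j^{\pm} - 2\nabla_x\varphi^{\pm}\cdot\nabla_x a_j^{\pm} + a_j^{\pm}\,\Box\varphi^{\pm}\bigr) \\
&+ F_{j-1}(\lambda\varphi^{\pm})\,(\Box a_{j-1}^{\pm}) \Bigr].
\end{aligned}$$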


We will set these terms successively equal to zero. To begin, the term (6.11)
vanishes provided $\varphi^{\pm}$ satisfies the eikonal equation:

(6.14) $|\partial_t\varphi^{\pm}|^2 - |\nabla_x\varphi^{\pm}|^2 = 0.$

If we use (6.8) to specify $\varphi^{\pm}(0, x)$, then the results on this first-order nonlinear
PDE obtained in §15 of Chap. 1 apply. There is a neighborhood $U$ of $K$ = supp $a$
and a $T > 0$ such that this initial-value problem has a unique pair of solutions
$\varphi^{\pm} \in C^\infty((-T, T) \times U)$, satisfying

(6.15) $\varphi^{\pm}(0, x) = \varphi(x), \quad \partial_t\varphi^{\pm}(0, x) = \pm|\nabla_x\varphi(x)|.$


Having so specified $\varphi^{\pm}$, we see that the terms (6.12) and (6.13) simplify. The term
(6.12) vanishes provided

(6.16) $2\varphi_t^{\pm}\,\frac{\partial a_0^{\pm}}{\partial t} = 2\nabla_x\varphi^{\pm}\cdot\nabla_x a_0^{\pm} - a_0^{\pm}\,(\Box\varphi^{\pm}).$

By (6.15) we see that $\varphi_t^{\pm} \ne 0$ on $U$ (if $|t|$ is small enough). The linear equations
(6.16) for $a_0^{\pm}$ are called the first transport equations. The initial conditions for $a_0^{\pm}$
are deduced from (6.3) and (6.4). We want

(6.17) $a_0^+ + a_0^- = a, \quad \varphi_t^+ a_0^+ + \varphi_t^- a_0^- = 0, \quad \text{at } t = 0;$

hence, in light of (6.15),

(6.18) $a_0^+(0, x) = a_0^-(0, x) = \tfrac{1}{2}\,a(x).$

We have $a_0^{\pm} \in C^\infty((-T, T) \times U)$, compactly supported in $U$ for each
$t \in (-T, T)$, if $T$ is small enough.
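
To see the eikonal and first transport equations at work, here is a worked special case. It is an illustration under assumptions not made in the text: $M = \mathbb{R}$ with the flat metric and $\varphi(x) = x$.

```latex
% Illustrative special case (an assumption, not from the text):
% M = \mathbb{R} with the flat metric, \varphi(x) = x.
\begin{aligned}
&\varphi^{\pm}(t,x) = x \pm t, \qquad
  \partial_t\varphi^{\pm} = \pm 1 = \pm|\partial_x\varphi|, \qquad
  \Box\varphi^{\pm} = 0, \\
&\text{(6.16):}\quad \pm 2\,\partial_t a_0^{\pm} = 2\,\partial_x a_0^{\pm}
  \;\Longrightarrow\; a_0^{\pm}(t,x) = \tfrac12\, a(x \pm t), \\
&u_0(t,x) = \tfrac12\, a(x+t)\,F(\lambda(x+t)) + \tfrac12\, a(x-t)\,F(\lambda(x-t)).
\end{aligned}
```

In this case $u_0$ is d'Alembert's formula for the data (6.3)-(6.4), so the leading term is already the exact solution.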

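Next, the term (6.13), for $\mu = 1 - j \le 0$ (i.e., $j \ge 1$), vanishes provided that
$F_j'(\lambda\varphi^{\pm}) = F_{j-1}(\lambda\varphi^{\pm})$, that is,

(6.19) $F_j(s) = \int F_{j-1}(s)\,ds$

and

(6.20) $2\varphi_t^{\pm}\,\frac{\partial a_j^{\pm}}{\partial t} = 2\nabla_x\varphi^{\pm}\cdot\nabla_x a_j^{\pm} - (\Box\varphi^{\pm})\,a_j^{\pm} - \Box a_{j-1}^{\pm},$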


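which are higher-order transport equations. To obtain the initial conditions, note
that if $u(t, x)$ is given by (6.5) and (6.7), then

(6.21) $\partial_t u \sim \sum_{j \ge 0}\sum_{\pm} \Bigl[\lambda^{1-j} a_j^{\pm} F_j'(\lambda\varphi^{\pm})\,\varphi_t^{\pm} + \lambda^{-j} (\partial_t a_j^{\pm})\, F_j(\lambda\varphi^{\pm})\Bigr].$

Thus, using (6.4) and also requiring $u_j(0, x) = 0$ for $j \ge 1$, we require

(6.22) $a_j^+ + a_j^- = 0, \quad \sum_{\pm} \Bigl[a_j^{\pm} F_j'(\lambda\varphi^{\pm})\,\varphi_t^{\pm} + (\partial_t a_{j-1}^{\pm})\, F_{j-1}(\lambda\varphi^{\pm})\Bigr] = 0, \quad \text{at } t = 0,$

or, using (6.19) and (6.15),

(6.23) $a_j^+ + a_j^- = 0, \quad \varphi_t^+\,(a_j^+ - a_j^-) = -\partial_t\,(a_{j-1}^+ + a_{j-1}^-), \quad \text{at } t = 0.$

This specifies $a_j^+(0, x)$ and $a_j^-(0, x)$. Then the transport equations (6.20) have
unique solutions $a_j^{\pm} \in C^\infty((-T, T) \times U)$, compactly supported in $U$ for each
$t \in (-T, T)$.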

The construction described above, via the eikonal and transport equations, is
the basic case of the method of geometrical optics. We now obtain some estimates
on the degree to which such a construction approximates the solution to (6.1),
(6.3), and (6.4). If we set

(6.24) $v_N = \sum_{j=0}^{N} u_j,$

then $v_N$ satisfies

(6.25)
$$\begin{aligned}
&\frac{\partial^2 v_N}{\partial t^2} - \Delta v_N = r_N(t, x), \\
&v_N(0, x) = a(x)\,F(\lambda\varphi(x)), \quad \partial_t v_N(0, x) = \rho_N(x),
\end{aligned}$$

where

(6.26) $\rho_N(x) = \lambda^{-N} \sum_{\pm} \partial_t a_N^{\pm}(0, x) \cdot F_N(\lambda\varphi)$

and

(6.27) $r_N(t, x) = \lambda^{-N} \sum_{\pm} (\Box a_N^{\pm})\, F_N(\lambda\varphi^{\pm}).$

The following result is elementary. 


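Proposition 6.1. If $\varphi^{\pm} \in C^\infty((-T, T) \times M)$ and $b \in C_0^\infty(M)$, then

(6.28) $\{\lambda^{-\ell}\, b(x)\, F_N(\lambda\varphi^{\pm}) : \lambda \ge 1\}$

is bounded in $C^j((-T, T), H^{\ell - j}(M))$, for each $\ell, j \ge 0$, provided $F_N(s)$ and
all its derivatives are bounded.

Now, $u - v_N$ satisfies

(6.29)
$$\begin{aligned}
&(\partial_t^2 - \Delta)(u - v_N) = -r_N, \\
&(u - v_N)(0, x) = 0, \quad \partial_t(u - v_N)(0, x) = -\rho_N(x),
\end{aligned}$$

so we have the following. (Compare with Exercise 1 of §1.)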


Proposition 6.2. The geometrical optics construction of $v_N$ produces an approximation
to the solution $u$ to (6.1), (6.3), and (6.4), satisfying

(6.30) $u - v_N$ is $O(\lambda^{-\nu})$ in $C^j((-T, T), H^{N+1-\nu-j}(M)),$

for $0 \le \nu \le N$, $j \ge 0$, as long as, for each $N$, $F_N(s)$ and all its derivatives are
bounded.

The most common function to take for $F(s) = F_0(s)$ is $F(s) = e^{is}$, in which
case $F_N(s) = i^{-N} e^{is}$. Other equally good functions include $F(s) = \cos s$ and
$F(s) = \sin s$.
Let us note that (6.28) is not sharp. We can improve it to

(6.31) $\{\lambda^{-\ell}\, b(x)\, F_N(\lambda\varphi^{\pm}) : \lambda \ge 1\}$ is bounded in $C^{\ell}((-T, T) \times M)$.

Consequently, we can say that if $N' > N$,

(6.32) $v_{N'} - v_N$ is $O(\lambda^{-\nu})$ in $C^{N+1-\nu}((-T, T) \times M),$

for $0 \le \nu \le N$. Then if we apply (6.30) to $u - v_{N'}$, with $N'$ very large, we
conclude that

(6.33) $u - v_N$ is $O(\lambda^{-\nu})$ in $C^{N+1-\nu}((-T, T) \times M),$

for $0 \le \nu \le N$.

There is a construction analogous to (6.10)-(6.23) for the initial-value problem
(6.1), (6.2), and (6.4), whose initial data have a simple jump discontinuity. As
mentioned above, the form (6.5)-(6.6) furnishes an approximate solution. The
phase functions $\varphi^{\pm}$ also satisfy the eikonal equation (6.14), and the amplitudes
$a_j^{\pm}(t, x)$ satisfy transport equations similar to (6.16) and (6.20). Parallel to the
relation (6.19) between $F_{j-1}(s)$ and $F_j(s)$, we have $h_j(s) = \int h_{j-1}(s)\,ds$, with
$h_0(s) = H(s)$, the Heaviside function. Thus, for $j \ge 1$, we can take

(6.34) $h_j(s) = \begin{cases} 0, & \text{for } s \le 0, \\ \dfrac{s^j}{j!}, & \text{for } s > 0. \end{cases}$


Having constructed the terms $u_j(t, x)$ of the form (6.6), we can again use energy
estimates for the wave equation to show that $u - \sum_{j \le N} u_j$ has high-order Sobolev
regularity if $N$ is large. Comparison with the sum $\sum_{j \le N'} u_j$ for $N' \gg N$,
parallel to (6.32)-(6.33), then shows that

(6.35) $u - \sum_{j \le N} u_j \in C^{N,1}((-T, T) \times M),$

i.e., $N$th order derivatives are Lipschitz continuous.

Note that the singular support of $\sum_{j \le N} u_j$, hence of $u$, in $(-T, T) \times M$ is
contained in the union of the level sets $\varphi^{\pm}(t, x) = 0$, each of which is a
characteristic surface for $\Box$. This phenomenon is a special case of a general result about
the “propagation of singularities” of a solution to a PDE, which will be treated in
Chap. 7, §9.

Let us mention a geometrical characterization of the level surfaces

(6.36) $S_\beta = \{(t, x) : \varphi(t, x) = \beta\}$

of a solution $\varphi$ to the eikonal equation (6.14). Namely, each $S_\beta$ is swept out by
“light rays” $\gamma(t) = (t, x(t))$ passing orthogonally over the level set $\Sigma_\beta = \{x :
\varphi(x) = \beta\}$ at $t = 0$, where a light ray is a null geodesic for the Lorentz metric
$-dt^2 + \sum g_{jk}\,dx_j\,dx_k$ on $\mathbb{R} \times M$. Equivalently, $x(t)$ is a unit-speed geodesic
on $M$, such that $x'(0)$ is orthogonal to $\Sigma_\beta$. This follows by arguments used to
establish Proposition 15.4 and Corollary 15.5 in Chap. 1.

So far we have looked at approximate solutions to the wave equation whose
supports do not intersect a boundary. We now consider the reflection of such
waves. Thus, let $\Omega$ be an open subset of $M$, with smooth boundary. Suppose the
function $a(x)$ in (6.2)-(6.3) belongs to $C_0^\infty(\Omega)$, and we want to solve (6.1) on
$\mathbb{R} \times \Omega$, with the Dirichlet boundary condition,

(6.37) $u(t, x) = 0, \quad x \in \partial\Omega,$

plus an initial condition: either (6.2) or (6.3), and (6.4). Suppose that the
geometrical optics construction above works, for $t \in (-T, T)$, if we make the construction
on $(-T, T) \times M$, and that the associated $u_j(t, x)$, of the form (6.6) or (6.7), have
supports intersecting $\partial\Omega$. In that case, we want to construct $u$ to satisfy the
boundary condition (6.37), by subtracting $w$, the solution to

(6.38)
$$\begin{aligned}
&\frac{\partial^2 w}{\partial t^2} - \Delta w = 0, \quad w(0, x) = w_t(0, x) = 0, \\
&w = v \ \text{ on } \mathbb{R} \times \partial\Omega,
\end{aligned}$$

where $v = \sum_{j \le N} u_j$.

Let us restrict attention to $t \in (0, T)$. Suppose our wave has the form (6.5)-(6.6),
so it has singularities on the surfaces $\varphi^{\pm}(t, x) = 0$. By the superposition
principle, we can consider just one of the terms in the sum over $j$ and $\pm$, so let us
drop the $\pm$ superscript and suppose

(6.39) $v = a_j(t, x)\, h_j(\varphi(t, x))$

in (6.38). Then we will construct an approximate solution to (6.38), in the form

(6.40) $w(t, x) \sim \sum_{\ell \ge j} b_\ell(t, x)\, h_\ell(\psi(t, x)) = \sum_{\ell \ge j} w_\ell(t, x),$

granted a geometrical restriction, which we describe below. To do this, we have
computations parallel to (6.10)-(6.13). Thus, as in (6.14), we have for $\psi(t, x)$ the
eikonal equation:

(6.41) $|\partial_t\psi|^2 - |\nabla_x\psi|^2 = 0.$

We want $w_j = v$ on $(0, T) \times \partial\Omega$, so we set

(6.42) $\psi(t, x) = \varphi(t, x)$ on $(0, T) \times \partial\Omega$.
There are several ways to describe our geometrical hypothesis. One is that the
surface $(0, T) \times \partial\Omega$ in $(0, T) \times M$ is noncharacteristic for the eikonal equation
(6.41), at the data (6.42) (on the support of $a_j(t, \cdot)$). An equivalent formulation
is that if we set

(6.43) $C_\beta = S_\beta \cap \{(0, T) \times \partial\Omega\},$

where $S_\beta$ is the level set (6.36), then $C_\beta$ is a spacelike hypersurface of $(0, T) \times \partial\Omega$,
with its induced Lorentz metric. Recall that $S_\beta$ is a union of light rays. Another
equivalent hypothesis is that each of these light rays that hits $(0, T) \times \partial\Omega$ does so
transversally. Let us assume in addition that each such light ray (inside some $S_\beta$,
issuing from supp $a$ at $t = 0$) hits $(0, T) \times \partial\Omega$ exactly once.

To continue our construction of the transversally reflected wave, we want to
solve (6.41)-(6.42). In fact, under the geometrical hypothesis just stated, this has
exactly two solutions. One of them is $\varphi(t, x)$ itself. The level sets $\{\varphi(t, x) = \beta\}$
are swept out by light rays issuing from $C_\beta$ which point in the negative $t$-direction
as they go into $\Omega$. The solution of current interest to us is the other one; its level
sets $\{\psi(t, x) = \beta\}$ are swept out by light rays issuing from $C_\beta$ which point in the
positive $t$-direction as they go into $\Omega$. See Fig. 6.1.


FIGURE 6.1 Reflected Wave Front (labels: $\psi(t, x) = \beta$; $(0, T) \times \partial\Omega$; $\varphi(x) = \beta$)


Having $\psi(t, x)$, we construct the amplitudes $b_\ell(t, x)$ by solving transport
equations, parallel to (6.16) and (6.20). We take

(6.44) $b_j(t, x) = a_j(t, x), \quad b_\ell(t, x) = 0, \quad x \in \partial\Omega, \ \ell > j.$

In particular, each $b_\ell(t, x)$, hence each $w_\ell(t, x)$, vanishes on $[0, T_1) \times \Omega$, for some
$T_1 \in (0, T)$. Now if $W_N = \sum_{j \le \ell \le N} w_\ell$, we have $w - W_N$ satisfying:

(6.45)
$$\begin{aligned}
&\Bigl(\frac{\partial^2}{\partial t^2} - \Delta\Bigr)(w - W_N) = r_N, \\
&(w - W_N)(0, x) = 0, \quad \partial_t(w - W_N)(0, x) = 0, \\
&(w - W_N)(t, x) = \rho_N(t, x), \quad x \in \partial\Omega,
\end{aligned}$$

similar to (6.29), where $r_N$ and $\rho_N$ are fairly smooth, on $(0, T) \times \Omega$ and on
$(0, T) \times \partial\Omega$, respectively, if $N$ is large, and both vanish for $0 \le t \le T_1$, for
some $T_1 > 0$. It follows from the results of Exercise 2 in §1 that $w - W_N$ is
arbitrarily smooth, for $N$ sufficiently large, so such a construction succeeds in
approximating the reflected wave, granted the transversal reflection hypothesis
made above.

When the transversality hypothesis made above is violated, the reflected wave 
can have a much more complicated structure. Some of the basic cases of this 
phenomenon are dealt with in detail in [Tay], Vol. 3 of [Ho], and [MeT], to which 
we refer for citations of the original papers. 


Exercises 


1. Extend the geometrical optics construction of approximate solutions to (6.1) and (6.3),
with (6.4) replaced by
$u_t(0, x) = b(x)\,\lambda F'(\lambda\varphi(x)).$
2. Work out geometrical optics approximations for solutions to hyperbolic systems, of the
form (5.20), assuming strict hyperbolicity, that is, for each $\xi \in \mathbb{R}^n \setminus 0$, $\sum A_j(t, x)\xi_j$
has $n$ eigenvalues $\lambda_\nu(x, \xi)$, all real and distinct.


7. The formation of caustics 


The geometrical optics construction of §6 breaks down when the eikonal equation
(6.14) does not have a global solution, which is a typical state of affairs. We can
see this happen in the case where $M$ is $\mathbb{R}^n$, with its flat Euclidean metric. In such
a case, for small $t$, the solution to (6.14) is given implicitly by

(7.1) $\varphi^{\pm}(t, y) = \varphi(x), \quad y = x \pm tN(x), \quad N(x) = |\nabla\varphi(x)|^{-1}\nabla\varphi(x).$

In other words, if $S \subset \mathbb{R}^n$ is a level set of $\varphi$, then, for fixed $t$, the level sets of
$\varphi^{\pm}(t, \cdot)$ (i.e., the “wavefronts”) are the images $F_{\pm t}(S)$ of $S$ under the maps $F_{\pm t}$
on $\mathbb{R}^n$, defined by $F_t(x) = x + tN(x)$. As $|t|$ gets larger, these images can
develop singularities, or “caustics,” as illustrated in Figs. 7.1a and b, in the case
$n = 2$, where the level sets are curves.

FIGURE 7.1A Caustics IA (wavefronts at t = 0 and at t = 2)

FIGURE 7.1B Caustics IB

Note that $DN(x)$ annihilates $N(x)$ and, if $x \in \Sigma_\beta = \{\varphi(x) = \beta\}$, then
$DN(x)$ leaves $T_x\Sigma_\beta$ invariant and acts on it as $-A_N$, the negative of the Weingarten
map (discussed in §4 of Appendix C, on connections and curvature). Thus the
eigenvalues of $DN(x)$ are 0 and the negatives of the principal curvatures of $\Sigma_\beta$
at $x$. Consequently, the derivative

(7.2) $DF_t(x) = I + t\,DN(x)$

is singular if and only if $1/t$ is the value of a principal curvature of $\Sigma_\beta$ at $x$.
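
The singularity criterion for (7.2) can be checked numerically in the simplest example. The sketch below is an illustration under assumptions not in the text: it takes $\varphi(x) = |x|$ on $\mathbb{R}^2$, whose level sets are circles with outward normal $N(x) = x/|x|$, and the function names are hypothetical. Here $DN$ acts as multiplication by $1/r$ on the tangent line of the circle of radius $r$, so $DF_t$ should become singular at $t = -r$, when the inward-moving circular wavefront collapses to a point.

```python
import math

def unit_normal(grad_phi, x):
    """N(x) = grad(phi)(x) / |grad(phi)(x)|, as in (7.1)."""
    gx, gy = grad_phi(x)
    norm = math.hypot(gx, gy)
    return (gx / norm, gy / norm)

def wavefront_map(grad_phi, x, t):
    """F_t(x) = x + t N(x)."""
    nx, ny = unit_normal(grad_phi, x)
    return (x[0] + t * nx, x[1] + t * ny)

def jacobian_det(grad_phi, x, t, h=1e-5):
    """det DF_t(x), with DF_t approximated by central differences."""
    fxp = wavefront_map(grad_phi, (x[0] + h, x[1]), t)
    fxm = wavefront_map(grad_phi, (x[0] - h, x[1]), t)
    fyp = wavefront_map(grad_phi, (x[0], x[1] + h), t)
    fym = wavefront_map(grad_phi, (x[0], x[1] - h), t)
    a = (fxp[0] - fxm[0]) / (2 * h)  # dF1/dx1
    b = (fyp[0] - fym[0]) / (2 * h)  # dF1/dx2
    c = (fxp[1] - fxm[1]) / (2 * h)  # dF2/dx1
    d = (fyp[1] - fym[1]) / (2 * h)  # dF2/dx2
    return a * d - b * c

# Assumed example: phi(x) = |x|, so grad(phi)(x) = x / |x| away from the origin.
def grad_phi(x):
    r = math.hypot(x[0], x[1])
    return (x[0] / r, x[1] / r)

point = (2.0, 0.0)                                # on the circle of radius r = 2
det_regular = jacobian_det(grad_phi, point, 1.0)  # 1 + t/r = 1.5
det_focus = jacobian_det(grad_phi, point, -2.0)   # 1 + t/r = 0
```

For this example the determinant is $1 + t/r$, so it vanishes only at the focusing time $t = -r$.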
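We will describe some of the simplest asymptotic behaviors of solutions to
(6.1), (6.3), and (6.4) when $M = \mathbb{R}^2$. To recall the equations, we have

(7.3)
$$\begin{aligned}
&\frac{\partial^2 u}{\partial t^2} - \Delta u = 0 \ \text{ on } \mathbb{R} \times \mathbb{R}^2, \\
&u(0, x) = a(x)\,F(\lambda\varphi(x)), \quad u_t(0, x) = 0.
\end{aligned}$$

We will take

(7.4) $F(s) = e^{is}.$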

As before, $a \in C_0^\infty(\mathbb{R}^n)$. As shown in §6, there is a short-time approximate
solution of the form

(7.5) $u(t, x) \sim \sum_{\pm} \sum_{j \ge 0} \lambda^{-j}\, a_j^{\pm}(t, x)\, e^{i\lambda\varphi^{\pm}(t, x)},$

where this time we have absorbed the factors $i^{-j}$ into the amplitudes. We now
want an asymptotic formula as $\lambda \to \infty$ for the solution near the caustics, where
(7.5) breaks down.

Recall that the exact solution to (7.3) is

(7.6) $u(t, x) = R'(t) * u_0(x),$

where $u_0(x) = a(x)\,e^{i\lambda\varphi(x)}$ and $R'(t)$ is the $t$-derivative of the Riemann function

(7.7) $R(t, x) = \begin{cases} c\,(t^2 - |x|^2)^{-1/2}, & \text{for } |x| < t, \\ 0, & \text{for } |x| > t, \end{cases}$

if $t > 0$; see (5.46) of Chap. 3. Note that, for fixed $t > 0$, $R'(t)$ is a radial
distribution that is singular precisely on the circle of radius $t$, centered at the
origin. We expect $u(t, x)$ in (7.6) to have qualitative features similar to

(7.8)
$$\begin{aligned}
v(t, x) &= \frac{1}{2\pi t} \int_{|y - x| = t} u_0(y)\,dS(y) \\
&= \frac{1}{2\pi} \int_{-\pi}^{\pi} a(x + t\,\mathrm{cis}(s))\, e^{i\lambda\varphi(x + t\,\mathrm{cis}(s))}\,ds,
\end{aligned}$$

where $\mathrm{cis}(s) = (\cos s, \sin s)$. The precise relation between $u(t, x)$ and $v(t, x)$ is
most easily analyzed using techniques to be developed in the next chapter; see the
exercises after §9 of Chap. 7. At this point, we will concentrate on an asymptotic
analysis of (7.8).
In the simplest cases, an integral of the form

(7.9) $I(\lambda) = \int_{-\infty}^{\infty} a(s)\, e^{i\lambda\psi(s)}\,ds, \quad a \in C_0^\infty(\mathbb{R}),$

can be analyzed by the “stationary phase method.” This works when $\psi$ is real-valued
and has a finite number of critical points, each of which is nondegenerate.
In fact, if there are no critical points of $\psi$ on supp $a$, then $I(\lambda)$ is rapidly decreasing
as $|\lambda| \to \infty$, as can readily be seen by writing

$$I(\lambda) = \int a(s) \Bigl(\frac{1}{i\lambda\psi'(s)}\,\frac{d}{ds}\Bigr) e^{i\lambda\psi(s)}\,ds,$$

and integrating by parts.

Thus we can reduce our analysis of (7.9) to the case where $\psi$ has exactly one
critical point, at $s_0$, assumed to be nondegenerate, and $a$ is supported near $s_0$. In
such a case, either $\psi(s) - \psi(s_0)$ or its negative has a smooth, real-valued square
root $t(s)$, such that $t(s_0) = 0$, $t'(s_0) > 0$, and we can use $t$ as a new coordinate,
to write

(7.10) $I(\lambda) = e^{i\lambda\psi(s_0)} \int_{-\infty}^{\infty} b(t)\, e^{i\alpha\lambda t^2}\,dt, \quad b \in C_0^\infty(\mathbb{R}),$

where $\alpha = \pm 1$. There are several ways to evaluate (7.10) asymptotically; one is
to set $x = t^2$, so

(7.11)
$$\begin{aligned}
I(\lambda) &= \frac{1}{2}\, e^{i\lambda\psi(s_0)} \int_0^{\infty} \bigl[b(x^{1/2}) + b(-x^{1/2})\bigr]\, x^{-1/2}\, e^{i\alpha\lambda x}\,dx \\
&\sim e^{i\lambda\psi(s_0)}\, \lambda^{-1/2} \bigl[a_0 + a_1\lambda^{-1} + \cdots\bigr],
\end{aligned}$$

in view of results on Fourier transforms of singular functions in §8 of Chap. 3.
Another method, in the context of the multidimensional stationary phase method,
will be given in Appendix B at the end of this chapter.
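
The leading $\lambda^{-1/2}$ behavior in (7.11) can be tested numerically. The sketch below is an illustration under assumptions not in the text: it takes $\psi(s) = s^2$ (so $s_0 = 0$, $\alpha = 1$) and the Gaussian amplitude $a(s) = e^{-s^2}$, which is rapidly decreasing rather than compactly supported, chosen because the integral then has the closed form $\sqrt{\pi/(1 - i\lambda)}$. The function names are hypothetical. The leading stationary phase term for this choice is $\sqrt{\pi/\lambda}\,e^{i\pi/4}$.

```python
import cmath
import math

def oscillatory_integral(lam, half_width=8.0, n=8000):
    """Trapezoid rule for I(lam) = integral of exp(-s^2) exp(i lam s^2) ds over the real line.

    The Gaussian makes truncation to [-half_width, half_width] harmless."""
    h = 2.0 * half_width / n
    total = 0j
    for k in range(n + 1):
        s = -half_width + k * h
        w = 0.5 if k in (0, n) else 1.0
        total += w * math.exp(-s * s) * cmath.exp(1j * lam * s * s)
    return total * h

def leading_term(lam):
    """Leading stationary phase term: a(0) sqrt(pi / lam) exp(i pi / 4)."""
    return math.sqrt(math.pi / lam) * cmath.exp(1j * math.pi / 4)

lam = 50.0
I_num = oscillatory_integral(lam)
I_exact = cmath.sqrt(math.pi / (1 - 1j * lam))  # closed form for this Gaussian example
rel_err = abs(I_num - leading_term(lam)) / abs(leading_term(lam))
```

For this example the relative error of the leading term is about $1/(2\lambda)$, consistent with the next term in (7.11) being smaller by a factor $\lambda^{-1}$.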

More generally, if there are a finite number of critical points $s_j$ of $\psi(s)$, all
nondegenerate, then

(7.12) $I(\lambda) \sim \sum_j a_j(\lambda)\, \lambda^{-1/2}\, e^{i\lambda\psi(s_j)}, \quad a_j(\lambda) \sim a_{j0} + a_{j1}\lambda^{-1} + \cdots.$

If $a(s) = a(y, s)$ and $\psi(s) = \psi(y, s)$ in (7.9) depend smoothly on the parameters
$y$, then we have (7.12) for $I(\lambda) = I(y, \lambda)$, with $a_{jk} = a_{jk}(y)$ and
$\psi(s_j) = \psi(y, s_j(y))$ depending smoothly on $y$, as long as the critical points
of $\psi(y, s)$, as a function of $s$, are all nondegenerate and consequently depend
smoothly on $y$.

Let's return to (7.8). We are assuming that $\nabla\varphi(y) \ne 0$ for $y \in$ supp $a$. Now,
given $x \in \mathbb{R}^2$, $t > 0$, let us denote by $S_t(x)$ the circle of radius $t$ centered at $x$.
The way in which $S_t(x)$ is tangent to various level curves $\Sigma_\beta$ of $\varphi$ determines the
nature of the stationary points of the phase in the last integral in (7.8). Clearly,
if $1/t$ is bigger than the largest curvature of any $\Sigma_\beta$, then $S_t(x)$ will have only
simple tangencies with such level curves, so only nondegenerate stationary points
of the phase will appear in (7.8). If $y \in \Sigma_\beta$ (so $\varphi(y) = \beta$) is such a point of
intersection, then its contribution to the asymptotic behavior of $v(t, x)$ as $\lambda \to \infty$ is
an amplitude times $e^{i\lambda\varphi(y)}$, in agreement with the geometrical optics construction
given in §6, since in this case $\varphi^{\pm}(t, x) = \varphi(y)$. This is illustrated in Fig. 7.2.

FIGURE 7.2 Caustics II

FIGURE 7.3 Caustics III

On the other hand, suppose $y \in \Sigma_\beta$ and $1/t = \kappa(y)$, the curvature of $\Sigma_\beta$ at $y$.
Let $x = y + tN(y)$, as illustrated in Fig. 7.3. Then $S_t(x)$ has higher-order tangency
with $\Sigma_\beta$ at $y$. Let us assume that $y$ is not a stationary point for $\kappa$ on $\Sigma_\beta$, that is,
if one travels on $\Sigma_\beta$ at unit speed, $\kappa$ is monotonically increasing (or decreasing)
at a nonzero rate at $y$. In such a case, Fig. 7.3 captures the behavior of the image
of $\Sigma_\gamma$ (for $\gamma$ close to $\beta$) under $F_t$, by our analysis of (7.2). In this case, the phase
function in (7.8) has a simply degenerate critical point at $y$, so we have an integral
of the form (7.9) with $\psi(s_0) = \beta$, $\psi'(s_0) = \psi''(s_0) = 0$, and $\psi'''(s_0) \ne 0$
(say it is $> 0$). We can treat this in a fashion similar to the nondegenerate case.
This time, $\psi(s) - \beta$ has a smooth cube root near $s = s_0$, call it $t(s)$, such that
$t(s_0) = 0$, $t'(s_0) > 0$, and we can take $t$ as a new coordinate to write

(7.13) $I(\lambda) = e^{i\lambda\psi(s_0)} \int b(t)\, e^{i\lambda t^3}\,dt, \quad b \in C_0^\infty(\mathbb{R}).$

Parallel to (7.10)-(7.11), we can set $x = t^3$ and write

(7.14)
$$\begin{aligned}
I(\lambda) &= \frac{1}{3}\, e^{i\lambda\psi(s_0)} \int_{-\infty}^{\infty} b(x^{1/3})\, x^{-2/3}\, e^{i\lambda x}\,dx \\
&\sim e^{i\lambda\psi(s_0)}\, \lambda^{-1/3} \bigl[a_0 + a_1\lambda^{-1/3} + \cdots\bigr].
\end{aligned}$$

Note that the exponent in $\lambda^{-1/3}$ here differs from the exponent in $\lambda^{-1/2}$, which
appears in (7.11).
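
The $\lambda^{-1/3}$ decay rate at a simply degenerate critical point can also be observed numerically. The sketch below is an illustration under assumptions not in the text: it uses the model phase $t^3$ with the Gaussian amplitude $b(t) = e^{-t^2}$ (rapidly decreasing, not compactly supported), and the function names are hypothetical. The comparison value uses the classical integral $\int_{-\infty}^{\infty} \cos(\lambda t^3)\,dt = \Gamma(1/3)/(\sqrt{3}\,\lambda^{1/3})$.

```python
import math

def cubic_phase_integral(lam, half_width=6.0, n=12000):
    """Trapezoid rule for the integral of exp(-t^2) exp(i lam t^3) dt.

    The imaginary part vanishes because the amplitude is even and sin(lam t^3) is odd,
    so only the cosine part is summed."""
    h = 2.0 * half_width / n
    total = 0.0
    for k in range(n + 1):
        t = -half_width + k * h
        w = 0.5 if k in (0, n) else 1.0
        total += w * math.exp(-t * t) * math.cos(lam * t ** 3)
    return total * h

def cubic_leading_term(lam):
    """b(0) * Gamma(1/3) / (sqrt(3) lam^(1/3)): the leading term in (7.14) for this model."""
    return math.gamma(1.0 / 3.0) / (math.sqrt(3.0) * lam ** (1.0 / 3.0))

lam = 30.0
I_cubic = cubic_phase_integral(lam)
rel_err_cubic = abs(I_cubic - cubic_leading_term(lam)) / cubic_leading_term(lam)
```

For this particular amplitude the first corrections happen to vanish, so the leading term is already accurate to a fraction of a percent at $\lambda = 30$.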

Now we want to examine the uniform asymptotic behavior of (7.8), as $\lambda \to \infty$,
for $x$ in a neighborhood of a caustic point $x_0$. We will retain the hypothesis on the
curvature made above, namely $x_0 = F_t(y_0)$ with $\kappa(y_0) = 1/t$, $y_0 \in \Sigma_\beta$, and $\kappa$
not stationary on $\Sigma_\beta$ at $y_0$, so the geometry of $F_t(\Sigma_\gamma)$ for $\gamma$ near $\beta$ is as illustrated
in Fig. 7.3. Thus portions of $F_t(\Sigma_\gamma)$ lie on one side of the caustic set $C_t$, namely,
the image of the critical set of $F_t$.

Take a point $x$ on this side of $C_t$, as illustrated in Fig. 7.4. For such $x$, the circle
$S_t(x)$ is simply tangent to two level sets of $\varphi$, at points $y_1$ and $y_2$, as indicated
in Figs. 7.4 and 7.5, and as $x$ approaches $C_t$, the points $y_1$ and $y_2$ coalesce, to a
point $y$ such as depicted in Fig. 7.3. Consequently, if $\psi(s) = \psi(t, x, s) = \varphi(x +
t\,\mathrm{cis}(s))$, then for $x$ on one side of $C_t$, $\psi(s)$ has two nondegenerate critical points,
$s_1$ and $s_2$, which coalesce to a single degenerate critical point $s_0$ as $x$ approaches
$C_t$.

The side of $C_t$ on which such $x$ lies is foliated in two ways, by level sets of
$\varphi(t, \cdot)$. This arises because the graph of $d\varphi$, a Lagrangian submanifold of $T^*\mathbb{R}^2$,
is mapped by the time-$t$ geodesic flow to a Lagrangian manifold $\Lambda_t \subset T^*\mathbb{R}^2$,
whose projection

(7.15) $\pi : \Lambda_t \to \mathbb{R}^2$

onto $x$-space has a simple fold, $\mathcal{C}_t$, mapped by $\pi$ onto the caustic set $C_t$. In other
words, $D\pi(p) : T_p\Lambda_t \to \mathbb{R}^2$ isomorphically for $p \in \Lambda_t \setminus \mathcal{C}_t$, while $D\pi(p)$ has
rank 1 for $p \in \mathcal{C}_t$, and the degeneration is of first order. A fold map between two
two-dimensional regions is illustrated in Fig. 7.6. The following result elucidates
the structure of such a folded Lagrangian manifold.

FIGURE 7.5 Caustics V


Lemma 7.1. Fix $t > 0$. Given $x_0 \in C_t$, there exist smooth functions $\theta$ and $\rho$,
defined on a neighborhood $U$ of $x_0$, with the following properties.

(7.16) $\rho = 0, \quad d\rho \ne 0, \quad \text{on } C_t.$

If $U^+ = \{x \in U : \rho(x) > 0\}$, then $\Lambda_t$ projects onto $\overline{U^+}$ and is the graph of
$d\varphi^{\pm}$, where $\varphi^{\pm}$ is the “double-valued” function

(7.17) $\varphi^{\pm}(x) = \theta(x) \pm \tfrac{2}{3}\,\rho(x)^{3/2}.$

FIGURE 7.6 Folding


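Proof. That (7.15) is a fold implies that, over $U^+$, $\Lambda_t$ is the graph of a
“double-valued” closed 1-form, i.e., of $d\varphi^{\pm}$. We can put $\varphi^{\pm}$ in the form (7.17) by taking

(7.18) $\theta(x) = \tfrac{1}{2}\bigl(\varphi^+(x) + \varphi^-(x)\bigr), \quad \rho(x) = \Bigl[\tfrac{3}{4}\bigl(\varphi^+(x) - \varphi^-(x)\bigr)\Bigr]^{2/3}.$

We need to establish that $\theta$ and $\rho$ are smooth on the closure of $U^+$ in $U$, in
particular at $C_t$. This is best seen by constructing a function $\Phi \in C^\infty(\Lambda_t)$ such
that $\varphi^{\pm} = \Phi \circ \pi^{-1}$. In fact, if $\kappa = \sum \xi_j\,dx_j$ is the contact form on $T^*\mathbb{R}^2$ and
$\iota : \Lambda_t \hookrightarrow T^*\mathbb{R}^2$, then $\iota^*\kappa$ is closed, hence locally exact, and we take $\Phi$ such that
$d\Phi = \iota^*\kappa$. Compare Exercise 5 in §15 of Chap. 1. There is a smooth involution $j$
on $\Lambda_t$, interchanging points with the same image under $\pi$, and we can set

(7.19) $\Theta = \tfrac{1}{2}\,(\Phi + \Phi \circ j), \quad R = \Bigl[\tfrac{3}{4}\,(\Phi - \Phi \circ j)\Bigr]^{2/3}.$

These formulas define $\Theta$ and $R$ as functions on $\Lambda_t$ that are invariant under $j$,
related to (7.18) by

(7.20) $\Theta = \theta \circ \pi, \quad R = \rho \circ \pi.$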
It is clear that $\Theta$ is smooth, and hence the desired smoothness of $\theta$ is established.
To examine the smoothness of $R$ and $\rho$, we reason as follows. Since (7.15) is
assumed to be a fold, we know that, for $x \in U^+$ close to $x_0$,

$$C_1\,\delta(x)^{1/2} \le |d\varphi^+(x) - d\varphi^-(x)| \le C_2\,\delta(x)^{1/2},$$

for some $C_j \in (0, \infty)$, where $\delta(x) = \mathrm{dist}(x, C_t)$. This implies that

$$C_3\,\delta(x)^{3/2} \le |\varphi^+(x) - \varphi^-(x)| \le C_4\,\delta(x)^{3/2},$$

and hence, for $z \in \Lambda_t$ close to $z_0 = \pi^{-1}(x_0)$,

$$C_5\,\tilde\delta(z)^3 \le |\Phi(z) - \Phi(j(z))| \le C_6\,\tilde\delta(z)^3,$$

where $\tilde\delta(z) = \mathrm{dist}(z, \mathcal{C}_t)$, with $\mathcal{C}_t \subset \Lambda_t$ the fold set. This implies that $R$, defined
in (7.19), is smooth on $\Lambda_t$, which in turn yields the desired smoothness of $\rho$, and also
that $d\rho(x) \ne 0$ on $C_t$.


We now establish a result that puts the phase function in (7.8) into a normal
form, near $C_t$.

Proposition 7.2. Fix $t > 0$ and take $x_0 \in C_t$. For $x$ near $x_0$, there is a family
of diffeomorphisms of $\mathbb{R}$, depending smoothly on the parameter $x$, transforming
$\psi(s) = \psi(t, x, s) = \varphi(x + t\,\mathrm{cis}(s))$, for $s$ near the stationary point $s_0$ of
$\psi_0(s) = \psi(t, x_0, s)$, to

(7.21) $\tilde\psi(s) = \tfrac{1}{3}\,s^3 - \rho(x)\,s + \theta(x),$

near $s = 0$.


Proof. We first note that at $x = x_0$ (so $\rho = 0$), $\psi(s)$ can be transformed to $s^3/3 +
\theta(x_0)$, as the argument leading to (7.13) shows. We can therefore consider the
following situation. Suppose $\psi(\tau, s)$ is smooth,

(7.22) $\psi(0, s) = \tfrac{1}{3}\,s^3, \quad \partial_\tau\partial_s\psi(0, 0) < 0.$

We want a smooth map, of the form

(7.23) $(\tau, s) \mapsto (\tau, f(\tau, s)),$

transforming $\psi$ to $\tilde\psi(\tau, s) = s^3/3 - \rho(\tau)s + \theta(\tau)$, where $\theta$ is determined as
follows, for $\tau > 0$. By (7.22), for $\tau$ small and positive, $\psi(\tau, \cdot)$ has two critical points,
close to 0, at $s = s_1(\tau), s_2(\tau)$, and we take $\theta(\tau) = (\psi(\tau, s_1) + \psi(\tau, s_2))/2$. The
set

(7.24) $\Gamma = \{(\tau, s) : \partial_s\psi(\tau, s) = 0\}$

is a curve tangent to the $s$-axis at $(0, 0)$, as pictured in Fig. 7.7. There is a smooth
involution of $\Gamma$, interchanging the points with the same $\tau$-coordinate, and $\theta(\tau)$
is the value of the symmetrization of $\psi$ with respect to this involution, so $\theta$ is
easily seen to be a smooth function of $\tau$.

FIGURE 7.7 The Curve $\Gamma$

We may as well subtract $\theta(\tau)$ and try to achieve the form $\tilde\psi(\tau, s) = s^3/3 -
\rho(\tau)s$. Note that, in this model case, the analogue of (7.24) is

(7.25) $\tilde\Gamma = \{(\tau, s) : s = \pm\sqrt{\rho}\}, \quad \tilde\psi(\tau, \pm\sqrt{\rho}) = \mp\tfrac{2}{3}\,\rho^{3/2}.$

So $\rho(\tau)$ is uniquely defined for $\tau > 0$ by the requirement that $\rho(\tau) > 0$ for $\tau > 0$
and

(7.26) $\psi = \mp\tfrac{2}{3}\,\rho(\tau)^{3/2}, \quad \text{on } \Gamma.$

To put it simply, $\pm(2/3)\rho(\tau)^{3/2}$ are the critical values of $\psi(\tau, s)$, as a function
of $s$ (once $\theta(\tau)$ has been subtracted). Given that now $\psi$ has been arranged to
be odd with respect to the involution of $\Gamma$ described above, it is easy to show
that $\rho(\tau)$ is a smooth function of $\tau$, via the sort of argument used in the proof of
Lemma 7.1. Also, $d\rho \ne 0$ at $\tau = 0$.

Having specified $\rho(\tau)$, we start to construct the diffeomorphism, of the form
(7.23). We want $f(0, s) = s$. For $\tau > 0$, the fact that $\psi(\tau, \cdot)$ and $\tilde\psi(\tau, \cdot)$ have
identical ranges, for $s \le -\sqrt{\rho(\tau)}$, for $-\sqrt{\rho(\tau)} \le s \le \sqrt{\rho(\tau)}$, and for $s \ge
\sqrt{\rho(\tau)}$, implies that there is a unique homeomorphism $s \mapsto f(\tau, s)$ transforming
$\psi(\tau, \cdot)$ to $\tilde\psi(\tau, \cdot)$. This homeomorphism is clearly a diffeomorphism (as a function
of $s$), away from $s = \pm\sqrt{\rho(\tau)}$, and, by the sort of argument leading to (7.10),
we see that, for each fixed $\tau > 0$, it is a diffeomorphism in a neighborhood of these
points too. For $\tau < 0$, both $\psi(\tau, \cdot)$ and $\tilde\psi(\tau, \cdot)$ have no critical points (near 0), so
the existence of a unique diffeomorphism $s \mapsto f(\tau, s)$ transforming $\psi$ to $\tilde\psi$ is easy.

The continuous dependence of $f(\tau, s)$ on $\tau$ is easy to establish, but the smooth
dependence on $\tau$, at $\tau = 0$, is a bit more subtle, so we finally turn to that point.
We will use a device similar to that used in the proof of the Morse lemma (given
in Appendix C, §8).

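We may as well use $\rho$ instead of $\tau$ as a coordinate, so we assume we have a
smooth function $\psi(\rho, s)$ satisfying

(7.27) $\psi(0, s) = \tfrac{1}{3}\,s^3, \quad \partial_\rho\partial_s\psi(0, 0) < 0,$

and, for $\rho > 0$,

(7.28) $\partial_s\psi(\rho, \pm\sqrt{\rho}) = 0, \quad \psi(\rho, \pm\sqrt{\rho}) = \mp\tfrac{2}{3}\,\rho^{3/2}.$

We want to produce a diffeomorphism of the form $(\rho, s) \mapsto (\rho, f(\rho, s))$, such
that $f(0, s) = s$, transforming $\psi$ to

(7.29) $\tilde\psi(\rho, s) = \tfrac{1}{3}\,s^3 - \rho s,$

a function that also satisfies (7.28). Now consider the family of functions
connecting $\psi$ and $\tilde\psi$:

(7.30) $\Psi(\sigma, \rho, s) = (1 - \sigma)\,\psi(\rho, s) + \sigma\bigl(\tfrac{1}{3}\,s^3 - \rho s\bigr).$

Thus $\Psi(0, \rho, s) = \psi(\rho, s)$, $\Psi(1, \rho, s) = s^3/3 - \rho s$, and, for any fixed $\sigma \in \mathbb{R}$,
$\Psi(\sigma, \rho, s)$ satisfies (7.27) and (7.28). We will construct a family of diffeomorphisms
$s \mapsto F(\sigma, \rho, s) = F_{\sigma,\rho}(s)$, transforming $\psi(\rho, \cdot)$ to $\Psi(\sigma, \rho, \cdot)$, generated
by a smooth family of vector fields on a neighborhood of 0 in $\mathbb{R}$: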


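(7.31) $X(\sigma, \rho, s) = \xi(\sigma, \rho, s)\,\frac{\partial}{\partial s}.$

Given $X(\sigma, \rho, s)$, $F$ is defined by $F(0, \rho, s) = s$, and

(7.32) $\frac{\partial}{\partial\sigma} F(\sigma, \rho, s) = \xi(\sigma, \rho, F).$

If $F^*_{\sigma,\rho}\,g(s) = g(F_{\sigma,\rho}(s))$, then

(7.33) $\frac{\partial}{\partial\sigma} F^*_{\sigma,\rho}\,g_\sigma(s) = F^*_{\sigma,\rho}\,X_{\sigma,\rho}\,g_\sigma + F^*_{\sigma,\rho}\Bigl(\frac{\partial}{\partial\sigma}\,g_\sigma\Bigr),$

and this quantity vanishes, provided

(7.34) $\xi(\sigma, \rho, s)\,\frac{\partial g_\sigma}{\partial s} = -\frac{\partial g_\sigma}{\partial\sigma}.$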
Applying this to $g_\sigma(s) = g_{\sigma,\rho}(s) = \Psi(\sigma, \rho, s) = \Psi_{\sigma,\rho}(s)$, we have

(7.35) $F^*_{\sigma,\rho}\,\Psi_{\sigma,\rho}(s) = \Psi_{0,\rho}(s), \quad \forall\,\sigma,$

provided

(7.36) $\xi(\sigma, \rho, s) = -\,\frac{\tfrac{1}{3}\,s^3 - \rho s - \psi(\rho, s)}{(1 - \sigma)\,\partial_s\psi(\rho, s) + \sigma\,(s^2 - \rho)}.$

Now the denominator of this fraction vanishes on $\Gamma$, but by (7.27) the gradient
of the denominator does not vanish on $\Gamma$. Meanwhile, the numerator vanishes
to second order on $\Gamma$, so the quotient $\xi$ is $C^\infty$ and vanishes on $\Gamma$. Thus (7.31)
generates a smooth flow (which leaves $\Gamma$ invariant, of course), and the proof of
Proposition 7.2 is complete.


In order to analyze (7.8), we are now led to discuss the asymptotic evaluation,
as $\lambda \to \infty$, of integrals of the form

(7.37) $I(a; \mu, \lambda) = \frac{1}{2\pi} \int_{-\infty}^{\infty} a(s)\, e^{i\lambda(s^3/3 - \mu s)}\,ds,$

given $a \in C_0^\infty(\mathbb{R})$. Such integrals are called Airy integrals. We fix $K < \infty$ and
assume $|\mu| \le K^2$. The phase function $\varphi_\mu(s) = s^3/3 - \mu s$ has derivative $\varphi_\mu'(s) =
s^2 - \mu$, with roots $s = \pm\sqrt{\mu}$, which are the stationary points of the phase when
$\mu \ge 0$.

Our first goal is to show simultaneously that the uniform asymptotic behavior
of (7.37), as $\lambda \to \infty$, $|\mu| \le K^2$, depends only on $a(s)$ for $-2K \le s \le 2K$,
and that (7.37) makes sense for a wider class of amplitudes $a(s)$; namely we
allow $a(s) \in S_1^m(\mathbb{R})$, for some $m \in \mathbb{R}$ (i.e., $|D_s^j a(s)| \le C_j \langle s \rangle^{m-j}$). A general
$a \in S_1^m(\mathbb{R})$ can be written as a sum of a term in $C_0^\infty(-2K, 2K)$ and a term in
$S_1^m(\mathbb{R})$ which vanishes on $[-(3/2)K, (3/2)K]$. If $a_2(s)$ has the latter property,
then we can make a change of variable, $y = \varphi_\mu(s)$, and write

(7.38) $I(a_2; \mu, \lambda) = \frac{1}{2\pi} \int b_\mu(y)\, e^{i\lambda y}\,dy, \quad b_\mu(y) \in S_1^{(m-2)/3}(\mathbb{R}),$

where $b_\mu(y)$ depends smoothly on $\mu$. We know that $\hat{b}_\mu(\lambda)$ is an element of $\mathcal{S}'(\mathbb{R})$
that is smooth on $\mathbb{R} \setminus 0$ and rapidly decreasing as $|\lambda| \to \infty$, from material in §8
of Chap. 3. Thus we can take $a \in S_1^m(\mathbb{R})$ in (7.37).

In particular, we can take $a(s) = 1$, obtaining

(7.39) $I(1; \mu, \lambda) = \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{i\lambda(s^3/3 - \mu s)}\,ds = \lambda^{-1/3}\,\mathrm{Ai}(-\mu\lambda^{2/3}), \quad \lambda > 0,$

where $\mathrm{Ai}(z)$ is the Airy function

(7.40) $\mathrm{Ai}(x) = \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{i(s^3/3 + xs)}\,ds = I(1; -x, 1),$

for $x \in \mathbb{R}$. If we set $\mu = \pm 1$, $\lambda = x^{3/2}$ in (7.39), we have

(7.41)
$$\begin{aligned}
\mathrm{Ai}(x) &= x^{1/2}\, I(1; -1, x^{3/2}), \\
\mathrm{Ai}(-x) &= x^{1/2}\, I(1; 1, x^{3/2}),
\end{aligned}$$

for $x > 0$. In these cases, $\mu$ is a fixed, nonzero quantity, and we can apply the
stationary phase method, to get

(7.42) $\mathrm{Ai}(x) = O(x^{-\infty}), \quad \mathrm{Ai}(-x) \sim \frac{1}{\sqrt{\pi}}\, x^{-1/4} \cos\Bigl(\frac{2}{3}\,x^{3/2} - \frac{\pi}{4}\Bigr),$

as $x \to +\infty$. Let us also note that, since $\mathrm{Ai}(x)$ (as an element of $\mathcal{S}'(\mathbb{R})$) is the
inverse Fourier transform of $e^{is^3/3}$, which satisfies an obvious first-order, linear
ODE, then $\mathrm{Ai}(x)$ satisfies the differential equation


(7.43) $\mathrm{Ai}''(x) - x\,\mathrm{Ai}(x) = 0,$

known as Airy's equation. It follows that $\mathrm{Ai}(x)$ continues to an entire holomorphic
function on the complex plane. The graph of $\mathrm{Ai}(x)$ is shown in Fig. 7.8.
It was constructed by numerically integrating (7.43), using initial data

(7.44) $\mathrm{Ai}(0) = \frac{3^{-2/3}}{\Gamma(2/3)}, \quad \mathrm{Ai}'(0) = -\frac{3^{-1/3}}{\Gamma(1/3)}.$

Note that $\mathrm{Ai}(x)$ is real for $x \in \mathbb{R}$. In fact, (7.40) can be written as

(7.45) $\mathrm{Ai}(x) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \cos\Bigl(\frac{1}{3}\,s^3 + xs\Bigr)\,ds.$
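
The numerical integration of (7.43) from the initial data (7.44) is easy to reproduce. The sketch below is one possible way to do it, not necessarily the scheme used for Fig. 7.8: it uses the classical fourth-order Runge-Kutta method, and the function names and step size are assumptions. It also compares the result at a negative argument with the oscillatory asymptotic formula in (7.42).

```python
import math

def airy_ai(x, steps_per_unit=2000):
    """Integrate Ai'' = x Ai from 0 to x with classical RK4; returns (Ai(x), Ai'(x)).

    Initial data are (7.44). Intended for moderate |x|: for large positive x the
    growing solution of Airy's equation amplifies rounding errors."""
    y = 3.0 ** (-2.0 / 3.0) / math.gamma(2.0 / 3.0)      # Ai(0)
    dy = -(3.0 ** (-1.0 / 3.0)) / math.gamma(1.0 / 3.0)  # Ai'(0)
    n = max(1, int(abs(x) * steps_per_unit))
    h = x / n
    t = 0.0
    for _ in range(n):
        k1y, k1d = dy, t * y
        k2y, k2d = dy + 0.5 * h * k1d, (t + 0.5 * h) * (y + 0.5 * h * k1y)
        k3y, k3d = dy + 0.5 * h * k2d, (t + 0.5 * h) * (y + 0.5 * h * k2y)
        k4y, k4d = dy + h * k3d, (t + h) * (y + h * k3y)
        y += h * (k1y + 2 * k2y + 2 * k3y + k4y) / 6.0
        dy += h * (k1d + 2 * k2d + 2 * k3d + k4d) / 6.0
        t += h
    return y, dy

def airy_oscillatory_asymptotic(x):
    """Right side of the second formula in (7.42), approximating Ai(-x) for large x > 0."""
    return x ** (-0.25) / math.sqrt(math.pi) * math.cos(2.0 / 3.0 * x ** 1.5 - math.pi / 4.0)

ai_at_1, _ = airy_ai(1.0)
ai_at_minus_10, _ = airy_ai(-10.0)
asymptotic_at_10 = airy_oscillatory_asymptotic(10.0)
```

At $x = 10$ the asymptotic formula already agrees with the integrated value of $\mathrm{Ai}(-10)$ to within a few percent.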
Taking the Airy function as a basic special function, we see that (7.39) gives 
the uniform asymptotic behavior of (7.37), for js in any bounded interval, in the 
case a = 1. We now seek a uniform asymptotic expansion of (7.37), of a similar 
form, for general a € S7”(R). In fact, the general case will involve both the Airy 
function and its derivative: 


(7.46) Ai (x) = — / g ells?/3+%58) dg. 
27 Joo 


FIGURE 7.8 The Airy Function

To obtain it, write

(7.47)  $a(s) = a_0 + a_1 s + b_\mu(s)(\mu - s^2),$

where $b_\mu(s) \in S_1^{\ell}(\mathbb{R})$, $\ell = \max(m/2, 1/2)$, with smooth dependence on $\mu$. Then,
for $\lambda > 0$,


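(7.48)  $I(a; \mu, \lambda) = a_0 \lambda^{-1/3}\, \mathrm{Ai}(-\mu\lambda^{2/3}) - i a_1 \lambda^{-2/3}\, \mathrm{Ai}'(-\mu\lambda^{2/3}) - \frac{i}{\lambda}\, I(b_\mu'; \mu, \lambda),$

where we have used

(7.49)  $(\mu - s^2)\, e^{i\lambda(s^3/3 - \mu s)} = \frac{i}{\lambda} \frac{d}{ds}\, e^{i\lambda(s^3/3 - \mu s)}$

and integration by parts to evaluate

(7.50)  $\int b_\mu(s)(\mu - s^2)\, e^{i\lambda(s^3/3 - \mu s)}\, ds.$

Now we can apply the same transformation to $I(b_\mu'; \mu, \lambda)$ and iterate this argument
arbitrarily often, to establish the following result: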


Proposition 7.3. Given $a \in S_1^m(\mathbb{R})$, as $\lambda \to +\infty$, we have

(7.51)  $I(a; \mu, \lambda) \sim b_0(\mu, \lambda)\, \lambda^{-1/3}\, \mathrm{Ai}(-\mu\lambda^{2/3}) - i\, b_1(\mu, \lambda)\, \lambda^{-2/3}\, \mathrm{Ai}'(-\mu\lambda^{2/3}),$

where

(7.52)  $b_j(\mu, \lambda) \sim b_{j0}(\mu) + b_{j1}(\mu)\lambda^{-1} + b_{j2}(\mu)\lambda^{-2} + \cdots,$

and where $b_{jk}(\mu)$ are smooth in $\mu$. The expansion (7.51) is valid uniformly for
$|\mu| < K^2$, for any fixed $K < \infty$.

When we combine this with an application of Proposition 7.2, we have the following
result on the behavior of (7.8) near a caustic set.


Proposition 7.4. Granted the geometric hypotheses made on the formation of the
caustic set $\mathcal{C}$, before Lemma 7.1, the oscillatory integral (7.8) has the following
asymptotic behavior for $x$ near $\mathcal{C}$, as $\lambda \to +\infty$:

(7.53)  $v(t, x, \lambda) = \lambda^{-1/3} \bigl[ b_0(t, x, \lambda)\, \mathrm{Ai}(-\rho(t, x)\lambda^{2/3}) - i\lambda^{-1/3}\, b_1(t, x, \lambda)\, \mathrm{Ai}'(-\rho(t, x)\lambda^{2/3}) \bigr]\, e^{i\lambda\theta(t, x)}$

mod $O(\lambda^{-\infty})$, where

(7.54)  $b_j(t, x, \lambda) \sim b_{j0}(t, x) + b_{j1}(t, x)\lambda^{-1} + b_{j2}(t, x)\lambda^{-2} + \cdots,$

and the functions $\rho(t, x)$, $\theta(t, x)$, and $b_{jk}(t, x)$ are smooth in $(t, x)$.

Finally, $u(t, x) = u(t, x, \lambda)$ in (7.6) has a similar expansion, where the leading
factor $\lambda^{-1/3}$ above is replaced by $\lambda^{1/6}$, as will follow from results in Chap. 7.

The next order of complexity of a caustic is illustrated in Fig. 7.9. It arises
when we alter our hypothesis on the curvature of level curves of $\varphi$. In this case, $z$
is a point on a level curve at which $\kappa$ is stationary, in fact a (nondegenerate) local
maximum, such that $\kappa(z) = 1/t$. On nearby curves, this is not a locally maximum
value of $\kappa$; the set where $\kappa = 1/t$ is denoted $K_t$ and is mapped by $F_t$ onto the
caustic set $C_t$, which is singular at the "cusp" $v = F_t(z)$. The asymptotic behavior
of the functions (7.6) and (7.8) on a neighborhood of $v$ is more complicated than
(7.53). A discussion of this (and more complicated caustics) can be found in the
last chapter of [GS]. See also [AVG] and [Dui].

In the last chapter of [GS] one can also find an analysis of the wave equation 
near a caustic of the fold type considered above, making use of a result similar 
to Lemma 7.1, but replacing the use of Proposition 7.2 by results in “microlocal 
analysis.” The next chapter of this work includes a brief introduction to this area; 
other applications of microlocal analysis to topics in wave propagation can be 
found in Vols. 3-4 of [Ho], and in [Tay]. For other approaches to the type of 
caustic considered here, see [Lud] and references given therein. 


Exercises 


1. Fix $r > 0$. Let $\gamma_r \in \mathcal{E}'(\mathbb{R}^2)$ denote the unit-mass density on the circle of radius $r$:

   $\langle u, \gamma_r \rangle = \frac{1}{2\pi} \int_0^{2\pi} u(r\cos\theta, r\sin\theta)\, d\theta.$

   Show that there exist

   $\alpha_r(\lambda) \sim \lambda^{-1/2} (\alpha_{0r} + \alpha_{1r}\lambda^{-1} + \cdots),$
   $\beta_r(\lambda) \sim \lambda^{1/2} (\beta_{0r} + \beta_{1r}\lambda^{-1} + \cdots),$

   such that, modulo $O(|\xi|^{-\infty})$,

(7.55)  $\hat\gamma_r(\xi) = \alpha_r(|\xi|) \cos r|\xi| + \beta_r(|\xi|)\, \frac{\sin r|\xi|}{|\xi|}, \quad |\xi| \to \infty.$

   (Hint: Use the stationary phase method.)
   Compare with formula (6.56) in Chap. 3, in the case $\nu = 0$, in view of the identity

   $\hat\gamma_r(\xi) = c\, J_0(r|\xi|).$

FIGURE 7.9 More Complex Caustic


2. Give a proof that if $f \in C^\infty(\mathbb{R})$ and $f(x) = f(-x)$, then there exists
$g \in C^\infty(\mathbb{R})$ such that $f(x) = g(x^2)$. (Hint: For fixed but large $k$, compare $f(\sqrt{x})$ with
$\sum_{j \le k} f^{(2j)}(0)\, x^j/(2j)!$. Show that if $F \in C^\infty(\mathbb{R})$ vanishes at $x = 0$ to order $2k + 1$,
then $F(\sqrt{x})$ belongs to $C^k([0, \infty))$.)

3. Extend the result of Exercise 2 to show that if (7.15) is a fold and $f \in C^\infty(\Lambda_t)$ is
invariant under the involution $j$ of $\Lambda_t$, which interchanges points with the same image
under $\pi$, then there exists $g \in C^\infty(\mathbb{R}^2)$ such that $f(x) = g(\pi(x))$. This result is used
in the proof of Lemma 7.1 and that of Proposition 7.2.

For more material on folds, see [GoG]. 
4. Suppose R? is replaced by R® in (7.3). Analyze the following variant of (7.8): 


vo= 7, fw) asey=Z fate + yee asty) 


ly—a|=t S2


Recall the formula for the Riemann function in this case, and relate v(t, x) to u(t, x). 


Exercises on the Airy function 


1. Show that

   $\mathrm{Ai}(z) = \frac{1}{2\pi i} \int_L e^{v^3/3 - zv}\, dv,$

   where $L$ is any contour in $\mathbb{C}$ that begins at a point at infinity in the sector
   $-\pi/2 < \arg(v) < -\pi/6$ and ends at infinity in the sector $\pi/6 < \arg(v) < \pi/2$. The integral
   on the right is convergent for all $z \in \mathbb{C}$.

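2. Show that, for $|\arg z| < \pi$,

(7.56)  $\mathrm{Ai}(z) = \frac{1}{2\pi}\, e^{-(2/3) z^{3/2}} \int_0^\infty \cos\Bigl(\frac{1}{3} t^{3/2}\Bigr) \exp\bigl(-t z^{1/2}\bigr)\, t^{-1/2}\, dt = \Psi(z)\, e^{-(2/3) z^{3/2}},$

   where

(7.57)  $\Psi(z) \sim z^{-1/4} \sum_{j=0}^{\infty} a_j z^{-3j/2}, \quad a_0 = \frac{1}{2} \pi^{-1/2}, \quad z \to \infty,\ |\arg z| \le \pi - \delta.$

   In particular,

   $\mathrm{Ai}(x) \sim \frac{1}{2} \pi^{-1/2}\, x^{-1/4}\, e^{-(2/3) x^{3/2}}, \quad x \to +\infty.$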


3. If we set $A_\pm(z) = \mathrm{Ai}(e^{\mp 2\pi i/3} z)$, show that $A_\pm(z)$ also satisfies the Airy equation.
   Evaluate $A_\pm(x)$ asymptotically as $x \to +\infty$, showing that $|A_\pm(x)| \to \infty$ as $x \to +\infty$.
   Show that any two of the functions $\mathrm{Ai}$, $A_+$, $A_-$ form a basis of solutions to
   Airy's equation $u''(z) - z u(z) = 0$. Show that

(7.58)  $A_-(z) = \overline{A_+(\bar z)}, \quad \text{and} \quad \mathrm{Ai}(z) = e^{\pi i/3} A_+(z) + e^{-\pi i/3} A_-(z).$


   Using Exercise 2, note that, for $x > 0$,

   $A_\pm(-x) = \Psi(e^{\pm \pi i/3} x)\, e^{\mp (2/3) i x^{3/2}},$

   which, in light of (7.57), implies the second part of (7.42).

4. Show that

(7.59)  $\mathrm{Ai}(z) = \frac{1}{\pi} \Bigl(\frac{z}{3}\Bigr)^{1/2} K_{1/3}\Bigl(\frac{2}{3} z^{3/2}\Bigr),$

   where $K_\nu(r)$ is the modified Bessel function, defined by (6.50) of Chap. 3 (and
   satisfying the modified Bessel equation (6.52)). (Hint: Denoting the right side of (7.59) by
   $u(z)$, show that $u(z)$ satisfies Airy's equation and has the same asymptotic behavior as
   $\mathrm{Ai}(z)$, as $z \to +\infty$ in $\mathbb{R}^+$. For the behavior of $K_\nu(r)$ as $r \to +\infty$, use Exercise 2 in
   §6 of Chap. 3.)

5. Show that

(7.60)  $\mathrm{Ai}(0) = \frac{3^{-1/6}}{2\pi}\, \Gamma\Bigl(\frac{1}{3}\Bigr) = \frac{3^{-2/3}}{\Gamma(2/3)},$

   as asserted in (7.44). (Hint: Show that, given $\nu > 0$,

   $K_\nu(r) \sim \Gamma(\nu)\, 2^{\nu - 1} r^{-\nu}$, as $r \searrow 0$.

   For the last identity in (7.60), use $\Gamma(z)\Gamma(1 - z) = \pi/(\sin \pi z)$.)
6. With $A_\pm(z)$ as in Exercise 3, establish the Wronskian relation

   $A_+'(z) A_-(z) - A_+(z) A_-'(z) = \frac{i}{2\pi}.$

   (Hint: Once you show that the left side is a constant $c$, use the asymptotic behavior
   of $A_\pm(x)$ as $x \to +\infty$, obtained via Exercise 2, and the corresponding asymptotic
   behavior of $A_\pm'(x)$, to evaluate $c$.)

7. Deduce from Exercises 5 and 6 that

(7.61)  $\mathrm{Ai}'(0) = -\frac{1}{2\pi}\, 3^{1/6}\, \Gamma\Bigl(\frac{2}{3}\Bigr) = -\frac{3^{-1/3}}{\Gamma(1/3)},$

   as asserted in (7.44).


8. Show that 
P(3) 27, (3) Cae mace (3) 
ra) ve \6)’ TE)” ve \6)’ 
Using $\Gamma(1/3)\Gamma(2/3) = 2\pi/\sqrt{3}$, relate $\Gamma(1/3)^2$ and $\Gamma(2/3)^2$ to $\Gamma(1/6)$. (Hint: Use
the duplication formula for the gamma function, established in Chap. 3.)
9. Consider the problem of deriving numerical approximations to $\Gamma(1/3)$ and $\Gamma(2/3)$. Try
to obtain 10-digit approximations to these quantities. Then write a computer program
to produce the graph of $y = \mathrm{Ai}(x)$, shown in Fig. 7.8, by solving (7.43) numerically.
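
One possible approach to the second part of Exercise 9 is sketched below in Python (our choice of language; the text does not prescribe one). It integrates Airy's equation (7.43) by the classical fourth-order Runge-Kutta method, starting from the initial data (7.44), with $\Gamma(1/3)$ and $\Gamma(2/3)$ taken from the standard library routine `math.gamma`. The function names `airy_initial_data` and `airy_rk4` and the default step count are illustrative assumptions, not part of the text.

```python
import math


def airy_initial_data():
    """Initial data (7.44): Ai(0) and Ai'(0), expressed through the gamma function."""
    ai0 = 3.0 ** (-2.0 / 3.0) / math.gamma(2.0 / 3.0)
    dai0 = -(3.0 ** (-1.0 / 3.0)) / math.gamma(1.0 / 3.0)
    return ai0, dai0


def airy_rk4(x_end, steps=4000):
    """Integrate y'' = x*y from x = 0 to x_end (either sign) by classical RK4.

    Returns the pair (y(x_end), y'(x_end)), approximating (Ai, Ai').
    """
    y, dy = airy_initial_data()
    h = x_end / steps  # negative step when x_end < 0
    x = 0.0
    for _ in range(steps):
        # First-order system: y' = dy, dy' = x * y.
        k1y, k1d = dy, x * y
        k2y, k2d = dy + 0.5 * h * k1d, (x + 0.5 * h) * (y + 0.5 * h * k1y)
        k3y, k3d = dy + 0.5 * h * k2d, (x + 0.5 * h) * (y + 0.5 * h * k2y)
        k4y, k4d = dy + h * k3d, (x + h) * (y + h * k3y)
        y += h * (k1y + 2.0 * k2y + 2.0 * k3y + k4y) / 6.0
        dy += h * (k1d + 2.0 * k2d + 2.0 * k3d + k4d) / 6.0
        x += h
    return y, dy
```

Sampling `airy_rk4(x)[0]` over a grid, say on $[-15, 5]$, gives points for a plot like Fig. 7.8. Forward integration far into $x > 0$ should be treated with care, since rounding errors excite the exponentially growing solution of (7.43).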


8. Boundary layer phenomena for the heat semigroup 


Let $\Omega$ be a compact Riemannian manifold with nonempty boundary. Let $\Delta$ denote
the Laplace–Beltrami operator on $\Omega$, with the Dirichlet boundary condition, i.e.,
with domain $\mathcal{D}(\Delta) = H^2(\Omega) \cap H_0^1(\Omega)$. As we have seen in §1, $\{e^{t\Delta} : t \ge 0\}$
is a strongly continuous contraction semigroup on $L^p(\Omega)$ for each $p \in [1, \infty)$.
Also $e^{t\Delta}$ is a contraction on $L^\infty(\Omega)$ and on $C(\overline{\Omega})$, for $t \ge 0$, but the family
is not strongly continuous as t \, 0. Indeed, given f € C(Q), it follows from 
(1.11) that u(t) = ef € sD, for each t > 0. In particular, u(t) is smooth on 
Q and vanishes on 02 for each t > 0, so u(t) cannot converge to f in sup-norm 
if f does not vanish identically on OQ. As noted in (1.15), we do have uniform 
convergence u(t) > f if f €C,(Q), i.e, f €C(Q) and flag = 0. Here we will 
show that $e^{t\Delta} f \to f$ uniformly on each compact subset of $\Omega$ when $f$ is continuous,
and smoothly when $f$ is smooth, and discuss the boundary layer phenomena that
arise on a small neighborhood of $\partial\Omega$ as $t \searrow 0$.

To accomplish this, we will use wave equation techniques, previewed in §2. To
start, if $\varphi \in \mathcal{S}(\mathbb{R})$ is an even function, we have

(8.1)  $\varphi(\sqrt{-\Delta}) f = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat\varphi(s) \cos s\sqrt{-\Delta}\, f\, ds,$

as follows from the Fourier inversion formula and the eigenfunction decomposition
of $L^2(\Omega)$. Taking $\varphi(\lambda) = \varphi_t(\lambda) = e^{-t\lambda^2}$ gives, as in (2.13),

(8.2)  $e^{t\Delta} f = \frac{1}{\sqrt{4\pi t}} \int_{-\infty}^{\infty} e^{-s^2/4t} \cos s\sqrt{-\Delta}\, f\, ds.$


Note that if we use (—A)* cos s/—A = (d/ds)?* cos s/—A and integrate by 
parts, we get 


3) (-A)—(V=A)P = | G(s) cos sV=A Fas, 
TT J—oco 
hence, since || coss/—Af||z2 < ||fllz2, 


(8.4) eV=B) Finan) SCs (f }9(s)| 4 II fllz2ca)- 


We now have the following localization result. 


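Proposition 8.1. Let $\mathcal{O}_j$ be smoothly bounded regions satisfying $\mathcal{O}_1 \subset\subset \mathcal{O}_0 \subset\subset \Omega$.
Let $f \in L^2(\Omega)$ and set $u(t) = e^{t\Delta} f$. Then

(8.5)  $f|_{\mathcal{O}_0} = 0 \Longrightarrow u \in C^\infty([0, \infty) \times \overline{\mathcal{O}}_1).$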


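Proof. Since $\partial_t^j u = \Delta^j u$, it suffices to show that $u(t)|_{\mathcal{O}_1}$ is bounded in $H^k(\mathcal{O}_1)$
for $t \in [0, 1]$, for each $k$. We proceed as follows. Pick $a > 0$ such that
$\mathrm{dist}(p, \Omega \setminus \mathcal{O}_0) > a$ for each $p \in \mathcal{O}_1$. Pick an even function $\psi_1 \in C_0^\infty(\mathbb{R})$ such that
$\psi_1(s) = 0$ for $|s| \ge a$, $\psi_1(s) = 1$ for $|s| \le a/2$, and set $\psi_2(s) = 1 - \psi_1(s)$. Using (8.2),
write

(8.6)  $e^{t\Delta} f = \Phi_1^t(\sqrt{-\Delta}) f + \Phi_2^t(\sqrt{-\Delta}) f,$

where

(8.7)  $\Phi_j^t(\sqrt{-\Delta}) f = \frac{1}{\sqrt{4\pi t}} \int_{-\infty}^{\infty} \psi_j(s)\, e^{-s^2/4t} \cos s\sqrt{-\Delta}\, f\, ds.$

Using (8.4), we have, for each $k, N \in \mathbb{N}$,

(8.8)  $\|\Phi_2^t(\sqrt{-\Delta}) f\|_{H^k(\Omega)} \le C_{k,N}\, t^N \|f\|_{L^2(\Omega)}.$

Meanwhile, if $f = 0$ on $\mathcal{O}_0$, then finite propagation speed gives

(8.9)  $|s| \le a \Longrightarrow \cos s\sqrt{-\Delta}\, f \big|_{\mathcal{O}_1} = 0,$

so $\Phi_1^t(\sqrt{-\Delta}) f = 0$ on $\mathcal{O}_1$, and hence

(8.10)  $e^{t\Delta} f = \Phi_2^t(\sqrt{-\Delta}) f$ on $\mathcal{O}_1$.

This proves the proposition.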


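Corollary 8.2. Take $\mathcal{O}_j$ as in Proposition 8.1, and $f \in L^2(\Omega)$. Then

(8.11)  $f|_{\mathcal{O}_0} \in H^k(\mathcal{O}_0) \Longrightarrow u \in C([0, \infty), H^k(\mathcal{O}_1)),$

and

(8.12)  $f|_{\mathcal{O}_0} \in C(\overline{\mathcal{O}}_0) \Longrightarrow u \in C([0, \infty) \times \overline{\mathcal{O}}_1).$

Proof. Take $\mathcal{O}_{1/2}$ such that $\mathcal{O}_1 \subset\subset \mathcal{O}_{1/2} \subset\subset \mathcal{O}_0$. In case (8.11), set $f = g + (f - g)$,
where $g \in H^k(\Omega)$ is supported in $\mathcal{O}_0$ and $f - g = 0$ on $\mathcal{O}_{1/2}$. We have $g \in \mathcal{D}_k$,
so $e^{t\Delta} g$ is continuous in $t \in [0, \infty)$ with values in $\mathcal{D}_k$. Meanwhile, Proposition
8.1 (with $\mathcal{O}_0$ replaced by $\mathcal{O}_{1/2}$) applies to $e^{t\Delta}(f - g)$.

In case (8.12), set $f = g + (f - g)$, where $g \in C(\overline{\Omega})$ is supported in $\mathcal{O}_0$ and
$f - g = 0$ on $\mathcal{O}_{1/2}$, and argue similarly, using the strong continuity of $\{e^{t\Delta} : t \ge 0\}$
on the space of functions in $C(\overline{\Omega})$ that vanish on $\partial\Omega$.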


We now take $f \in C^\infty(\overline{\Omega})$ and seek a detailed uniform analysis of $e^{t\Delta} f(x)$ as
$t \searrow 0$, for $x \in \Omega$, particularly for $x$ near $\partial\Omega$. We follow an approach taken in
[MaT], which arose in the investigation of fluid flows with small viscosity. For
more on this, see Chap. 17, §6. To proceed, we can assume $\Omega$ is an open subset of
a smooth, compact Riemannian manifold $M$, without boundary. Let $L$ denote the
Laplace–Beltrami operator on $M$, and let $\tilde f \in C^\infty(M)$ be an extension of $f$. We
have $e^{tL} \tilde f \in C^\infty([0, \infty) \times M)$, so, for each $k, N \in \mathbb{N}$,

(8.13)  $e^{tL} \tilde f(x) = \tilde f(x) + \sum_{k=1}^{N} \frac{t^k}{k!} L^k \tilde f(x) + R_N(t, x),$

with

(8.14)  $\|R_N(t, \cdot)\|_{C^k(M)} \le C_{k,N}\, t^{N+1}, \quad 0 < t \le 1.$


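We want to compare $e^{t\Delta} f$ and $e^{tL} \tilde f$ on $\mathbb{R}^+ \times \overline{\Omega}$. We use the wave equation
formulas (8.6)–(8.7), supplemented by

(8.15)  $e^{tL} \tilde f = \Phi_1^t(\sqrt{-L}) \tilde f + \Phi_2^t(\sqrt{-L}) \tilde f,$

with $\Phi_j^t$ as in (8.7), where we have picked $a > 0$ and constructed $\psi_j(s)$ as above
(8.6). As in (8.8), we have, for each $k, N \in \mathbb{N}$,

(8.16)  $\|\Phi_2^t(\sqrt{-L}) \tilde f\|_{H^k(M)} \le C_{k,N}\, t^N \|\tilde f\|_{L^2(M)}.$

Thus, modulo a negligible contribution, we have, for $x \in \Omega$,

(8.17)  $e^{t\Delta} f(x) - e^{tL} \tilde f(x) = \frac{1}{\sqrt{\pi t}} \int_0^\infty \psi_1(s)\, e^{-s^2/4t}\, V(s, x)\, ds,$

where

(8.18)  $V(s, x) = \cos s\sqrt{-\Delta}\, f(x) - \cos s\sqrt{-L}\, \tilde f(x), \quad s \ge 0,\ x \in \Omega,$

satisfies

(8.19)  $\frac{\partial^2 V}{\partial s^2} - \Delta V = 0$ on $\mathbb{R} \times \Omega$,
        $V(s, x) = 0$ for $s < 0$,
        $V(s, x) = -\chi_{\mathbb{R}^+}(s)\, v(s, x)$ for $x \in \partial\Omega$,

with

(8.20)  $v(s, x) = \cos s\sqrt{-L}\, \tilde f(x).$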

Note that $v \in C^\infty(\mathbb{R} \times M)$, and, parallel to (8.13), we have

(8.21)  $\cos s\sqrt{-L}\, \tilde f(x) = \tilde f(x) + \sum_{k=1}^{N} \frac{s^{2k}}{(2k)!} L^k \tilde f(x) + R_N(s, x),$

with an estimate like (8.14) on the remainder. Note that the boundary value
imposed on $V$ in (8.19) is piecewise smooth, with a jump across $\{s = 0\}$. Hence,
if $a > 0$ is picked small enough, the progressing wave expansion discussed in
§6 is applicable to the description of $V(s, x)$ for $s \in [0, a]$, $x \in \overline{\Omega}$. Also, finite
propagation speed guarantees that for $s \ge 0$, $x \in \overline{\Omega}$,

(8.22)  $V(s, x) = 0$ for $\varphi(x) > s$,

where

(8.23)  $\varphi(x) = \mathrm{dist}(x, \partial\Omega).$

Pick $a > 0$ so small that

(8.24)  $\mathcal{C} = \{x \in \overline{\Omega} : \varphi(x) \le a\} \Longrightarrow \varphi \in C^\infty(\mathcal{C}),$

and use this value of $a$ to pick $\psi_1$ and $\psi_2$ in (8.17). Then, for $s \in [0, a]$, $V(s, x)$
is given by a progressing wave expansion of the form

(8.25)  $V(s, x) \sim \sum_{j \ge 0} a_j(s, x)(s - \varphi(x))_+^j,$

with coefficients $a_j \in C^\infty([0, a] \times \overline{\Omega})$, determined by certain transport equations.
The meaning of (8.25) is that for each $N \in \mathbb{N}$,

(8.26)  $V(s, x) = \sum_{j=0}^{N} a_j(s, x)(s - \varphi(x))_+^j + R_N(s, x),$

where

(8.27)  $R_N(s, x) = 0$ for $\varphi(x) > s$, $\quad R_N \in C^N([0, a] \times \overline{\Omega}).$
Writing 

(8.28) ag(s,@) = ao(p(x), ©) + 1(s,x)(s — v(x), 


we can shift the latter term onto the $j = 1$ term in (8.26). Continuing this process,
we have

(8.29)  $V(s, x) = \sum_{j=0}^{N} b_j(x)(s - \varphi(x))_+^j + R_N(s, x),$

(with slightly altered $R_N$, still satisfying (8.27)), valid on $[0, a] \times \overline{\Omega}$, with
$b_j \in C^\infty(\overline{\Omega})$. Inserting this into the formula (8.17), we have (modulo a negligible
contribution)


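(8.30)  $e^{t\Delta} f(x) - e^{tL} \tilde f(x) = \sum_{j=0}^{N} \frac{b_j(x)}{\sqrt{\pi t}} \int_0^\infty e^{-s^2/4t} (s - \varphi(x))_+^j\, \psi_1(s)\, ds + \frac{1}{\sqrt{\pi t}} \int_0^\infty e^{-s^2/4t} R_N(s, x)\, \psi_1(s)\, ds.$

Elementary estimates show that

(8.31)  $\frac{1}{\sqrt{\pi t}} \int_0^\infty e^{-s^2/4t} (s - \varphi(x))_+^j (1 - \psi_1(s))\, ds$

is rapidly decreasing as $t \searrow 0$, together with all $x$-derivatives, so the sum over
$0 \le j \le N$ in (8.30) has the identical asymptotic behavior as $t \searrow 0$ as does

(8.32)  $\sum_{j=0}^{N} b_j(x)\, W_j(t, x), \qquad W_j(t, x) = \frac{1}{\sqrt{\pi t}} \int_0^\infty e^{-s^2/4t} (s - \varphi(x))_+^j\, ds.$

A change of variable gives

(8.33)  $W_j(t, x) = 2 (4t)^{j/2} E_j\Bigl(\frac{\varphi(x)}{\sqrt{4t}}\Bigr),$

where

(8.34)  $E_j(y) = \frac{1}{\sqrt{\pi}} \int_y^\infty e^{-s^2} (s - y)^j\, ds = \frac{1}{\sqrt{\pi}} \int_0^\infty e^{-s^2 - 2sy - y^2} s^j\, ds.$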
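
The boundary layer profiles in (8.34) are easy to tabulate numerically. The following Python sketch (an illustration, not taken from the text) assumes the form $E_j(y) = \pi^{-1/2} \int_0^\infty e^{-(s+y)^2} s^j\, ds$ for $y \ge 0$ and evaluates it by composite Simpson quadrature on a truncated interval. The function name `E`, the truncation point `s_max`, and the panel count `n` are hypothetical choices. For $j = 0$ the result can be compared with the closed form $E_0(y) = \frac{1}{2} \operatorname{erfc}(y)$, which follows directly from the definition of the complementary error function.

```python
import math


def E(j, y, s_max=8.0, n=4000):
    """Approximate E_j(y) = pi^{-1/2} * int_0^inf exp(-(s+y)^2) * s^j ds.

    Composite Simpson rule on [0, s_max]; n must be even. For y >= 0 the
    integrand is below exp(-64) * s_max^j beyond s_max = 8.
    """
    h = s_max / n

    def g(s):
        return math.exp(-(s + y) ** 2) * s ** j

    total = g(0.0) + g(s_max)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * g(i * h)
    return total * h / 3.0 / math.sqrt(math.pi)
```

For instance, `E(0, 0.0)` approximates $E_0(0) = 1/2$, and `E(1, y)` can be checked against $e^{-y^2}/(2\sqrt{\pi}) - \frac{1}{2} y \operatorname{erfc}(y)$, obtained by one integration by parts.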


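Using (8.27), one easily bounds the last integral in (8.30) by $C\, W_N(t, x)$. Consequently,

(8.35)  $e^{t\Delta} f(x) - e^{tL} \tilde f(x) = 2 \sum_{j=0}^{N} b_j(x)(4t)^{j/2} E_j\Bigl(\frac{\varphi(x)}{\sqrt{4t}}\Bigr) + \tilde R_N(t, x),$

with

(8.36)  $\|\tilde R_N(t, \cdot)\|_{C^0(\overline{\Omega})} \le C t^{N/2}.$

Similar arguments give estimates $\|\tilde R_N(t, \cdot)\|_{C^k(\overline{\Omega})} \le C t^{M/2}$, for each $k, M \in \mathbb{N}$,
if $N$ is large enough.
Putting together (8.13) and (8.35), we obtain our main result: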


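Proposition 8.3. Given $f \in C^\infty(\overline{\Omega})$,

(8.37)  $e^{t\Delta} f(x) = f(x) + \sum_{k=1}^{N} \frac{t^k}{k!} \Delta^k f(x) + 2 \sum_{j=0}^{N} b_j(x)(4t)^{j/2} E_j\Bigl(\frac{\varphi(x)}{\sqrt{4t}}\Bigr) + R_N(t, x),$

where $b_j \in C^\infty(\overline{\Omega})$ are as in (8.29), and, for each $M, k \in \mathbb{N}$, there exists $N$ such
that

(8.38)  $\|R_N(t, \cdot)\|_{C^k(\overline{\Omega})} \le C_{kM}\, t^M, \quad t \in (0, 1].$

Remark: It follows readily from (8.19) to (8.21) and (8.29) that $b_j|_{\partial\Omega} = 0$ when $j$
is odd. Also $b_0|_{\partial\Omega} = -f|_{\partial\Omega}$, and $E_0(0) = 1/2$.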
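
As a sanity check on the shape of the boundary layer term, consider the model case $\Omega = (0, \infty) \subset \mathbb{R}$ with $f \equiv 1$. This half-line is not compact, so the example lies outside the stated hypotheses and is offered only as an illustrative sketch. Here $\varphi(x) = x$, all $\Delta^k f$ vanish, and the method of images gives the Dirichlet heat solution in closed form:

```latex
e^{t\Delta} 1\,(x) = \operatorname{erf}\Bigl(\frac{x}{\sqrt{4t}}\Bigr)
  = 1 - \operatorname{erfc}\Bigl(\frac{x}{\sqrt{4t}}\Bigr)
  = 1 - 2\,E_0\Bigl(\frac{x}{\sqrt{4t}}\Bigr),
\qquad
E_0(y) = \frac{1}{\sqrt{\pi}} \int_y^\infty e^{-s^2}\,ds = \tfrac{1}{2}\operatorname{erfc}(y).
```

Thus the entire correction to $f$ is the single $j = 0$ profile $-2 E_0(\varphi(x)/\sqrt{4t})\, f|_{\partial\Omega}$. At $x = 0$ it equals $-1$, so the solution vanishes on the boundary as the Dirichlet condition requires, and it is negligible once $x \gg \sqrt{t}$.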


The following corollary, which follows by inspection of (8.37), is of indepen- 
dent interest. 


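Corollary 8.4. Given $f \in C^\infty(\overline{\Omega})$, we have

(8.39)  $\|\nabla e^{t\Delta} f\|_{L^1(\Omega)} \le C_f, \quad \forall\, t \in (0, \infty).$

Remark: Such a uniform bound does not hold in any $L^p$-space with $p > 1$, unless
$f|_{\partial\Omega} = 0$.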


Exercises 
1. Suppose $f \in C^\infty(\overline{\Omega})$ and $f|_{\partial\Omega} = 0$. How does that affect the behavior of (8.37)?
Produce an improvement on (8.39) in this case.


2. Establish analogues of the results of this section, with the Dirichlet boundary condition 
replaced by the Neumann boundary condition. Note the differences in the results. 


9. Schrödinger equations on Euclidean space


In this section, we consider Schrödinger equations on $\mathbb{R} \times \mathbb{R}^n$, starting with the
1D case,

(9.1)  $\frac{\partial u}{\partial t} = i \frac{\partial^2 u}{\partial x^2},$

for $t, x \in \mathbb{R}$, with initial condition

(9.2)  $u(0, x) = f(x).$

Note that the partial Fourier transform

(9.3)  $\hat u(t, \xi) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} u(t, x)\, e^{-ix\xi}\, dx$

satisfies

(9.4)  $\partial_t \hat u(t, \xi) = -i\xi^2\, \hat u(t, \xi), \quad \hat u(0, \xi) = \hat f(\xi),$

so

(9.5)  $\hat u(t, \xi) = e^{-it\xi^2} \hat f(\xi),$

and we have

(9.6)  $u(t, x) = \mathcal{F}^* e^{-it\xi^2} \mathcal{F} f(x).$
Since, by the Plancherel theorem, the Fourier transform

(9.7)  $\mathcal{F} : L^2(\mathbb{R}) \longrightarrow L^2(\mathbb{R})$

is bijective and norm preserving (i.e., unitary), with inverse $\mathcal{F}^*$, we see that
(9.5)–(9.6) defines a solution

(9.8)  $u(t, x) = S(t) f(x)$

to (9.1)–(9.2), for $f \in L^2(\mathbb{R})$. Furthermore, for each $t \in \mathbb{R}$,

(9.9)  $S(t) : L^2(\mathbb{R}) \longrightarrow L^2(\mathbb{R})$

is unitary, with inverse

(9.10)  $S(t)^{-1} = S(t)^* = S(-t).$


Relation to heat equation, and integral formula 


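We have defined the solution operator $S(t) : L^2(\mathbb{R}) \to L^2(\mathbb{R})$ to the
Schrödinger equation in (9.6)–(9.8). If $f \in L^2(\mathbb{R})$ and also $\hat f \in L^1(\mathbb{R})$, then
$S(t) f$ is given by the absolutely convergent integral

(9.11)  $S(t) f(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-it\xi^2} \hat f(\xi)\, e^{ix\xi}\, d\xi.$

We relate this to the solution operator

(9.12)  $H(t) f(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-t\xi^2} \hat f(\xi)\, e^{ix\xi}\, d\xi$

for the heat equation

(9.13)  $\frac{\partial u}{\partial t} = \frac{\partial^2 u}{\partial x^2}.$

As seen in §§3.3–3.5, we have, for $t > 0$,

(9.14)  $H(t) f(x) = \int_{-\infty}^{\infty} f(x - y)\, H_t(y)\, dy$

with

(9.15)  $H_t(y) = \frac{1}{2\pi} \int_{-\infty}^{\infty} e^{-t\xi^2 + iy\xi}\, d\xi = (4\pi t)^{-1/2} e^{-y^2/4t}, \quad t > 0.$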

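Now, we can extend (9.12) and (9.14)–(9.15), from $t \in (0, \infty)$ to complex $t$ with
positive real part. Let us denote such a complex number by $s + it$, $s > 0$, $t \in \mathbb{R}$.
We have

(9.16)  $H(s + it) f(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-(s+it)\xi^2} \hat f(\xi)\, e^{ix\xi}\, d\xi = \int_{-\infty}^{\infty} f(x - y)\, H_{s+it}(y)\, dy,$

with

(9.17)  $H_{s+it}(y) = [4\pi(s + it)]^{-1/2} e^{-y^2/4(s+it)}, \quad s > 0,\ t \in \mathbb{R}.$

The Fourier integral representation (9.16), with the Plancherel theorem, gives

(9.18)  $H(s + it) : L^2(\mathbb{R}) \longrightarrow L^2(\mathbb{R}), \quad \|H(s + it)\|_{\mathcal{L}(L^2)} = 1, \quad s > 0,\ t \in \mathbb{R},$

and furthermore, for $f \in L^2(\mathbb{R})$,

(9.19)  $S(t) f = \lim_{s \searrow 0} H(s + it) f$, in $L^2$-norm.

Comparison with (9.11) shows that, if $f \in L^2(\mathbb{R})$ and also $\hat f \in L^1(\mathbb{R})$ (so
$f \in C(\mathbb{R})$), then, for each $t \in \mathbb{R}$,

(9.20)  $H(s + it) f(x) \longrightarrow S(t) f(x)$, uniformly in $x$, as $s \searrow 0$.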


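If, in addition, $f \in L^1(\mathbb{R})$, then we can pass to the limit $s \searrow 0$ and write, for each
$t \in \mathbb{R} \setminus 0$,

(9.21)  $S(t) f(x) = \int_{-\infty}^{\infty} f(x - y)\, S_t(y)\, dy,$

with

(9.22)  $S_t(y) = (4\pi i t)^{-1/2} e^{-y^2/4it}.$

Here

(9.23)  $(0 + it)^{-1/2} = \lim_{s \searrow 0} (s + it)^{-1/2} = \begin{cases} e^{-\pi i/4} |t|^{-1/2}, & \text{if } t > 0, \\ e^{\pi i/4} |t|^{-1/2}, & \text{if } t < 0. \end{cases}$

Note that

(9.24)  $e^{\pm \pi i/4} = \frac{1 \pm i}{\sqrt{2}}.$

Having this formula for $S_t(y)$, we can readily extend $S(t)$ to act on $f \in L^1(\mathbb{R})$,
for $t \ne 0$, obtaining

(9.25)  $S(t) : L^1(\mathbb{R}) \longrightarrow L^\infty(\mathbb{R}), \quad \|S(t) f\|_{L^\infty} \le \frac{1}{\sqrt{4\pi |t|}} \|f\|_{L^1}.$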


The Fresnel integral 


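The Fresnel integral arises in the study of $S(t)\chi_{a,b}$, where, for $a, b \in \mathbb{R}$, $a < b$,
we set

(9.26)  $\chi_{a,b}(x) = \begin{cases} 1, & \text{if } a \le x \le b, \\ 0, & \text{otherwise.} \end{cases}$

For simplicity, we take $t > 0$, though we note that, if $f \in L^1(\mathbb{R})$ or $L^2(\mathbb{R})$,

(9.27)  $f$ real valued $\Longrightarrow S(-t) f = \overline{S(t) f}.$

By (9.21)–(9.23), we have

(9.28)  $S(t)\chi_{a,b}(x) = \frac{e^{-\pi i/4}}{\sqrt{4\pi t}} \int_{x-b}^{x-a} e^{iy^2/4t}\, dy.$

We are hence motivated to look at

(9.29)  $\frac{e^{-\pi i/4}}{\sqrt{4\pi t}} \int_0^x e^{iy^2/4t}\, dy = \mathrm{Fr}\Bigl(\frac{x}{\sqrt{4t}}\Bigr),$

where we bring in the Fresnel integral

(9.30)  $\mathrm{Fr}(x) = \frac{e^{-\pi i/4}}{\sqrt{\pi}} \int_0^x e^{iy^2}\, dy.$

Using this special function, we have, for $t > 0$,

(9.31)  $S(t)\chi_{a,b}(x) = \mathrm{Fr}\Bigl(\frac{x - a}{\sqrt{4t}}\Bigr) - \mathrm{Fr}\Bigl(\frac{x - b}{\sqrt{4t}}\Bigr).$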

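We are now motivated to study the special function $\mathrm{Fr}(x)$, defined by (9.30).
Clearly

(9.32)  $\mathrm{Fr} \in C^\infty(\mathbb{R}), \quad \mathrm{Fr}(-x) = -\mathrm{Fr}(x).$

We will show below that

(9.33)  $\lim_{x \to \pm\infty} \mathrm{Fr}(x) = \pm\frac{1}{2}.$

In particular, $\mathrm{Fr}$ is bounded,

(9.34)  $|\mathrm{Fr}(x)| \le A < \infty, \quad \forall\, x \in \mathbb{R},$

for some $A < \infty$. To get started, note the identity

(9.35)  $\partial_y \Bigl(\frac{1}{y} e^{iy^2}\Bigr) = 2i\, e^{iy^2} - \frac{1}{y^2} e^{iy^2},$

or equivalently

(9.36)  $e^{iy^2} = \frac{1}{2i}\, \partial_y \Bigl(\frac{1}{y} e^{iy^2}\Bigr) + \frac{1}{2i} \frac{1}{y^2} e^{iy^2},$

which gives, for $0 < x < R < \infty$,

(9.37)  $\int_x^R e^{iy^2}\, dy = \frac{1}{2i} \Bigl(\frac{e^{iR^2}}{R} - \frac{e^{ix^2}}{x}\Bigr) + \frac{1}{2i} \int_x^R \frac{1}{y^2} e^{iy^2}\, dy.$

Hence, for each $x > 0$,

(9.38)  $\lim_{R \to \infty} \mathrm{Fr}(R) = \mathrm{Fr}(x) + \frac{1}{2i} \frac{e^{-\pi i/4}}{\sqrt{\pi}} \Bigl\{ -\frac{e^{ix^2}}{x} + \int_x^\infty \frac{1}{y^2} e^{iy^2}\, dy \Bigr\}.$

This shows that the limits on the left side of (9.33) exist. It remains to identify
them.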
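For this evaluation, we look at

(9.39)  $I(a) = \int_0^\infty e^{-ay^2}\, dy = \frac{\sqrt{\pi}}{2}\, a^{-1/2},$

valid for $a > 0$ via the change of variable $t = a^{1/2} y$. Now both the integral
defining $I(a)$ and $a^{-1/2}$ are holomorphic in $\{a \in \mathbb{C} : \mathrm{Re}\, a > 0\}$, so this identity
holds in the right half plane. Hence we have

(9.40)  $\int_0^\infty e^{(i - \varepsilon) y^2}\, dy = I(\varepsilon - i) = \frac{\sqrt{\pi}}{2} (\varepsilon - i)^{-1/2} \longrightarrow \frac{\sqrt{\pi}}{2}\, e^{\pi i/4}$, as $\varepsilon \searrow 0$.

We also know that

(9.41)  $\int_0^R e^{iy^2}\, dy \longrightarrow L$, as $R \to \infty$,

for some $L \in \mathbb{C}$. The result (9.33) follows from the fact that, if (9.41) holds,

(9.42)  $\lim_{\varepsilon \searrow 0} \int_0^\infty e^{-\varepsilon y^2} e^{iy^2}\, dy = L.$

This implication is known as an Abelian theorem, and is established below.
Here is a variant of (9.31), which applies to a general class of initial data.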
Here is a variant of (9.31), which applies to a general class of initial data. 


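Proposition 9.1. If $f \in C_0^1(\mathbb{R})$, then, for $t > 0$,

(9.43)  $S(t) f(x) = \int_{-\infty}^{\infty} \mathrm{Fr}\Bigl(\frac{y}{\sqrt{4t}}\Bigr) f'(x - y)\, dy,$

with a similar formula for $t < 0$.

Proof. Denote the right side of (9.43) by $v(t, x)$. Integration by parts gives

(9.44)  $v(t, x) = \int_{-\infty}^{\infty} \partial_y \mathrm{Fr}\Bigl(\frac{y}{\sqrt{4t}}\Bigr) f(x - y)\, dy.$

But

(9.45)  $\partial_y \mathrm{Fr}\Bigl(\frac{y}{\sqrt{4t}}\Bigr) = \frac{1}{\sqrt{4t}}\, \mathrm{Fr}'\Bigl(\frac{y}{\sqrt{4t}}\Bigr) = \frac{e^{-\pi i/4}}{\sqrt{4\pi t}}\, e^{iy^2/4t} = S_t(y),$

the second identity by (9.30), and the third by (9.22). Then (9.21) yields

(9.46)  $v(t, x) = S(t) f(x),$

as asserted.

We then have the following counterpart to (9.25).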
Corollary 9.2. Given $f \in C_0^1(\mathbb{R})$, $A$ as in (9.34),

(9.47)  $\|S(t) f\|_{L^\infty} \le A \|f'\|_{L^1}.$

We next give a second proof of (9.33), taking off from (9.38), which shows that

(9.48)  $\lim_{x \to \pm\infty} \mathrm{Fr}(x) = \pm B,$

for some $B \in \mathbb{C}$. We seek another proof that $B = 1/2$, not relying on the Abelian
theorem used above. Our reasoning proceeds as follows. From (9.31) and (9.48),
we know that

(9.49)  $S(t)\chi_{a,b}(x) \longrightarrow 2B$, uniformly in $x \in [a + \varepsilon, b - \varepsilon]$,

as $t \searrow 0$, for each $\varepsilon > 0$. On the other hand, we know that

(9.50)  $S(t)\chi_{a,b} \longrightarrow \chi_{a,b}$, in $L^2$-norm,

as $t \to 0$. Comparison of (9.49) and (9.50) forces $B = 1/2$.


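REMARK. Given $B = 1/2$, we can rewrite (9.38) as

(9.51)  $\mathrm{Fr}(x) = \frac{1}{2} - \frac{1}{2i} \frac{e^{-\pi i/4}}{\sqrt{\pi}} \Bigl\{ -\frac{e^{ix^2}}{x} + \int_x^\infty \frac{1}{y^2} e^{iy^2}\, dy \Bigr\},$

for $x > 0$. Going further, we can write

(9.52)  $\int_x^\infty \frac{1}{y^2} e^{iy^2}\, dy = \frac{1}{2i} \Bigl\{ -\frac{e^{ix^2}}{x^3} + 3 \int_x^\infty \frac{1}{y^4} e^{iy^2}\, dy \Bigr\},$

and proceed inductively to derive a complete asymptotic expansion, as $x \to \infty$,
of $\mathrm{Fr}(x)$.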
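
The limit (9.33) and the leading correction in (9.51) can be checked numerically. The Python sketch below is an illustration under stated assumptions: it takes the normalization $\mathrm{Fr}(x) = e^{-\pi i/4} \pi^{-1/2} \int_0^x e^{iy^2}\, dy$, which is the one consistent with the limit $\pm 1/2$ in (9.33), and evaluates the integral by composite Simpson quadrature. The function names `fresnel_fr` and `fr_asymptotic` and the panel count `n` are hypothetical.

```python
import cmath
import math


def fresnel_fr(x, n=20000):
    """Fr(x) = e^{-i pi/4} / sqrt(pi) * int_0^x e^{i y^2} dy, composite Simpson (n even)."""
    h = x / n  # negative for x < 0, which reproduces Fr(-x) = -Fr(x)
    total = 1.0 + cmath.exp(1j * x * x)
    for k in range(1, n):
        y = k * h
        total += (4.0 if k % 2 else 2.0) * cmath.exp(1j * y * y)
    return cmath.exp(-0.25j * math.pi) / math.sqrt(math.pi) * total * h / 3.0


def fr_asymptotic(x):
    """Leading terms of (9.51) for x > 0: 1/2 + e^{-i pi/4} e^{i x^2} / (2 i sqrt(pi) x)."""
    phase = cmath.exp(-0.25j * math.pi) * cmath.exp(1j * x * x)
    return 0.5 + phase / (2j * math.sqrt(math.pi) * x)
```

Comparing `fresnel_fr(x)` with `fr_asymptotic(x)` for moderately large $x$ shows a discrepancy of order $x^{-3}$, in line with the next term produced by (9.52).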


An Abelian theorem 


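The following result justifies passing from (9.41) to (9.42), used to prove
(9.33). For more general Abelian theorems, see Appendix A.5 of [Tay4].

Proposition 9.3. Let $f : \mathbb{R}^+ \to \mathbb{C}$ be bounded and continuous. Assume

(9.53)  $\lim_{R \to \infty} \int_0^R f(t)\, dt = L.$

Then

(9.54)  $\lim_{\varepsilon \searrow 0} \int_0^\infty e^{-\varepsilon t} f(t)\, dt = L.$

Proof. Set $g(s) = \int_0^s f(t)\, dt$, so $g(R) \to L$ as $R \to \infty$. Then, for $\varepsilon > 0$,

(9.55)  $\int_0^\infty e^{-\varepsilon t} f(t)\, dt = \varepsilon \int_0^\infty e^{-\varepsilon t} g(t)\, dt.$

Pick $\delta > 0$ and then take $K < \infty$ such that

(9.56)  $t \ge K \Longrightarrow |g(t) - L| \le \delta.$

Then

(9.57)  $\varepsilon \int_0^\infty e^{-\varepsilon t} |g(t) - L|\, dt \le \varepsilon \int_0^K e^{-\varepsilon t} |g(t) - L|\, dt + \delta \varepsilon \int_K^\infty e^{-\varepsilon t}\, dt \le \Bigl( \sup_{t \le K} |g(t) - L| \Bigr) K \varepsilon + \delta.$

Hence

(9.58)  $\limsup_{\varepsilon \searrow 0} \Bigl| \int_0^\infty e^{-\varepsilon t} f(t)\, dt - L \Bigr| \le \delta, \quad \forall\, \delta > 0,$

and we have (9.54).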
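
A concrete instance of Proposition 9.3 (an illustrative example, not taken from the text) is $f(t) = (\sin t)/t$, extended by $f(0) = 1$, which is bounded and continuous on $\mathbb{R}^+$. Here both sides of the implication can be evaluated in closed form:

```latex
\int_0^\infty e^{-\varepsilon t}\, \frac{\sin t}{t}\, dt = \arctan\frac{1}{\varepsilon}
  = \frac{\pi}{2} - \arctan\varepsilon \;\longrightarrow\; \frac{\pi}{2}
  \quad (\varepsilon \searrow 0),
\qquad
\lim_{R \to \infty} \int_0^R \frac{\sin t}{t}\, dt = \frac{\pi}{2}.
```

So $L = \pi/2$ in both (9.53) and (9.54). The integral in (9.53) converges only conditionally in this example, which is exactly the situation the proposition is designed to handle.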
Higher dimensional results 


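Higher dimensional Schrödinger equations include

(9.59)  $\frac{\partial u}{\partial t} = i\Delta u, \quad u(0) = f,$

where $\Delta = \partial_1^2 + \cdots + \partial_n^2$. More generally, we can consider

(9.60)  $\frac{\partial u}{\partial t} = iLu, \quad u(0) = f,$

where

(9.61)  $L = \sum_{k=1}^{n} b_k \partial_k^2, \quad b_k \in \mathbb{R} \setminus 0.$

We can get the fundamental solution as a product,

(9.62)  $e^{itL} \delta(x) = e^{itb_1\partial_1^2} \delta(x_1) \cdots e^{itb_n\partial_n^2} \delta(x_n),$

where, by (9.22),

(9.63)  $e^{itb_k\partial_k^2} \delta(x_k) = \lim_{s \searrow 0} (s + 4\pi i b_k t)^{-1/2} e^{-x_k^2/4ib_k t}.$

Consequently,

(9.64)  $e^{itL} \delta(x) = \lim_{s \searrow 0} \prod_{k=1}^{n} (s + 4\pi i b_k t)^{-1/2} e^{-x_k^2/4ib_k t}.$

We have a compact formula if we define $B = B^t \in M(n, \mathbb{R})$ so

(9.65)  $B\xi \cdot \xi = \sum_k b_k \xi_k^2, \quad A = (4B)^{-1}.$

Then (9.64) becomes

(9.66)  $e^{itL} \delta(x) = \lim_{s \searrow 0} \det(4\pi i Bt + s)^{-1/2} e^{iAx \cdot x/t}.$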

Going further, we can consider

(9.67)  $e^{itP(D)}, \quad P(D) = -\sum_{j,k} b_{jk} D_j D_k, \quad (b_{jk}) = B = B^t$, invertible.

Switching to an orthonormal basis of $\mathbb{R}^n$ in which $P(D)$ takes the form (9.61),
we have the following.

Proposition 9.4. For $P(D)$ given by (9.67), we have

(9.68)  $e^{itP(D)} \delta(x) = \lim_{s \searrow 0} \bigl(\det(4\pi i Bt + s)\bigr)^{-1/2} e^{iAx \cdot x/t},$

with $A = (4B)^{-1}$.


Since

(9.69)  $\mathcal{F} e^{itP(D)} f(\xi) = e^{itP(\xi)} \hat f(\xi),$

we have

(9.70)  $\|e^{itP(D)} f\|_{L^2} = \|f\|_{L^2}, \quad \forall\, t \in \mathbb{R},$

and

(9.71)  $f \in L^2(\mathbb{R}^n) \Longrightarrow e^{itP(D)} f \to f$ in $L^2$-norm,

as $t \to 0$. Similarly,

(9.72)  $f \in H^s(\mathbb{R}^n) \Longrightarrow e^{itP(D)} f \to f$ in $H^s(\mathbb{R}^n)$,

and

(9.73)  $f \in \mathcal{S}(\mathbb{R}^n) \Longrightarrow e^{itP(D)} f \to f$ in $\mathcal{S}(\mathbb{R}^n)$,

as $t \to 0$. Furthermore, for $u(t, x) = e^{itP(D)} f(x)$,

(9.74)  $f \in \mathcal{S}(\mathbb{R}^n) \Longrightarrow u \in C^\infty(\mathbb{R} \times \mathbb{R}^n).$

This result, in concert with Proposition 9.4, will be of use in the derivation of the
stationary phase method, in Appendix B of this chapter.
In case $P(D) = \Delta$, we have

(9.75)  $e^{it\Delta} \delta(x) = \lim_{s \searrow 0} (4\pi i t + s)^{-n/2} e^{i|x|^2/4t}.$

Parallel to the 1D case, one can take initial data $f$ to be piecewise smooth, with
jumps across smooth surfaces, and consider the behavior of the solution to (9.59)
as $t \to 0$. We refer to [Tay2]–[Tay3] for results on this. (These papers also consider
nonlinear Schrödinger equations.)


A. Some Banach spaces of harmonic functions 


If $B$ is the unit ball in $\mathbb{R}^k$, consider the space $\mathfrak{X}_j$ of harmonic functions $f$ on $B$
such that

(A.1)  $N_j(f) = \sup_{x \in B} \delta(x)^j |f(x)|$

is finite, where $\delta(x) = 1 - |x|$ is the distance of $x$ from $\partial B$. In case $k = 2n$
and we identify $\mathbb{R}^{2n}$ with $\mathbb{C}^n$, via $z_\ell = x_\ell + i x_{n+\ell}$, the space $\mathfrak{H}_j$ of holomorphic
functions on $B$ such that (A.1) is finite is a closed, linear subspace of $\mathfrak{X}_j$. For
results in §3, it is useful both to know that

(A.2)  $\frac{\partial}{\partial z_\ell} : \mathfrak{H}_j \longrightarrow \mathfrak{H}_{j+1}$

and to estimate its norm. It is just as convenient to estimate the norm of

(A.3)  $\partial_\ell : \mathfrak{X}_j \longrightarrow \mathfrak{X}_{j+1},$

where $\partial_\ell = \partial/\partial x_\ell$; then the desired estimate on (A.2) will follow from that on
(A.3).

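Given $x \in B$, let $B_\rho(x)$ be the ball of radius $\rho$ centered at $x$; take $\rho \in (0, \delta(x))$.
Then, as a consequence of the Poisson integral formula for functions harmonic on
a ball (see (3.86) of Chap. 5), we have

(A.4)  $\partial_\ell u(x) = \frac{k}{\rho^2}\, \mathrm{Avg}_{\partial B_\rho(x)} \{(y_\ell - x_\ell) u(y)\}$

if $u$ is harmonic on $B$. Now, for $y \in \partial B_\rho(x)$, $|y_\ell - x_\ell| \le \rho$; furthermore,
$\delta(y) \ge \delta(x) - \rho$. If we take $\rho = \beta\delta(x)$, $\beta \in (0, 1)$, we obtain

(A.5)  $|\partial_\ell u(x)| \le \frac{k}{\rho^2} \cdot \rho \cdot [(1 - \beta)\delta(x)]^{-j} N_j(u) = \frac{k}{\beta} (1 - \beta)^{-j}\, \delta(x)^{-j-1} N_j(u),$

and hence

(A.6)  $N_{j+1}(\partial_\ell u) \le \frac{k}{\beta} (1 - \beta)^{-j} N_j(u),$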

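for $u \in \mathfrak{X}_j$. The factor on the right is minimized at $\beta = 1/(j + 1)$. Using the
power series expansion of $\log(1 - \varepsilon)$, one readily verifies that

   $\Bigl(1 - \frac{1}{j + 1}\Bigr)^{-j} \le e,$

so, for all $j \ge 0$, $u \in \mathfrak{X}_j$,

(A.7)  $N_{j+1}(\partial_\ell u) \le \gamma_k (j + 1) N_j(u), \quad \gamma_k = ke.$

Since $\partial/\partial z_\ell = (1/2)(\partial_\ell - i\partial_{n+\ell})$, we also have, for all $j \ge 0$, $u \in \mathfrak{H}_j$,

(A.8)  $N_{j+1}\Bigl(\frac{\partial u}{\partial z_\ell}\Bigr) \le \gamma_{2n} (j + 1) N_j(u).$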
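
The choice $\beta = 1/(j + 1)$ can be verified by elementary calculus. The sketch below assumes only that the factor on the right of (A.6) has the form $c\, \beta^{-1} (1 - \beta)^{-j}$ with a constant $c$ independent of $\beta$, so that minimizing it amounts to maximizing $\beta (1 - \beta)^j$ on $(0, 1)$:

```latex
\frac{d}{d\beta}\Bigl[\beta (1 - \beta)^j\Bigr]
  = (1 - \beta)^{j-1}\bigl(1 - (j + 1)\beta\bigr) = 0
  \iff \beta = \frac{1}{j + 1},
\qquad
\frac{1}{\beta} (1 - \beta)^{-j}\Big|_{\beta = 1/(j+1)}
  = (j + 1)\Bigl(1 + \frac{1}{j}\Bigr)^{j} \le e\,(j + 1) \quad (j \ge 1).
```

The last inequality is the familiar bound $(1 + 1/j)^j \le e$, which is where the linear growth in $j$ comes from.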


Note that repeated application of (A.7) yields

(A.9)  $N_m(D^\alpha u) \le \gamma_k^m\, (m!)\, N_0(u), \quad |\alpha| = m,$

for $u \in \mathfrak{X}_0$. This estimate of course implies the well-known real analyticity of
harmonic functions. In order for such analyticity to follow from (A.7), it is crucial
to have linear dependence in $j$ of the factor on the right side of (A.7). The fact that
we can establish (A.7) and (A.8) in this form also makes it an effective tool in the
proof of the Cauchy–Kowalewsky theorem, in §4.


B. The stationary phase method 


The one-dimensional stationary phase method was derived in §7. Here we discuss
the multidimensional case. If $M$ is a Riemannian manifold, $F \in C_0^\infty(M)$, and
$\psi \in C^\infty(M)$ is real-valued, with only nondegenerate critical points, there is a
formula for the asymptotic behavior of

(B.1)  $I(\tau) = \int F(x)\, e^{i\tau\psi(x)}\, dV(x)$

as $\tau \to \infty$, given by the stationary phase method, which we now derive. First,
using a partition of unity supported on coordinate neighborhoods, we can write
(B.1) as a finite sum of integrals of the form

(B.2)  $J(\tau) = \int f(x)\, e^{i\tau\psi(x)}\, dx,$

where $f \in C_0^\infty(\mathbb{R}^n)$ and $\psi$ has either no critical points on supp $f$ or only one
critical point, located at $x = 0$.


Lemma B.1. If $\psi$ has no critical point on supp $f$, then $J(\tau)$ is rapidly decreasing
as $\tau \to \infty$.

Proof. Cover supp $f$ with open sets on which, by a change of variable, $\psi(x)$
becomes linear, that is, $\psi(x) = \xi \cdot x + c$, $\xi \ne 0$. Then $J(\tau)$ is converted to a sum
of integrals of the form

   $\int f_j(x)\, e^{i\tau\xi \cdot x + i\tau c}\, dx = (2\pi)^{n/2} e^{i\tau c}\, \hat f_j(-\tau\xi),$

with $f_j \in \mathcal{S}(\mathbb{R}^n)$. If $\xi \ne 0$, the rapid decrease as $\tau \to \infty$ is clear.

It remains to consider the case of (B.2) when $\psi(x)$ has a single critical point, at
$x = 0$, which is nondegenerate. In such a case, there exists a coordinate chart
near 0 such that

   $\psi(x) = Ax \cdot x + c,$

where $A$ is the nonsingular, real, symmetric matrix $(A_{jk}) = (1/2)(\partial_j \partial_k \psi(0))$,
and $c \in \mathbb{R}$. We can assume this holds on supp $f$. That this can be done is known
as the Morse lemma; a proof is given in §8 of Appendix C. Thus it remains to
consider

(B.3)  $e^{ic\tau} K(\tau) = e^{ic\tau} \int g(x)\, e^{i\tau Ax \cdot x}\, dx,$

as $\tau \to +\infty$, where $g \in C_0^\infty(\mathbb{R}^n)$. Using a rotation, we could assume
$Ax \cdot x = \sum a_j x_j^2$, where the factors $a_j$ are the eigenvalues of $A$.

Note that if $P(\xi) = B\xi \cdot \xi$, where $B$ is an invertible, symmetric, real matrix,
then

(B.4)  $e^{-itP(D)} \delta(x) = (2\pi)^{-n/2}\, \mathcal{F}(e^{-itP})(x).$

By diagonalizing $B$ and looking at the one-dimensional cases, as seen in §9, we
obtain

(B.5)  $e^{-itP(D)} \delta(x) = \det(4\pi i B)^{-1/2}\, t^{-n/2}\, e^{iAx \cdot x/t}, \quad A = (4B)^{-1},$

for $t > 0$, where the determinant is calculated as

(B.6)  $\lim_{\varepsilon \searrow 0} \det\bigl(4\pi i (B - i\varepsilon)\bigr)^{-1/2},$

using analytic continuation, and the convention that $\det(+4\pi\varepsilon I)^{-1/2} > 0$, for
real $\varepsilon > 0$.

Thus, for $K(\tau)$ in (B.3), we have

(B.7)  $K(t^{-1}) = C(A)\, t^{n/2}\, u(t, 0),$

where $C(A) = \det(4\pi i B)^{1/2}$, $4B = A^{-1}$, and $u(t, x)$ solves a generalized
Schrödinger equation:

(B.8)  $u(t, x) = e^{-itP(D)} g(x).$

Given $g \in C_0^\infty(\mathbb{R}^n)$, we know from material of §9 that

   $u \in C^\infty([0, \infty), \mathcal{S}(\mathbb{R}^n)) \subset C^\infty([0, \infty) \times \mathbb{R}^n).$
Thus we have, for $t \searrow 0$, an expansion

(B.9)  $u(t, 0) \sim \sum_{j \ge 0} a_j t^j,$

with

(B.10)  $a_j = \frac{1}{j!} \Bigl(\frac{\partial}{\partial t}\Bigr)^j u(0, 0) = \frac{(-i)^j}{j!} P(D)^j g(0).$

Consequently, for (B.3) we have

(B.11)  $K(\tau) \sim C \tau^{-n/2} (a_0 + a_1 \tau^{-1} + a_2 \tau^{-2} + \cdots), \quad \tau \to +\infty,$

where $C = C(A)$ is as in (B.7) and the factors $a_j$ are given by (B.10). We can
conclude that $I(\tau)$ in (B.1) is asymptotic to a finite sum of such expansions, under
the hypotheses made on $F(x)$ and $\psi(x)$. Let us summarize what has been established.
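
The expansion (B.11) can be tested on a one-dimensional example with a closed form. The Python sketch below is an illustration under stated assumptions: it takes $n = 1$, $A = 1$ (so $B = 1/4$ and $P(D) = -\frac{1}{4}\partial_x^2$) and the Gaussian amplitude $g(x) = e^{-x^2}$, which is in $\mathcal{S}(\mathbb{R})$ rather than compactly supported, so it lies slightly outside the stated hypothesis on $g$. In that case $K(\tau) = \int e^{-x^2} e^{i\tau x^2}\, dx = \sqrt{\pi/(1 - i\tau)}$, while (B.7) and (B.10) give $C = (\pi i)^{1/2}$, $a_0 = g(0) = 1$, and $a_1 = -i P(D) g(0) = -i/2$. The function names are hypothetical.

```python
import cmath
import math


def K_exact(tau):
    """Closed form of int exp(-x^2) exp(i tau x^2) dx = sqrt(pi / (1 - i tau))."""
    return cmath.sqrt(math.pi / (1 - 1j * tau))


def K_two_term(tau):
    """First two terms of (B.11) for n = 1, A = 1, g(x) = exp(-x^2).

    C = (pi i)^{1/2}, a_0 = 1, a_1 = -i/2, so K ~ C tau^{-1/2} (1 - i/(2 tau)).
    """
    return cmath.sqrt(math.pi * 1j) * tau ** -0.5 * (1 - 0.5j / tau)
```

The relative error of the two-term approximation behaves like $\frac{3}{8} \tau^{-2}$, consistent with the size of the omitted $a_2 \tau^{-2}$ term.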


Proposition B.2. If $F \in C_0^\infty(M)$ and $\psi \in C^\infty(M)$ is real-valued, with only
nondegenerate critical points, at $x_1, \ldots, x_k$, then, as $\tau \to +\infty$, the integral
(B.1) has the asymptotic behavior

(B.12)  $I(\tau) \sim \sum_{j=1}^{k} A_j(\tau)\, \tau^{-n/2}\, e^{i\tau\psi(x_j)}, \qquad A_j(\tau) \sim a_{j0} + a_{j1}\tau^{-1} + a_{j2}\tau^{-2} + \cdots.$

A


Outline of Functional Analysis 


Introduction 


Problems in PDE have provided a major impetus for the development of func- 
tional analysis. Here, we present some basic results, which are useful for the 
development of such subjects as distribution theory and Sobolev spaces, discussed 
in Chaps. 3 and 4; the spectral theory of compact and of unbounded operators, 
applied to elliptic PDE in Chap.5; the theory of Fredholm operators and their 
indices, needed for the study of the Atiyah—Singer index theorem in Chap. 10; and 
the theory of semigroups, of particular value in Chap. 9 on scattering theory, and 
also germane to studies of evolution equations in Chaps. 3 and 6. Indeed, what is 
thought of as the subject of functional analysis naturally encompasses some of the 
development of these chapters as well as the material presented in this appendix. 
One particular case of this is the spectral theory of Chap. 8. In fact, it is there that 
we present a proof of the spectral theorem for general self-adjoint operators. One 
reason for choosing to do it this way is that my favorite approach to the spectral 
theorem uses Fourier analysis, which is not applied in this appendix, though some 
of the exercises make contact with it. Thus in this appendix the spectral theorem 
is proved only for compact operators, an extremely simple special case. On the 
other hand, it is hoped that by the time one gets through the Fourier analysis as 
developed in Chap. 3, the presentation of the general spectral theorem in Chap. 8 
will appear to be very simple too. 


1. Banach spaces 


A Banach space is a complete, normed, linear space. A norm on a linear space V 
is a positive function ||v|| having the properties 
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|av|| = lal] - |/v|| for v € V, a € C (or R), 
(1.1) |v + wll < |lvl] + lel, 


||v|| > 0 unless v = 0. 


The second of these conditions is called the triangle inequality. Given a norm on 
V, there is a distance function $d(u, v) = \|u - v\|$, making V a metric space.
A metric space is a set X, with distance function $d : X \times X \to \mathbb{R}^+$, satisfying

         $d(u, v) = d(v, u)$,
(1.2)    $d(u, v) \le d(u, w) + d(w, v)$,
         $d(u, v) > 0$ unless $u = v$.


A sequence $(v_j)$ is Cauchy provided $d(v_n, v_m) \to 0$ as $m, n \to \infty$; completeness
is the property that any Cauchy sequence converges. Further background on
metric spaces is given in §1 of Appendix B. 

We list some examples of Banach spaces. First, let X be any compact metric
space, that is, a metric space with the property that any sequence $(x_n)$ has a
convergent subsequence. Then $C(X)$, the space of continuous functions on X, is a
Banach space, with norm

(1.3)    $\|u\|_{\sup} = \sup \{ |u(x)| : x \in X \}$.

Also, for any $\alpha \in [0, 1]$, we set

(1.4)    $\mathrm{Lip}^\alpha(X) = \{ u \in C(X) : |u(x) - u(y)| \le C\, d(x, y)^\alpha \text{ for all } x, y \in X \}$.


This is a Banach space, with norm

(1.5)    $\|u\|_\alpha = \|u\|_{\sup} + \sup_{x, y \in X,\ x \ne y} \frac{|u(x) - u(y)|}{d(x, y)^\alpha}$.


$\mathrm{Lip}^0(X) = C(X)$; the space $\mathrm{Lip}^1(X)$ is typically denoted $\mathrm{Lip}(X)$. For
$\alpha \in (0, 1)$, $\mathrm{Lip}^\alpha(X)$ is frequently denoted $C^\alpha(X)$. In all these cases, it is
straightforward to verify the conditions (1.1) on the proposed norms and to establish
completeness.

Related spaces arise when X is specialized to be a compact Riemannian manifold M.
We have $C^k(M)$, the space of functions whose derivatives of order $\le k$
are continuous on M. Norms on $C^k(M)$ can be constructed as follows. Pick
$Z_1, \dots, Z_N$, smooth vector fields on M that span $T_p M$ at each $p \in M$. Then
we can set

(1.6)    $\|u\|_{C^k} = \sum_{\ell \le k} \| Z_{j_1} \cdots Z_{j_\ell} u \|_{\sup}$.




If one replaces the sup norm on the right by the $C^\alpha$-norm (1.5), for some
$\alpha \in (0, 1)$, one has a norm for the Banach space $C^{k,\alpha}(M)$.

More subtle examples of Banach spaces are the $L^p$-spaces, defined as follows.
First take $p = 1$. Let $(X, \mu)$ be a measure space. We say a measurable function f
belongs to $\mathcal{L}^1(X, \mu)$ provided

(1.7)    $\int_X |f(x)| \, d\mu(x) < \infty$.

Elements of $L^1(X, \mu)$ consist of equivalence classes of elements of $\mathcal{L}^1(X, \mu)$,
where we say

(1.8)    $f \sim \tilde{f} \iff f(x) = \tilde{f}(x)$, for $\mu$-almost every x.


With a slight abuse of notation, we denote by f both a measurable function in
$\mathcal{L}^1(X, \mu)$ and its equivalence class in $L^1(X, \mu)$. Also, we say that f, defined only
almost everywhere on X, belongs to $L^1(X, \mu)$ if there exists $\tilde{f} \in \mathcal{L}^1(X, \mu)$ such
that $\tilde{f} = f$ a.e. The norm $\|f\|_{L^1}$ is given by (1.7); it is easy to see that this norm
has the properties (1.1).

The proof of completeness of $L^1(X, \mu)$ makes use of the following key
convergence results in measure theory.


Monotone convergence theorem. If $f_j \in \mathcal{L}^1(X, \mu)$, $0 \le f_1(x) \le f_2(x) \le \cdots$,
and $\|f_j\|_{L^1} \le C < \infty$, then $\lim_{j \to \infty} f_j(x) = f(x)$, with $f \in \mathcal{L}^1(X, \mu)$ and
$\|f_j - f\|_{L^1} \to 0$ as $j \to \infty$.


Dominated convergence theorem. If $f_j \in \mathcal{L}^1(X, \mu)$, $\lim f_j(x) = f(x)$
$\mu$-a.e., and there is an $F \in \mathcal{L}^1(X, \mu)$ such that $|f_j(x)| \le F(x)$ $\mu$-a.e., for
all j, then $f \in \mathcal{L}^1(X, \mu)$ and $\|f_j - f\|_{L^1} \to 0$.


To show that $L^1(X, \mu)$ is complete, suppose $(f_n)$ is Cauchy in $L^1$. Passing to a
subsequence, we can assume $\|f_{n+1} - f_n\|_{L^1} \le 2^{-n}$. Consider the infinite series

(1.9)    $f_1(x) + \sum_{n=1}^{\infty} [f_{n+1}(x) - f_n(x)]$.

Now the partial sums of $\sum [f_{n+1} - f_n]$ are dominated by

         $G_m(x) = \sum_{n=1}^{m} |f_{n+1}(x) - f_n(x)|$,

and since $0 \le G_1 \le G_2 \le \cdots$ and $\|G_m\|_{L^1} \le \sum 2^{-n} \le 1$, we deduce from the
monotone convergence theorem that $G_m \nearrow G$ $\mu$-a.e. and in $L^1$-norm. Hence
the infinite series (1.9) is convergent a.e., to a limit $f(x)$, and via the dominated
convergence theorem we deduce that $f_n \to f$ in $L^1$-norm. This proves
completeness.

Continuing with a description of $L^p$-spaces, we define $\mathcal{L}^\infty(X, \mu)$ to consist
of bounded, measurable functions, $L^\infty(X, \mu)$ to consist of equivalence classes of
such functions, via (1.8), and we define $\|f\|_{L^\infty}$ to be the smallest value of
$\sup |\tilde{f}|$ among all $\tilde{f} \sim f$.
It is easy to show that $L^\infty(X, \mu)$ is a Banach space.

For $p \in (1, \infty)$, we define $\mathcal{L}^p(X, \mu)$ to consist of measurable functions f such
that

(1.10)   $\left[ \int_X |f(x)|^p \, d\mu(x) \right]^{1/p}$

is finite. $L^p(X, \mu)$ consists of equivalence classes, via (1.8), and the $L^p$-norm
$\|f\|_{L^p}$ is given by (1.10). This time it takes a little work to verify the triangle
inequality. That this holds is the content of Minkowski’s inequality:

(1.11)   $\|f + g\|_{L^p} \le \|f\|_{L^p} + \|g\|_{L^p}$.


One neat way to establish this is by the following characterization of the $L^p$-norm.
Suppose p and q are related by

(1.12)   $\frac{1}{p} + \frac{1}{q} = 1$.

We claim that if $f \in L^p(X, \mu)$,

(1.13)   $\|f\|_{L^p} = \sup \{ \|fh\|_{L^1} : h \in L^q(X, \mu),\ \|h\|_{L^q} = 1 \}$.

We can apply (1.13) to $f + g$, which belongs to $L^p(X, \mu)$ if f and g do, since
$|f + g|^p \le 2^p (|f|^p + |g|^p)$. Given this, (1.11) follows easily from the inequality
$\|(f + g)h\|_{L^1} \le \|fh\|_{L^1} + \|gh\|_{L^1}$.

The identity (1.13) can be regarded as two inequalities. The “$\le$” part can be
proven by choosing $h(x)$ to be an appropriate multiple $C |f(x)|^{p-1}$. We leave
this as an exercise. The converse inequality, “$\ge$,” is a consequence of Hölder’s
inequality:

(1.14)   $\int_X |f(x) g(x)| \, d\mu(x) \le \|f\|_{L^p} \|g\|_{L^q}$,   $\frac{1}{p} + \frac{1}{q} = 1$.


Hölder’s inequality can be proved via the following inequality for positive
numbers:

(1.15)   $xy \le \frac{x^p}{p} + \frac{y^q}{q}$,   $x, y > 0$,

assuming that $p \in (1, \infty)$ and (1.12) holds; (1.15) is equivalent to

(1.16)   $x^{1/p} y^{1/q} \le \frac{x}{p} + \frac{y}{q}$,   $x, y > 0$.

Since both sides of this are homogeneous of degree 1 in $(x, y)$, it suffices to
prove it for $y = 1$, that is, to prove that $x^{1/p} \le x/p + 1/q$ for $x \in [0, \infty)$.
Now $\varphi(x) = x^{1/p} - x/p$ can be maximized by elementary calculus; one finds a
unique maximum at $x = 1$, with $\varphi(1) = 1 - 1/p = 1/q$. This establishes (1.16),
hence (1.15). Applying this to the integrand in (1.14) gives


(1.17)   $\int_X |f(x) g(x)| \, d\mu(x) \le \frac{1}{p} \|f\|_{L^p}^p + \frac{1}{q} \|g\|_{L^q}^q$.

This looks weaker than (1.14), but now replace f by $tf$ and g by $t^{-1} g$, so that the
left side of (1.17) is dominated by

         $\frac{t^p}{p} \|f\|_{L^p}^p + \frac{1}{q t^q} \|g\|_{L^q}^q$.

Minimizing over $t \in (0, \infty)$ then gives Hölder’s inequality. Consequently, (1.10)
defines a norm on $L^p(X, \mu)$. Completeness follows as in the $p = 1$ case discussed
above.
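
The chain from (1.15) to (1.14) can be checked numerically in the simplest setting. The sketch below is only an illustration, under the assumption that X is a finite set with counting measure, so that integrals become finite sums; the exponent p = 3, the sample sequences f and g, and all function names are hypothetical choices made for the example. It tests the inequality (1.15) on a grid, tests Hölder’s inequality for one pair (f, g), and minimizes the scaled right side of (1.17) over a grid of t.

```python
def young_gap(x, y, p):
    """Return x**p/p + y**q/q - x*y, which (1.15) says is nonnegative."""
    q = p / (p - 1.0)  # conjugate exponent, 1/p + 1/q = 1
    return x**p / p + y**q / q - x * y

def lp_norm(values, p):
    """l^p norm of a finite sequence (counting measure on a finite set)."""
    return sum(abs(v) ** p for v in values) ** (1.0 / p)

def holder_gap(f, g, p):
    """Return ||f||_p * ||g||_q - sum |f g|, nonnegative by (1.14)."""
    q = p / (p - 1.0)
    lhs = sum(abs(a * b) for a, b in zip(f, g))
    return lp_norm(f, p) * lp_norm(g, q) - lhs

def scaled_bound(f, g, p, t):
    """Right side of (1.17) after replacing f by t*f and g by g/t."""
    q = p / (p - 1.0)
    return (t**p / p) * lp_norm(f, p) ** p + lp_norm(g, q) ** q / (q * t**q)

p = 3.0                          # hypothetical exponent, so q = 3/2
f = [1.0, -2.0, 0.5, 4.0]        # hypothetical sample data
g = [0.3, 1.5, -2.0, 0.25]

grid = [0.1 * k for k in range(1, 60)]
min_young_gap = min(young_gap(x, y, p) for x in grid for y in grid)

# Minimize the scaled bound over a fine grid of t; the minimum should be
# close to ||f||_p * ||g||_q, which is how (1.17) recovers (1.14).
best = min(scaled_bound(f, g, p, 0.001 * k) for k in range(200, 3000))
product = lp_norm(f, p) * lp_norm(g, p / (p - 1.0))

print(min_young_gap >= -1e-12, holder_gap(f, g, p) >= 0.0)
print(abs(best - product) < 1e-3)
```

The grid search over t stands in for the calculus argument; it is not a proof, only a sanity check that the minimum of the scaled bound agrees with the product of the norms.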

We next give a discussion of one important method of manufacturing new
Banach spaces from old. Namely, suppose V is a Banach space, W a closed linear
subspace. Consider the linear space $L = V/W$, with norm

(1.18)   $\| [v] \| = \inf \{ \|v - w\| : w \in W \}$,

where $v \in V$, and $[v]$ denotes its class in $V/W$. It is easy to see that (1.18) defines
a norm on $V/W$. We record a proof of the following.


Proposition 1.1. If V is a Banach space and W is a closed linear subspace, then 
V/W, with norm (1.18), is a Banach space. 


It suffices to prove that $V/W$ is complete. We use the following; compare the
use of (1.9) in the proof of completeness of $L^1(X, \mu)$.


Lemma 1.2. A normed linear space L is complete provided the hypothesis

         $x_j \in L$,   $\sum_{j=1}^{\infty} \|x_j\| < \infty$,

implies that $\sum_{j=1}^{\infty} x_j$ converges in L.

Proof. If $(y_k)$ is Cauchy in L, take a subsequence so that $\|y_{k+1} - y_k\| \le 2^{-k}$,
and consider $y_1 + \sum_{j=1}^{\infty} (y_{j+1} - y_j)$.

To prove Proposition 1.1 now, say $[v_j] \in V/W$, $\sum \| [v_j] \| < \infty$. Then pick
$w_j \in W$ such that $\|v_j - w_j\| \le \| [v_j] \| + 2^{-j}$, to get $\sum_{j=1}^{\infty} \|v_j - w_j\| < \infty$. Hence
$\sum (v_j - w_j)$ converges in V, to a limit v, and it follows that $\sum [v_j]$ converges to
$[v]$ in $V/W$.

Note that if W is a proper closed, linear subspace of V, given $v \in V \setminus W$,
we can pick $w_n \in W$ such that $\|v - w_n\| \to \mathrm{dist}(v, W)$. Normalizing $v - w_n$
produces $v_n \in V$ such that the following holds.


Lemma 1.3. If W is a proper closed, linear subspace of a Banach space V, there
exist $v_n \in V$ such that

(1.19)   $\|v_n\| = 1$,   $\mathrm{dist}(v_n, W) \nearrow 1$.


In Proposition 2.1 we will produce an important sharpening of this for Hilbert 
spaces. For now we remark on the following application. 


Proposition 1.4. If V is an infinite-dimensional Banach space, then the closed
unit ball $B_1 \subset V$ is not compact.


Proof. If $V_j$ is an increasing sequence of subspaces of V, of dimension j, by (1.19) we can
obtain $v_j \in V_j$, $\|v_j\| = 1$, each pair a distance $\ge 1/2$; thus $(v_j)$ has no convergent
subsequence.
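
A concrete instance of such a sequence is provided by the standard unit vectors of $\ell^2$, any two of which are a distance $\sqrt{2}$ apart. The sketch below illustrates this, under the assumption that we look only at the first few coordinates; the truncation dimension and the helper names are hypothetical.

```python
import math

def basis_vector(j, dim):
    """j-th standard basis vector of R^dim, viewed inside l^2."""
    return [1.0 if i == j else 0.0 for i in range(dim)]

def l2_distance(u, v):
    """Euclidean (l^2) distance between two finite sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

dim = 8  # hypothetical truncation; l^2 itself is infinite-dimensional
vectors = [basis_vector(j, dim) for j in range(dim)]

norms = [l2_distance(v, [0.0] * dim) for v in vectors]
pairwise = [l2_distance(vectors[i], vectors[j])
            for i in range(dim) for j in range(i + 1, dim)]

print(norms)
print(min(pairwise), math.sqrt(2.0))
```

Since the pairwise distances stay bounded below, no subsequence of these unit vectors can be Cauchy, in line with the proposition.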


It is frequently useful to show that a certain linear subspace L of a Banach 
space V is dense. We give a few important cases of this here. 


Proposition 1.5. If $\mu$ is a Borel measure on a compact metric space X, then
$C(X)$ is dense in $L^p(X, \mu)$ for each $p \in [1, \infty)$.


Proof. First, let K be any compact subset of X. The functions

(1.20)   $f_{K,n}(x) = [1 + n \, \mathrm{dist}(x, K)]^{-1} \in C(X)$

are all $\le 1$ and decrease monotonically to the characteristic function $\chi_K$ equal to
1 on K, 0 on $X \setminus K$. The monotone convergence theorem gives $f_{K,n} \to \chi_K$ in
$L^p(X, \mu)$ for $1 \le p < \infty$. Now let $A \subset X$ be any measurable set. Any Borel
measure on a compact metric space is regular, that is,

(1.21)   $\mu(A) = \sup \{ \mu(K) : K \subset A,\ K \text{ compact} \}$.

Thus there exists an increasing sequence $K_j$ of compact subsets of A such that
$\mu(A \setminus \bigcup_j K_j) = 0$. Again, the monotone convergence theorem implies $\chi_{K_j} \to \chi_A$
in $L^p(X, \mu)$ for $1 \le p < \infty$. Thus all simple functions on X are in the closure of
$C(X)$ in $L^p(X, \mu)$ for $p \in [1, \infty)$. The construction of $L^p(X, \mu)$ directly shows
that each $f \in L^p(X, \mu)$ is a norm limit of simple functions, so the result is proved.
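
The first step of this proof can be observed numerically. The following sketch is an illustration only, assuming X = [0, 1] with Lebesgue measure, K = [0.4, 0.6], and p = 2; the integral is approximated by a midpoint rule, and the names and grid size are hypothetical choices. It estimates the $L^p$ distance between the continuous functions (1.20) and the characteristic function of K.

```python
def dist_to_interval(x, a, b):
    """Distance from x to the compact set K = [a, b] in R."""
    if x < a:
        return a - x
    if x > b:
        return x - b
    return 0.0

def f_K_n(x, n, a, b):
    """The continuous function (1.20) for K = [a, b]."""
    return 1.0 / (1.0 + n * dist_to_interval(x, a, b))

def lp_error(n, p, a=0.4, b=0.6, cells=20000):
    """Midpoint-rule estimate of the L^p([0,1]) norm of f_{K,n} - chi_K."""
    h = 1.0 / cells
    total = 0.0
    for i in range(cells):
        x = (i + 0.5) * h
        chi = 1.0 if a <= x <= b else 0.0
        total += abs(f_K_n(x, n, a, b) - chi) ** p * h
    return total ** (1.0 / p)

errors = [lp_error(n, p=2.0) for n in (1, 10, 100, 1000)]
print(errors)
```

The printed errors should decrease as n grows, reflecting the monotone convergence used in the proof; the sup-norm distance, by contrast, stays equal to 1.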

This result is easily extended to give the following:


Corollary 1.6. If X is a metric space that is locally compact and a countable
union of compact $X_j$, and $\mu$ is a (locally finite) Borel measure on X, then the
space $C_{00}(X)$ of compactly supported, continuous functions on X is dense in
$L^p(X, \mu)$ for each $p \in [1, \infty)$.


Further extensions, involving more general locally compact spaces, can be 
found in [Lo]. 
The following is known as the Weierstrass approximation theorem. 


Theorem 1.7. If $I = [a, b]$ is an interval in $\mathbb{R}$, the space $\mathcal{P}$ of polynomials in one
variable is dense in $C(I)$.


There are many proofs of this. One close to Weierstrass’s original (and my
favorite) goes as follows. Given $f \in C(I)$, extend it to be continuous and
compactly supported on $\mathbb{R}$; convolve this with a highly peaked Gaussian; and
approximate the result by power series. For a more detailed sketch, in the context of other
useful applications of highly peaked Gaussians, see Exercises 14 and 15 in §3 of
Chap. 3.
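
To see polynomial approximation at work, one can experiment with Bernstein polynomials. Note that this is a different construction from the Gaussian convolution argument sketched above; it is used here only because it is short to code. The sketch assumes I = [0, 1] and takes the hypothetical test function f(x) = |x - 1/2|, which is continuous but not differentiable at the midpoint.

```python
import math

def bernstein(f, n, x):
    """Value at x in [0, 1] of the degree-n Bernstein polynomial of f."""
    return sum(f(k / n) * math.comb(n, k) * x**k * (1.0 - x) ** (n - k)
               for k in range(n + 1))

def sup_error(f, n, samples=201):
    """Largest deviation |B_n f - f| on a uniform grid in [0, 1]."""
    grid = [i / (samples - 1) for i in range(samples)]
    return max(abs(bernstein(f, n, x) - f(x)) for x in grid)

def kink(x):
    """A continuous, non-smooth test function on I = [0, 1]."""
    return abs(x - 0.5)

errors = {n: sup_error(kink, n) for n in (10, 50, 200)}
print(errors)
```

The sampled sup-norm error shrinks as the degree grows, though slowly near the kink; the grid maximum is only a lower bound for the true sup norm of the error.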

The following generalization is known as the Stone-Weierstrass theorem.


Theorem 1.8. Let X be a compact Hausdorff space and $\mathcal{A}$ a subalgebra of
$C_{\mathbb{R}}(X)$, the algebra of real-valued, continuous functions on X. Suppose that
$1 \in \mathcal{A}$ and that $\mathcal{A}$ separates points of X, that is, for distinct $p, q \in X$, there
exists $h_{pq} \in \mathcal{A}$ with $h_{pq}(p) \ne h_{pq}(q)$. Then the closure $\overline{\mathcal{A}}$ is equal to $C_{\mathbb{R}}(X)$.


We sketch a proof of Theorem 1.8, making use of Theorem 1.7, which implies
that if $f \in \overline{\mathcal{A}}$ and $\varphi : \mathbb{R} \to \mathbb{R}$ is continuous, then $\varphi \circ f \in \overline{\mathcal{A}}$. Consequently, if
$f_j \in \overline{\mathcal{A}}$, then $\sup(f_1, f_2) = (1/2)|f_1 - f_2| + (1/2)(f_1 + f_2) \in \overline{\mathcal{A}}$.

The hypothesis of separating points implies that, for distinct $p, q \in X$, there
exists $f_{pq} \in \mathcal{A}$, equal to 1 at p, 0 at q. Applying appropriate $\varphi$, we can arrange
also that $0 \le f_{pq}(x) \le 1$ on X and that $f_{pq}$ is 1 near p and 0 near q. Taking
infima, we can obtain $f_{pU} \in \overline{\mathcal{A}}$, equal to 1 on a neighborhood of p and equal to
0 off a given neighborhood U of p. Applying sups to these, we obtain, for each
compact $K \subset X$ and open $U \supset K$, a function $g_{KU} \in \overline{\mathcal{A}}$ such that $g_{KU}$ is 1 on
K, 0 off U, and $0 \le g_{KU}(x) \le 1$ on X.

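Now, given a continuous u on X satisfying $0 \le u \le 1$, we can set

         $K = \{x \in X : u(x) \ge \tfrac{2}{3}\}$,   $U = \{x \in X : u(x) > \tfrac{1}{3}\}$,   $L = X \setminus U$,

and use the result above to set $g_1 = (1/3) g_{KU} \in \overline{\mathcal{A}}$, so that

         $0 \le u - g_1 \le \tfrac{2}{3}$, on X.

We can apply such reasoning, with u replaced by $u - g_1$, obtaining $g_2 \in \overline{\mathcal{A}}$ such
that

         $0 \le u - g_1 - g_2 \le \left(\tfrac{2}{3}\right)^2$, on X,

and iterate, obtaining $g_j \in \overline{\mathcal{A}}$ such that

         $0 \le u - g_1 - g_2 - \cdots - g_k \le \left(\tfrac{2}{3}\right)^k$, on X.

This yields $u \in \overline{\mathcal{A}}$ whenever $u \in C(X)$ satisfies $0 \le u \le 1$. It is an easy step
from here to see that $u \in C(X) \Rightarrow u \in \overline{\mathcal{A}}$.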
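
For the reader who wants the first step of the iteration spelled out, here is a case-by-case check of the bound on $u - g_1$. It is a sketch under the assumption that the thresholds defining K and U are 2/3 and 1/3, respectively, and that $g_1$ is one third of a function equal to 1 on K, 0 off U, and valued in [0, 1]; the rescaling remark at the end is our own gloss on how the step is repeated.

```latex
% Assumptions: 0 <= u <= 1 on X, K = {u >= 2/3}, U = {u > 1/3},
% g_1 = (1/3) g_{KU}, with g_{KU} = 1 on K, g_{KU} = 0 off U, 0 <= g_{KU} <= 1.
\begin{align*}
x \in K: &\quad g_1(x) = \tfrac13,\ \ \tfrac23 \le u(x) \le 1
   &&\Longrightarrow\ \tfrac13 \le u(x) - g_1(x) \le \tfrac23, \\
x \in U \setminus K: &\quad 0 \le g_1(x) \le \tfrac13,\ \ \tfrac13 < u(x) < \tfrac23
   &&\Longrightarrow\ 0 < u(x) - g_1(x) < \tfrac23, \\
x \in X \setminus U: &\quad g_1(x) = 0,\ \ 0 \le u(x) \le \tfrac13
   &&\Longrightarrow\ 0 \le u(x) - g_1(x) \le \tfrac13.
\end{align*}
% Repeating the step: u_1 = (3/2)(u - g_1) again satisfies 0 <= u_1 <= 1, so the
% same construction gives \tilde g with 0 <= u_1 - \tilde g <= 2/3, and then
% g_2 = (2/3) \tilde g yields 0 <= u - g_1 - g_2 <= (2/3)^2.
```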

Theorem 1.8 has a complex analogue. In that case, we add the assumption that
$f \in \mathcal{A} \Rightarrow \overline{f} \in \mathcal{A}$ and conclude that $\overline{\mathcal{A}} = C(X)$. This is easily reduced to the real
case.


Exercises 


1. Let $\mathcal{E}$ be the subspace of $C(S^1)$ consisting of finite linear combinations of the
exponentials $e^{in\theta}$, $n \in \mathbb{Z}$. Use the Stone-Weierstrass theorem to show that $\mathcal{E}$ is dense in
$C(S^1)$.

2. Show that the space of finite linear combinations of the functions

         $E_\zeta(t) = e^{-\zeta t}$,

as $\zeta$ ranges over $(0, \infty)$, is dense in $C_0(\mathbb{R}^+)$, the space of continuous functions on
$\mathbb{R}^+ = [0, \infty)$, vanishing at infinity. (Hint: Make a slight generalization of the
Stone-Weierstrass theorem.)

3. Given $f \in L^1(\mathbb{R}^+)$, the Laplace transform

         $(\mathcal{L}f)(\zeta) = \int_0^\infty e^{-\zeta t} f(t) \, dt$

is defined and holomorphic for $\mathrm{Re}\, \zeta > 0$. Suppose $(\mathcal{L}f)(\zeta)$ vanishes for $\zeta$ on some
open subset of $(0, \infty)$. Show that $f = 0$, using Exercise 2. (Hint: First show that
$(\mathcal{L}f)(\zeta)$ is identically zero.)

4. Let I be a compact interval, V a Banach space, and $f : I \to V$ a continuous function.
Show that the Riemann integral $\int_I f(x) \, dx$ is well-defined. Formulate and establish the
fundamental theorem of calculus for V-valued functions. Formulate and verify
appropriate basic results on multidimensional integrals of V-valued functions.

5. Let $\Omega \subset \mathbb{C}$ be open, V a (complex) Banach space, and $f : \Omega \to V$. We say f is
holomorphic if it is a $C^1$-map and, for each $z \in \Omega$, $Df(z)$ is $\mathbb{C}$-linear. Establish for
such V-valued holomorphic functions the Cauchy integral theorem, the Cauchy integral
formula, power-series expansions, and the Liouville theorem.


A Banach space V is said to be uniformly convex provided that for each $\varepsilon > 0$, there
exists $\delta > 0$ such that, for $x, y \in V$,

         $\|x\|, \|y\| \le 1$,   $\left\| \tfrac{1}{2}(x + y) \right\| \ge 1 - \delta \Longrightarrow \|x - y\| < \varepsilon$.

6. Show that $L^p(X, \mu)$ is uniformly convex provided $2 \le p < \infty$.
(Hint: Prove and use the fact that, for $a, b \in \mathbb{C}$, $p \in [2, \infty)$,

         $|a + b|^p + |a - b|^p \le 2^{p-1} (|a|^p + |b|^p)$,

so that

         $\|f + g\|_{L^p}^p + \|f - g\|_{L^p}^p \le 2^{p-1} (\|f\|_{L^p}^p + \|g\|_{L^p}^p)$.)

Remark: $L^p(X, \mu)$ is also uniformly convex for $p \in (1, 2)$, but the proof is harder. See
[Kot], pp. 358-359.


2. Hilbert spaces


A Hilbert space is a complete inner-product space. That is to say, first the space
H is a linear space provided with an inner product, denoted $(u, v)$, for u and v in
H, satisfying the following defining conditions:

         $(a u_1 + u_2, v) = a (u_1, v) + (u_2, v)$,
(2.1)    $(u, v) = \overline{(v, u)}$,
         $(u, u) > 0$ unless $u = 0$.


To such an inner product is assigned a norm, by

(2.2)    $\|u\| = \sqrt{(u, u)}$.

To establish that the triangle inequality holds for $\|u + v\|$, we can expand
$\|u + v\|^2 = (u + v, u + v)$ and deduce that this is $\le [\|u\| + \|v\|]^2$ as a consequence
of Cauchy’s inequality:

(2.3)    $|(u, v)| \le \|u\| \cdot \|v\|$,


a result that can be proved as follows. The fact that $(u - v, u - v) \ge 0$ implies
$2\, \mathrm{Re}\, (u, v) \le \|u\|^2 + \|v\|^2$; replacing u by $e^{i\theta} u$ with $e^{i\theta}$ chosen so that $e^{i\theta} (u, v)$
is real and positive, we get

(2.4)    $|(u, v)| \le \tfrac{1}{2} \|u\|^2 + \tfrac{1}{2} \|v\|^2$.

Now in (2.4) we can replace u by $tu$ and v by $t^{-1} v$, to get
$|(u, v)| \le (t^2/2) \|u\|^2 + (1/2t^2) \|v\|^2$; minimizing over t gives (2.3). This establishes Cauchy’s inequality,
so we can deduce the triangle inequality. Thus (2.2) defines a norm, as in §1, and
the notion of completeness is as stated there.
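
The minimization over t in this argument is elementary; for completeness, here is the computation written out. It assumes that u and v are both nonzero (otherwise Cauchy’s inequality is immediate), and the name h for the function being minimized is introduced only for this display.

```latex
% Assumption: u and v are both nonzero.
\begin{align*}
h(t) &= \frac{t^2}{2}\,\|u\|^2 + \frac{1}{2t^2}\,\|v\|^2, \qquad t > 0, \\
h'(t) &= t\,\|u\|^2 - \frac{1}{t^3}\,\|v\|^2 = 0
   \iff t^4 = \frac{\|v\|^2}{\|u\|^2}
   \iff t^2 = \frac{\|v\|}{\|u\|}, \\
h(t)\Big|_{t^2 = \|v\|/\|u\|}
   &= \frac{1}{2}\,\frac{\|v\|}{\|u\|}\,\|u\|^2
    + \frac{1}{2}\,\frac{\|u\|}{\|v\|}\,\|v\|^2
    = \|u\|\,\|v\|.
\end{align*}
```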

Prime examples of Hilbert spaces are the spaces $L^2(X, \mu)$ for a measure space
$(X, \mu)$, that is, the case of $L^p(X, \mu)$ discussed in §1 with $p = 2$. In this case, the
inner product is

(2.5)    $(u, v) = \int_X u(x) \overline{v(x)} \, d\mu(x)$.


The nice properties of Hilbert spaces arise from their similarity with familiar
Euclidean space, so a great deal of geometrical intuition is available. For example,
we say u and v are orthogonal, and write $u \perp v$, provided $(u, v) = 0$. Note that
the Pythagorean theorem holds on a general Hilbert space:

(2.6)    $u \perp v \Longrightarrow \|u + v\|^2 = \|u\|^2 + \|v\|^2$.

This follows directly from expanding $(u + v, u + v)$.
Another useful identity is the following, called the “parallelogram law,” valid
for all $u, v \in H$:

(2.7)    $\|u + v\|^2 + \|u - v\|^2 = 2\|u\|^2 + 2\|v\|^2$.


This also follows directly by expanding $(u + v, u + v) + (u - v, u - v)$, observing
some cancellations. One important application of this simple identity is to the
following existence result.

Let K be any closed, convex subset of H. Convexity implies that
$x, y \in K \Rightarrow (x + y)/2 \in K$. Given $x \in H$, we define the distance from x to K to be

(2.8)    $d = \inf \{ \|x - y\| : y \in K \}$.


Proposition 2.1. If $K \subset H$ is a closed, convex set, there is a unique $z \in K$ such
that $d = \|x - z\|$.


Proof. We can pick $y_n \in K$ such that $\|x - y_n\| \to d$. It will suffice to show that
$(y_n)$ must be a Cauchy sequence. Use (2.7) with $u = y_m - x$, $v = x - y_n$, to get

         $\|y_m - y_n\|^2 = 2\|y_n - x\|^2 + 2\|y_m - x\|^2 - 4 \left\| x - \tfrac{1}{2}(y_n + y_m) \right\|^2$.

Since K is convex, $(1/2)(y_n + y_m) \in K$, so $\|x - (1/2)(y_n + y_m)\| \ge d$. Therefore,

         $\limsup_{m, n \to \infty} \|y_n - y_m\|^2 \le 2d^2 + 2d^2 - 4d^2 \le 0$,

which implies convergence.
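
A finite-dimensional illustration may help fix ideas. The sketch below assumes H is $\mathbb{R}^4$ with the Euclidean inner product and K is the cube $[-1, 1]^4$, for which the closest point is obtained by clamping each coordinate; this explicit formula is special to the cube and is not part of the general argument. The sample point x and the helper names are hypothetical.

```python
import math
import random

def norm(v):
    return math.sqrt(sum(c * c for c in v))

def sub(u, v):
    return [a - b for a, b in zip(u, v)]

def add(u, v):
    return [a + b for a, b in zip(u, v)]

def project_onto_box(x, lo, hi):
    """Closest point to x in K = [lo, hi]^n (coordinatewise clamping)."""
    return [min(max(c, lo), hi) for c in x]

def parallelogram_defect(u, v):
    """|u+v|^2 + |u-v|^2 - 2|u|^2 - 2|v|^2, which (2.7) says is zero."""
    return (norm(add(u, v)) ** 2 + norm(sub(u, v)) ** 2
            - 2 * norm(u) ** 2 - 2 * norm(v) ** 2)

rng = random.Random(0)          # fixed seed so the run is reproducible
x = [2.0, -0.5, 0.3, -3.0]
z = project_onto_box(x, -1.0, 1.0)
d = norm(sub(x, z))

# No sampled point of K should be closer to x than z is.
samples = [[rng.uniform(-1.0, 1.0) for _ in x] for _ in range(2000)]
closest_sample = min(norm(sub(x, y)) for y in samples)

print(z, d, closest_sample)
print(abs(parallelogram_defect(x, z)) < 1e-9)
```

Random sampling cannot establish uniqueness, of course; it only confirms that none of the sampled points of K beats the clamped point.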


In particular, this result applies when K is a closed, linear subspace of H. In
this case, for $x \in H$, denote by $P_K x$ the point in K closest to x. We have

(2.9)    $x = P_K x + (x - P_K x)$.

We claim that $x - P_K x$ belongs to the linear space $K^\perp$, called the orthogonal
complement of K, defined by

(2.10)   $K^\perp = \{ u \in H : (u, v) = 0 \text{ for all } v \in K \}$.

Indeed, take any $v \in K$. Then

         $\Delta(t) = \|x - P_K x + tv\|^2$
                   $= \|x - P_K x\|^2 + 2t \, \mathrm{Re}\, (x - P_K x, v) + t^2 \|v\|^2$

is minimal at $t = 0$, so $\Delta'(0) = 0$ (i.e., $\mathrm{Re}\, (x - P_K x, v) = 0$), for all $v \in K$.
Replacing v by $iv$ shows that $(x - P_K x, v)$ also has vanishing imaginary part for
any $v \in K$, so our claim is established. The decomposition (2.9) gives


(2.11)   $x = x_1 + x_2$,   $x_1 \in K$,   $x_2 \in K^\perp$,

with $x_1 = P_K x$, $x_2 = x - P_K x$. Clearly, such a decomposition is unique. It
implies that H is an orthogonal direct sum of K and $K^\perp$; we write

(2.12)   $H = K \oplus K^\perp$.

From this it is clear that

(2.13)   $(K^\perp)^\perp = K$,

that

(2.14)   $x - P_K x = P_{K^\perp} x$,

and that $P_K$ and $P_{K^\perp}$ are linear maps on H. We call $P_K$ the orthogonal projection
of H on K. Note that $P_K x$ is uniquely characterized by the condition

(2.15)   $P_K x \in K$,   $(P_K x, v) = (x, v)$, for all $v \in K$.

We remark that if K is a linear subspace of H which is not closed, then $K^\perp$
coincides with $\overline{K}^\perp$, and (2.13) becomes $(K^\perp)^\perp = \overline{K}$.

Using the orthogonal projection discussed above, we can establish the follow- 
ing result. 


Proposition 2.2. If $\varphi : H \to \mathbb{C}$ is a continuous, linear map, there exists a unique
$f \in H$ such that

(2.16)   $\varphi(u) = (u, f)$, for all $u \in H$.

Proof. Consider $K = \mathrm{Ker}\, \varphi = \{ u \in H : \varphi(u) = 0 \}$, a closed, linear subspace
of H. If $K = H$, then $\varphi = 0$ and we can take $f = 0$. Otherwise, $K^\perp \ne 0$; select
a nonzero $x_0 \in K^\perp$ such that $\varphi(x_0) = 1$. We claim $K^\perp$ is one-dimensional
in this case. Indeed, given any $y \in K^\perp$, $y - \varphi(y) x_0$ is annihilated by $\varphi$, so it
belongs to K as well as to $K^\perp$, so it is zero. The result is now easily proved by
setting $f = a x_0$ with $a \in \mathbb{C}$ chosen so that (2.16) works for $u = x_0$, namely
$\overline{a} (x_0, x_0) = 1$.


We note that the correspondence $\varphi \mapsto f$ gives a conjugate linear isomorphism

(2.17)   $H' \to H$,

where $H'$ denotes the space of all continuous linear maps $\varphi : H \to \mathbb{C}$.

We now discuss the existence of an orthonormal basis of a Hilbert space H.
A set $\{ e_\alpha : \alpha \in A \}$ is called an orthonormal set if each $\|e_\alpha\| = 1$ and $e_\alpha \perp e_\beta$
for $\alpha \ne \beta$. If $B \subset A$ is any finite set, it is easy to see via (2.15) that, for all $x \in H$,

(2.18)   $P_V x = \sum_{\beta \in B} (x, e_\beta) e_\beta$,   $V = \mathrm{span}\, \{ e_\beta : \beta \in B \}$,

where $P_V$ is the orthogonal projection on V discussed above. Note that

(2.19)   $\sum_{\beta \in B} |(x, e_\beta)|^2 = \|P_V x\|^2 \le \|x\|^2$.

In particular, we have $(x, e_\alpha) \ne 0$ for at most countably many $\alpha \in A$, for any
given x. (Sometimes, A can be an uncountable set.) By (2.19) we also deduce that,
with $c_\alpha = (x, e_\alpha)$, $\sum_{\alpha \in A} |c_\alpha|^2 < \infty$, and $\sum_{\alpha \in A} c_\alpha e_\alpha$ is a convergent series in
the norm topology of H. We can apply (2.15) again to show that

(2.20)   $\sum_{\alpha \in A} (x, e_\alpha) e_\alpha = P_L x$,

where $P_L$ is the orthogonal projection on

(2.21)   $L$ = closure of the linear span of $\{ e_\alpha : \alpha \in A \}$.


We call an orthonormal set $\{ e_\alpha : \alpha \in A \}$ maximal if it is not contained in any
larger orthonormal set. Such a maximal orthonormal set is a basis of H; the term
“basis” is justified by the following result.


Proposition 2.3. An orthonormal set $\{ e_\alpha : \alpha \in A \}$ is maximal if and only if its
linear span is dense in H, that is, if and only if L in (2.21) is all of H. In such a
case, we have, for all $x \in H$,

(2.22)   $x = \sum_{\alpha \in A} c_\alpha e_\alpha$,   $c_\alpha = (x, e_\alpha)$.

The proof of the first assertion is obvious; the identity (2.22) then follows from
(2.20).

The existence of a maximal orthonormal set in any Hilbert space can be
inferred from Zorn’s lemma; cf. [DS] and [RS]. This existence can be established
on elementary logical principles in case H is separable (i.e., has a countable dense
set $\{ y_j : j = 1, 2, 3, \dots \}$). In this case, let $V_n$ be the linear span of $\{ y_j : j \le n \}$,
throwing out any $y_n$ for which $V_n$ is not strictly larger than $V_{n-1}$. Then pick unit
$e_1 \in V_1$, unit $e_2 \in V_2$, orthogonal to $V_1$, and so on, via the Gram-Schmidt
process, and consider the orthonormal set $\{ e_j : j = 1, 2, 3, \dots \}$. The linear span
of $\{ e_j \}$ coincides with that of $\{ y_j \}$, hence is dense in H.
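
The procedure just described, including the step of throwing out redundant vectors, can be sketched in code. The version below is an illustration under the assumption that H is $\mathbb{R}^3$ with the Euclidean inner product; the tolerance used to decide that a vector adds nothing new is a hypothetical numerical choice, and the subtraction is organized in the "modified" Gram-Schmidt order for better floating-point behavior.

```python
import math

def inner(u, v):
    """Euclidean inner product on R^n (a finite-dimensional stand-in for H)."""
    return sum(a * b for a, b in zip(u, v))

def gram_schmidt(vectors, tol=1e-10):
    """Orthonormalize `vectors`, discarding any y_n that lies (numerically)
    in the span of its predecessors, so that V_n = V_{n-1}."""
    basis = []
    for y in vectors:
        r = list(y)
        for e in basis:
            c = inner(r, e)          # component of the remainder along e
            r = [ri - c * ei for ri, ei in zip(r, e)]
        length = math.sqrt(inner(r, r))
        if length > tol:             # y enlarges the span: keep a unit vector
            basis.append([ri / length for ri in r])
    return basis

ys = [
    [1.0, 1.0, 0.0],
    [2.0, 2.0, 0.0],   # multiple of the first vector, gets thrown out
    [1.0, 0.0, 1.0],
    [0.0, 1.0, 1.0],
    [3.0, -1.0, 2.0],  # already in the span of the three kept vectors
]
es = gram_schmidt(ys)
gram = [[inner(a, b) for b in es] for a in es]
print(len(es))
print(gram)
```

The Gram matrix of the output should be the identity up to rounding, which is the numerical counterpart of orthonormality.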

As an example of an orthonormal basis, we mention

(2.23)   $e^{in\theta}$,   $n \in \mathbb{Z}$,

a basis of $L^2(S^1)$ with square norm $\|u\|^2 = (1/2\pi) \int_{S^1} |u(\theta)|^2 \, d\theta$. See Chap. 3,
§3, or the exercises for this section.


Exercises 


1. Let $\mathcal{L}$ be the finite, linear span of the functions $e^{in\theta}$, $n \in \mathbb{Z}$, of (2.23). Use Exercise
1 of §1 to show that $\mathcal{L}$ is dense in $L^2(S^1)$ and hence that these exponentials form an
orthonormal basis of $L^2(S^1)$.

2. Deduce that the Fourier coefficients

(2.24)   $\mathcal{F}f(n) = \hat{f}(n) = \frac{1}{2\pi} \int_{-\pi}^{\pi} f(\theta) e^{-in\theta} \, d\theta$

give a norm-preserving isomorphism

(2.25)   $\mathcal{F} : L^2(S^1) \to \ell^2(\mathbb{Z})$,

where $\ell^2(\mathbb{Z})$ is the set of sequences $(c_n)$, indexed by $\mathbb{Z}$, such that $\sum |c_n|^2 < \infty$.
Compare the approach to Fourier series in Chap. 3, §1.


In the next set of exercises, let $\mu$ and $\nu$ be two finite, positive measures on a space
X, equipped with a $\sigma$-algebra $\mathcal{B}$. Let $\alpha = \mu + 2\nu$ and $\omega = 2\mu + \nu$.

3. On the Hilbert space $H = L^2(X, \alpha)$, consider the linear functional $\varphi : H \to \mathbb{C}$
given by $\varphi(f) = \int_X f(x) \, d\omega(x)$. Show that there exists $g \in L^2(X, \alpha)$ such that
$1/2 \le g(x) \le 2$ and

         $\int_X f(x) \, d\omega(x) = \int_X f(x) g(x) \, d\alpha(x)$.

4. Suppose $\nu$ is absolutely continuous with respect to $\mu$ (i.e., $\mu(S) = 0 \Rightarrow \nu(S) = 0$).
Show that $\{ x \in X : g(x) = 1/2 \}$ has $\mu$-measure zero, that

         $h(x) = \frac{2 - g(x)}{2g(x) - 1} \in L^1(X, \mu)$,

and that, for positive measurable F,

         $\int_X F(x) \, d\nu(x) = \int_X F(x) h(x) \, d\mu(x)$.

5. The conclusion of Exercise 4 is a special case of the Radon-Nikodym theorem, using an
approach due to von Neumann. Deduce the more general case. Allow $\nu$ to be a signed
measure. (You then need the Hahn decomposition of $\nu$.) Cf. [T], Chap. 8.

6. Recall uniform convexity, defined in the exercise set for §1. Show that every Hilbert 
space is uniformly convex. 


3. Fréchet spaces; locally convex spaces


Fréchet spaces form a class more general than Banach spaces. For this structure,
we have a linear space V and a countable family of seminorms $p_j : V \to \mathbb{R}^+$,
where a seminorm $p_j$ satisfies part of (1.1), namely

(3.1)    $p_j(av) = |a| p_j(v)$,   $p_j(v + w) \le p_j(v) + p_j(w)$,

but not necessarily the last hypothesis of (1.1); that is, one is allowed to have
$p_j(v) = 0$ but $v \ne 0$. However, we do assume that

(3.2)    $v \ne 0 \Longrightarrow p_j(v) \ne 0$, for some $p_j$.


Then, if we set

(3.3)    $d(u, v) = \sum_{j=0}^{\infty} 2^{-j} \frac{p_j(u - v)}{1 + p_j(u - v)}$,

we have a distance function. That $d(u, v)$ satisfies the triangle inequality follows
from the next lemma, with $\rho(a) = a/(1 + a)$.


Lemma 3.1. Let $\delta : X \times X \to \mathbb{R}^+$ satisfy

(3.4)    $\delta(x, z) \le \delta(x, y) + \delta(y, z)$,

for all $x, y, z \in X$. Let $\rho : \mathbb{R}^+ \to \mathbb{R}^+$ satisfy

         $\rho(0) = 0$,   $\rho' \ge 0$,   $\rho'' \le 0$,

so that $\rho(a + b) \le \rho(a) + \rho(b)$. Then $\delta_\rho(x, y) = \rho(\delta(x, y))$ also satisfies (3.4).

Proof. We have

         $\rho(\delta(x, z)) \le \rho(\delta(x, y) + \delta(y, z)) \le \rho(\delta(x, y)) + \rho(\delta(y, z))$.
Thus V, with seminorms as above, gets the structure of a metric space. If it is
complete, we call V a Fréchet space. Note that one has convergence $u_n \to u$ in
the metric (3.3) if and only if

(3.5)    $p_j(u_n - u) \to 0$ as $n \to \infty$, for each $p_j$.
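
The construction of the metric from seminorms is easy to try out on a toy example. The sketch below assumes V is a space of finite real sequences and takes the hypothetical seminorms $p_j(u) = |u_j|$ (each of which vanishes on many nonzero vectors, so these are genuine seminorms); the sum in (3.3) then has only finitely many terms. It checks the triangle inequality for one triple and shows the distance to 0 shrinking along a sequence on which every seminorm tends to 0.

```python
def rho(a):
    """rho(a) = a / (1 + a): increasing, concave, rho(0) = 0."""
    return a / (1.0 + a)

def seminorm(j, u):
    """Hypothetical seminorm p_j(u) = |u_j| on finite real sequences."""
    return abs(u[j])

def frechet_distance(u, v):
    """The metric (3.3), with the sum running over the available indices."""
    diff = [a - b for a, b in zip(u, v)]
    return sum(2.0 ** (-j) * rho(seminorm(j, diff)) for j in range(len(diff)))

u = [0.0, 5.0, -2.0, 7.5]
v = [1.0, 4.0, 3.0, -1.0]
w = [-3.0, 0.5, 0.0, 2.0]

triangle_slack = (frechet_distance(u, w) + frechet_distance(w, v)
                  - frechet_distance(u, v))

# Scaling u_n = u / n sends every seminorm to 0, so d(u_n, 0) should shrink.
zero = [0.0] * len(u)
scaled = [frechet_distance([c / n for c in u], zero) for n in (1, 10, 100, 1000)]

print(triangle_slack >= 0.0)
print(scaled)
```

Note that this distance is bounded by $\sum 2^{-j}$ no matter how large the seminorms are, so it is not homogeneous and does not come from a norm.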


A paradigm example of a Fréchet space is $C^\infty(M)$, the space of $C^\infty$-functions
on a compact Riemannian manifold M. Then one can take $p_k(u) = \|u\|_{C^k}$,
defined by (1.6). These seminorms are actually norms, but one encounters real
seminorms in the following situation. Suppose M is a noncompact, smooth manifold,
a union of an increasing sequence $\overline{M}_k$ of compact manifolds with boundary.
Then $C^\infty(M)$ is a Fréchet space with seminorms $p_k(u) = \|u\|_{C^k(\overline{M}_k)}$. Also,
for such M, and for $1 \le p \le \infty$, $L^p_{\mathrm{loc}}(M)$ is a Fréchet space, with seminorms
$p_k(u) = \|u\|_{L^p(\overline{M}_k)}$.

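Another important Fréchet space is the Schwartz space of rapidly decreasing
functions

(3.6)    $\mathcal{S}(\mathbb{R}^n) = \{ u \in C^\infty(\mathbb{R}^n) : |D^\alpha u(x)| \le C_{N\alpha} \langle x \rangle^{-N} \text{ for all } \alpha, N \}$,

with seminorms

(3.7)    $p_k(u) = \sup_{x \in \mathbb{R}^n,\ |\alpha| \le k} \langle x \rangle^k |D^\alpha u(x)|$.

This space is particularly useful for Fourier analysis; see Chap. 3.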

A still more general class is the class of locally convex spaces. Such a space
is a vector space V, equipped with a family of seminorms, satisfying (3.1)-(3.2).
But now we drop the requirement that the family of seminorms be countable, that
is, j ranges over some possibly uncountable set $\mathcal{J}$, rather than a countable set like
$\mathbb{Z}^+$. Thus the construction (3.3) of a metric is not available. Such a space V has
a natural topology, defined as follows. A neighborhood basis of a point $x \in V$ is
given by

(3.8)    $\mathcal{O}(x, \varepsilon, q) = \{ y \in V : q(x - y) < \varepsilon \}$,   $\varepsilon > 0$,


where q runs over finite sums of seminorms $p_j$. Then V is a topological vector
space, that is, with respect to this topology, the vector operations are continuous.
The term “locally convex” arises because the sets (3.8) are all convex. Examples
of such more general, locally convex structures will arise in the next section.

In the non-metrizable case, it is useful to have a generalization of the notion
of a convergent sequence, $x_j \to x$, as $j \to \infty$ in $\mathbb{N}$. More generally, we have the
notion of $x_\sigma \to x$, when $\sigma$ runs over a directed set $\Sigma$, that is, a partially ordered
set with the property that

(3.9)    $\sigma, \tau \in \Sigma \Longrightarrow \exists\, \nu \in \Sigma$ such that $\sigma, \tau \le \nu$.

Given such $\Sigma$, if X is a topological space, we say $x_\sigma \to x$ provided that, for each
neighborhood $\mathcal{O}$ of x, there exists $\nu \in \Sigma$ such that $x_\sigma \in \mathcal{O}$ whenever $\sigma \ge \nu$.

As an example of a directed set, given such X and x as above, we can take $\Sigma$
to be a neighborhood basis of x, partially ordered by (reverse) inclusion.


Exercises 


1. Let E be a Fréchet space, with topology determined by seminorms $p_j$, arranged so that
$p_1 \le p_2 \le \cdots$. Let F be a closed linear subspace. Form the quotient $E/F$. Show that
$E/F$ is a Fréchet space, with seminorms

         $q_j(x) = \inf \{ p_j(y) : y \in E,\ \pi(y) = x \}$,

where $\pi : E \to E/F$ is the natural quotient map. (Hint: Extend the proof of
Proposition 1.1. To begin, if $q_j(a) = 0$ for all j, pick $b_j \in E$ such that $\pi(b_j) = a$
and $p_j(b_j) \le 2^{-j}$; hence $p_j(b_k) \le 2^{-k}$, for $k \ge j$. Consider
$b_1 + (b_2 - b_1) + (b_3 - b_2) + \cdots = b \in E$. Show that $\pi(b) = a$ and that $p_j(b) = 0$ for all j. Once this
is done, proceed to establish completeness.)

2. If V is a Fréchet space, with topology given by seminorms $\{ p_j \}$, a set $S \subset V$ is called
bounded if each $p_j$ is bounded on S. Show that every bounded subset of the Schwartz
space $\mathcal{S}(\mathbb{R}^n)$ is relatively compact. Show that no infinite-dimensional Banach space
can have this property.

3. Let $T : V \to V$ be a continuous, linear map on a locally convex space. Suppose K is a
compact, convex subset of V and $T(K) \subset K$. Show that T has a fixed point in K.
(Hint: Pick any $v_0 \in K$ and set

         $w_n = \frac{1}{n + 1} \sum_{j=0}^{n} T^j v_0 \in K$.

Show that any limit point of $\{ w_n \}$ is a fixed point of T. Note that
$T w_n - w_n = (T^{n+1} v_0 - v_0)/(n + 1)$.)


4. Duality 


Let V be a linear space such as discussed in §§1-3, for example, a Banach space,
or more generally a Fréchet space, or even more generally a Hausdorff topological
vector space. The dual of V, denoted $V'$, consists of continuous, linear maps

(4.1)    $\omega : V \to \mathbb{C}$

($\omega : V \to \mathbb{R}$ if V is a real vector space). Elements $\omega \in V'$ are called linear
functionals on V. Sometimes one finds the following notation for the action of
$\omega \in V'$ on $v \in V$:

(4.2)    $\langle v, \omega \rangle = \omega(v)$.


If V is a Banach space, with norm $\| \cdot \|$, the condition for the map (4.1) to
be continuous is the following: The set of $v \in V$ such that $|\omega(v)| \le 1$ must be
a neighborhood of $0 \in V$. Thus this set must contain a ball
$B_R = \{ v \in V : \|v\| \le R \}$, for some $R > 0$. With $C = 1/R$, it follows that $\omega$ must satisfy

(4.3)    $|\omega(v)| \le C \|v\|$,

for some $C < \infty$. The infimum of the C’s for which this holds is defined to be
$\|\omega\|$; equivalently,

(4.4)    $\|\omega\| = \sup \{ |\omega(v)| : \|v\| \le 1 \}$.

It is easy to verify that $V'$, with this norm, is also a Banach space.
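
In finite dimensions the supremum defining the norm of a functional can be computed by hand. The sketch below is an illustration, assuming V is $\mathbb{R}^3$ with the sup norm and the functional is given by a hypothetical coefficient vector a; since a linear functional on the unit cube attains its maximum at a vertex, it suffices to test the eight sign patterns, and the result should equal the sum of the $|a_i|$.

```python
import itertools

def omega(a, v):
    """The linear functional omega(v) = sum a_i v_i on R^n."""
    return sum(ai * vi for ai, vi in zip(a, v))

def sup_norm(v):
    return max(abs(c) for c in v)

def dual_norm_via_vertices(a):
    """sup{|omega(v)| : sup_norm(v) <= 1}, evaluated on the vertices
    (+-1, ..., +-1) of the unit cube, where a linear functional on the
    cube attains its maximum."""
    n = len(a)
    return max(abs(omega(a, v))
               for v in itertools.product((-1.0, 1.0), repeat=n))

a = [2.0, -0.5, 3.0]             # hypothetical coefficients
norm_omega = dual_norm_via_vertices(a)

# Check the estimate (4.3) with C = ||omega|| on a few hand-picked vectors.
tests = [[0.3, -0.9, 0.1], [1.0, 1.0, 1.0], [-4.0, 2.0, 0.5]]
estimate_holds = all(abs(omega(a, v)) <= norm_omega * sup_norm(v) + 1e-12
                     for v in tests)

print(norm_omega, sum(abs(c) for c in a), estimate_holds)
```

The vertex enumeration grows like $2^n$ and is meant only for small examples; it relies on the unit ball being a polytope, which is special to this choice of norm.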

More generally, let $\omega$ be a continuous, linear functional on a Fréchet space V,
equipped with a family $\{ p_j : j \ge 0 \}$ of seminorms and (complete) metric given by
(3.3). For any $\varepsilon > 0$, there exists $\delta > 0$ such that $d(u, 0) \le \delta$ implies $|\omega(u)| \le \varepsilon$.
Take $\varepsilon = 1$ and the associated $\delta$; pick N so large that $\sum_{j > N} 2^{-j} < \delta/2$. It
follows that $\sum_{j \le N} p_j(u) \le \delta/2$ implies $|\omega(u)| \le 1$. Consequently, we see that the
continuity of $\omega : V \to \mathbb{C}$ is equivalent to the validity of an estimate of the form

(4.5)    $|\omega(u)| \le C \sum_{j \le N} p_j(u)$.

For general Fréchet spaces, there is no simple analogue of (4.4); $V'$ is typically
not a Fréchet space. We will give a further discussion of topologies on $V'$ later in
this section.

Next we consider identification of the duals of some specific Banach spaces
mentioned before. First, if H is a Hilbert space, the inner product produces a
conjugate linear isomorphism of $H'$ with H, as noted in (2.17). We next identify
the dual of $L^p(X, \mu)$.


Proposition 4.1. Let $(X, \mu)$ be a $\sigma$-finite measure space. Let $1 \le p < \infty$. Then
the dual space $L^p(X, \mu)'$, with norm given by (4.4), is naturally isomorphic to
$L^q(X, \mu)$, with $1/p + 1/q = 1$.


Note that Hölder’s inequality and its refinement (1.13) show that there is a
natural inclusion $\iota : L^q(X, \mu) \to L^p(X, \mu)'$, which is an isometry. It remains
to show that $\iota$ is surjective. We sketch a proof in the case when $\mu(X)$ is finite,
from which the general case is easily deduced. If $\omega \in L^p(X, \mu)'$, define a set
function $\nu$ on measurable sets $E \subset X$ by $\nu(E) = \langle \chi_E, \omega \rangle$, where $\chi_E$ is the
characteristic function of E; $\nu$ is readily verified to be countably additive, as long
as $p < \infty$. Furthermore, $\nu$ annihilates sets of $\mu$-measure zero, so the
Radon-Nikodym theorem implies

         $\int f \, d\nu = \int f w \, d\mu$,

for some measurable function w. A variant of the proof of (1.13) gives
$w \in L^q(X, \mu)$, with $\|w\|_{L^q} = \|\omega\|$.

Note that the countable additivity of $\nu$ fails for $p = \infty$; in fact, $L^\infty(X, \mu)'$ can
be identified with the space of finitely additive set functions on the $\sigma$-algebra of
$\mu$-measurable sets that annihilate sets of $\mu$-measure zero.


Remark. In the argument above, you need the Radon-Nikodym theorem for signed
measures. The result of Exercise 4, §2 does not suffice; see Exercise 5 of §2.


The following complement to Proposition 4.1 is one of the fundamental results
of measure theory. For a proof, we refer to [Ru], [Yo], and Chapter 13 of [T].


Proposition 4.2. If X is a compact metric space, $C(X)'$ is isometrically
isomorphic to the space $\mathcal{M}(X)$ of (complex) Borel measures on X, with the total
variation norm.


In fact, the generalization of this to the case where X is a compact Hausdorff
space, not necessarily metrizable, is of interest. In that case, there is a distinction
between the Borel $\sigma$-algebra, generated by all compact subsets of X, and the
Baire $\sigma$-algebra, generated by the compact $G_\delta$ subsets of X. For $\mathcal{M}(X)$ here one
takes the space of Baire measures to give $C(X)'$. It is then an important fact that
each Baire measure has a unique extension to a regular Borel measure. For details,
see [Hal].

If M is a smooth, compact manifold, the dual of the Fréchet space $C^\infty(M)$
is denoted $\mathcal{D}'(M)$ and is called the space of distributions on M. It is discussed
in Chap. 3; also discussed there is the space $\mathcal{S}'(\mathbb{R}^n)$ of tempered distributions on
$\mathbb{R}^n$, the dual of $\mathcal{S}(\mathbb{R}^n)$.

For a Banach space $V$, since $V'$ is a Banach space, one can construct its dual, $V''$.
Note that the action (4.2) produces a natural linear map

(4.6)  $\kappa : V \longrightarrow V'',$

and it is obvious that $\|\kappa(v)\| \le \|v\|$. In fact, $\|\kappa(v)\| = \|v\|$, that is, $\kappa$ is an
isometry. In other words, for any $v \in V$, there exists $\omega \in V'$, $\|\omega\| = 1$, such that
$\omega(v) = \|v\|$. This is a special case of the Hahn-Banach theorem, stated below in
Proposition 4.3.

Sometimes $\kappa$ in (4.6) is surjective, so it gives an isometric isomorphism of $V$
with $V''$. In this case, we say $V$ is reflexive. Clearly, any Hilbert space is reflexive.
Also, in view of Proposition 4.1, we see that $L^p(X,\mu)$ is reflexive, provided
$1 < p < \infty$. On the other hand, $L^1(X,\mu)$ is not reflexive; $L^\infty(X,\mu)'$ is strictly
larger than $L^1(X,\mu)$, except for the trivial cases where $L^1(X,\mu)$ is
finite-dimensional.

We now state the Hahn—Banach theorem, referred to above. It has a fairly gen- 
eral formulation, useful also for Fréchet spaces and more general, locally convex 
spaces. 


Proposition 4.3. Let $V$ be a linear space (real or complex), $W$ a linear subspace.
Let $p$ be a seminorm on $V$. Suppose $\omega$ is a linear functional on $W$ satisfying
$|\omega(v)| \le p(v)$, for $v \in W$. Then there exists an extension of $\omega$ to a linear
functional $\Omega$ on $V$ ($\Omega = \omega$ on $W$), such that $|\Omega(v)| \le p(v)$ for $v \in V$.


Note that in case V is a Hilbert space and p the associated norm, this result 
follows readily from the orthogonal decomposition established in (2.9)—(2.10). 

The key to the proof in general is to show that $\omega$ can be extended to $V$ when
$V$ is spanned by $W$ and one element $z \in V \setminus W$. So one looks for a constant $c$ so
that the prescription $\Omega(v + az) = \omega(v) + ac$ works; $c$ is to be picked so that

(4.7)  $|\omega(v) + ac| \le p(v + az), \quad \text{for } v \in W,\ a \in \mathbb{R}\ (\text{or } \mathbb{C}).$

First consider the case of a real vector space. Then (4.7) holds provided
$\omega(v) + ac \le p(v + az)$, for all $v \in W$, $a \in \mathbb{R}$, or equivalently provided

(4.8)  $c \le a^{-1}\bigl[p(v + az) - \omega(v)\bigr], \qquad -c \le a^{-1}\bigl[p(v - az) - \omega(v)\bigr],$

for $v \in W$, $a > 0$. Such a constant will exist provided

(4.9)  $\sup_{v_1 \in W,\ a_1 > 0} a_1^{-1}\bigl[\omega(v_1) - p(v_1 - a_1 z)\bigr] \le \inf_{v_2 \in W,\ a_2 > 0} a_2^{-1}\bigl[p(v_2 + a_2 z) - \omega(v_2)\bigr].$

Equivalently, for such $v_j$ and $a_j$, one must have

(4.10)  $\omega(a_2 v_1 + a_1 v_2) \le a_1\, p(v_2 + a_2 z) + a_2\, p(v_1 - a_1 z).$

We know that the left side is dominated by

$$p(a_2 v_1 + a_1 v_2) = p(a_2 v_1 - a_2 a_1 z + a_1 a_2 z + a_1 v_2),$$

which, by the subadditivity and positive homogeneity of $p$ applied to
$a_2(v_1 - a_1 z) + a_1(v_2 + a_2 z)$, is dominated by the right side of (4.10). Hence
such a number $c$ exists to make (4.7) work.




A Zorn’s lemma argument will then work to show that w can be extended to 
all of V in general (i.e., it has a “maximal” extension). In case V is a separa- 
ble Fréchet space and p a continuous seminorm on V, an elementary inductive 
argument provides an extension from W to a space dense in V, and hence by 
continuity to V. 

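The complex case can be deduced from the real case as follows. Define
$\gamma : W \to \mathbb{R}$ as $\gamma(v) = \operatorname{Re} \omega(v)$. Then $\omega(v) = \gamma(v) - i\gamma(iv)$. If $\Gamma : V \to \mathbb{R}$ is a
desired real, linear extension of $\gamma$ to $V$, then one can set $\Omega(v) = \Gamma(v) - i\Gamma(iv)$.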

We now make note of some further topologies on the dual space V’. The first 
is called the weak* -topology. It is the topology of pointwise convergence and is 
specified by the family of seminorms 


(4.11)  $p_v(\omega) = |\omega(v)|,$

as $v$ varies over $V$. The following result, called Alaoglu's theorem, is useful.


Proposition 4.4. If $V$ is a Banach space, then the closed unit ball $B \subset V'$ is
compact in the weak*-topology.


This result is readily deduced from the following fundamental result in 
topology: 


Theorem 4.5. If $\{X_\alpha : \alpha \in A\}$ is any family of compact Hausdorff spaces, then
the Cartesian product $\prod_\alpha X_\alpha$, with the product topology, is a compact Hausdorff
space.


Indeed, the space $B \subset V'$ above, with the weak*-topology, is homeomorphic
to a closed subset of the Cartesian product $\prod\{X_v : v \in B_1\}$, where $B_1 \subset V$ is
the unit ball in $V$, each $X_v$ is a copy of the unit disk in $\mathbb{C}$, and $\kappa : B \to \prod X_v$ is
given by $\kappa(\omega) = \{\omega(v) : v \in B_1\}$. For a proof of Tychonov's theorem
(Theorem 4.5), see [Dug] and [RS].

We remark that if $V$ is separable, then $B$ is a compact metric space. In fact, if
$\{v_j : j \in \mathbb{Z}^+\}$ is a dense subset of $B_1 \subset V$, the weak*-topology on $B$ is given by
the metric

(4.12)  $d(\omega, \sigma) = \sum_{j \ge 0} 2^{-j}\, |\langle v_j, \omega - \sigma\rangle|.$

Conversely, on $V$ there is the weak topology, the topology of pointwise
convergence on $V'$, with seminorms

(4.13)  $p_\omega(v) = |\omega(v)|, \quad \omega \in V'.$


When $V$ is a reflexive Banach space, $V = V''$, then the weak topology of $V$
coincides with its weak*-topology, as the dual of $V'$; thus Proposition 4.4 applies
to the unit ball in $V$ in this case.

More generally, we say two vector spaces $V$ and $W$ have a dual pairing if
there is a bilinear form $\langle v, w\rangle$, defined for $v \in V$, $w \in W$, such that for each
$v \ne 0$, $\langle v, w\rangle \ne 0$ for some $w \in W$, and for each $w \ne 0$, this form is nonzero
for some $v \in V$. Then the seminorms $p_w(v) = |\langle v, w\rangle|$ on $V$ define a Hausdorff
topology called the $\sigma(V, W)$-topology, and symmetrically we have the $\sigma(W, V)$-topology
on $W$. Thus the weak topology on $V$ defined above is the $\sigma(V, V')$-topology,
and the weak*-topology on $V'$ is the $\sigma(V', V)$-topology.

We define another topology on the dual space $V'$ of a locally convex space
$V$, called the strong topology. This is the topology of uniform convergence on
bounded subsets of $V$. A set $Y \subset V$ is bounded provided each seminorm $p_j$
defining the topology of $V$ is bounded on $Y$. The strong topology on $V'$ is defined
by the seminorms

(4.14)  $p_Y(\omega) = \sup\{|\omega(y)| : y \in Y\}, \quad Y \subset V \text{ bounded}.$

In case $V$ is a Banach space, $Y \subset V$ is bounded if and only if it is contained in
some ball of finite radius, and then each seminorm (4.14) is dominated by some
multiple of the norm on $V'$, given by (4.3). Thus in this case the strong topology
and the norm topology on $V'$ coincide. For more general Fréchet spaces, such as
$V = C^\infty(M)$, the strong topology on $V'$ does not make $V'$ a normed space, or
even a Fréchet space.

There are many interesting results in the subject of duality, concerning the 
topologies discussed above and other topologies, such as the Mackey topology, 
which we will not describe here. For further material, see [S]. 

We return to the setting of the Hahn-Banach theorem, Proposition 4.3, and
produce some complementary results. First, instead of taking $p : V \to \mathbb{R}^+$ to
be a seminorm, we can more generally take $p$ to be a gauge, which is a map
$p : V \to \mathbb{R}^+$ satisfying

(4.15)  $p(av) = a\,p(v),\ \ \forall\, a > 0, \qquad p(v + w) \le p(v) + p(w),$

instead of (3.1). A simple variant of the proof of Proposition 4.3 gives the
following.


Proposition 4.6. Let $V$ be a real linear space, $W$ a linear subspace. Assume $p$ is
a gauge on $V$. If $\omega : W \to \mathbb{R}$ is a linear functional satisfying $\omega(v) \le p(v)$, for
$v \in W$, then there is an extension of $\omega$ to a linear functional $\Omega$ on $V$, such that
$\Omega(v) \le p(v)$ for all $v \in V$.


Note that the conclusion gives $\Omega(-v) \le p(-v)$, hence $|\Omega(v)| \le \tilde p(v) =
\max(p(v), p(-v))$, so $\Omega$ is continuous if $\tilde p(v)$ is dominated by a seminorm that
helps define the topology of $V$.

Here is an example of a gauge. Let $V$ be a locally convex space and $\mathcal{O}$ a convex
neighborhood of $0 \in V$. Define $p_{\mathcal{O}} : V \to \mathbb{R}^+$ by

(4.16)  $p_{\mathcal{O}}(v) = \inf\{a > 0 : a^{-1} v \in \mathcal{O}\}.$

This is called the Minkowski gauge of $\mathcal{O}$. This object will take us from Proposition
4.6 to the following result, known as the separating hyperplane theorem.


Proposition 4.7. Let $V$ be a locally convex space (over $\mathbb{R}$), and let $K_1, K_2 \subset V$
be disjoint convex sets.

(i) If $K_1$ is open, then $K_1$ and $K_2$ can be separated by a closed hyperplane.
(ii) If $K_1$ and $K_2$ are both open, they can be strictly separated by a closed
hyperplane.
(iii) If $K_1$ is compact and $K_2$ is closed, they can be strictly separated by a closed
hyperplane.


Here (i) means there exists a continuous linear functional $\Omega : V \to \mathbb{R}$ and a
number $a \in \mathbb{R}$ such that $\Omega(v_1) \le a \le \Omega(v_2)$ for all $v_j \in K_j$, and (ii) means there
exist such $\Omega$ and $a$ with the property that $\Omega(v_1) < a < \Omega(v_2)$ for all $v_j \in K_j$.
The separating hyperplane is given by $\{v \in V : \Omega(v) = a\}$.


Proof. For (i), pick $w \in K_2 - K_1 = \{v_2 - v_1 : v_j \in K_j\}$, and let
$\mathcal{O} = K_1 - K_2 + w$. Then $\mathcal{O}$ is an open, convex neighborhood of 0, and $w \notin \mathcal{O}$.
Let $p = p_{\mathcal{O}}$ be the associated Minkowski gauge, and define $\omega$ on $\operatorname{Span}(w)$ by
$\omega(aw) = a$. Since $w \notin \mathcal{O}$, $p(w) \ge 1$, so $\omega(aw) \le p(aw)$ for all $a \ge 0$, hence for
all $a \in \mathbb{R}$. By Proposition 4.6, $\omega$ can be extended to a continuous linear functional
$\Omega$ on $V$ such that $\Omega(v) \le p(v)$, for all $v \in V$. Hence $\Omega(v) \le 1$ for all $v \in \mathcal{O}$.
Thus, for each $v_j \in K_j$,

$$\Omega(v_1) \le \Omega(v_2) + (1 - \omega(w)).$$

But $\omega(w) = 1$, so

(4.17)  $\Omega(v_1) \le \Omega(v_2), \quad \forall\, v_j \in K_j.$

This proves (i).

For (ii), take $\Omega$ as in (i). If $K_j$ is open, $\{\Omega(v) : v \in K_j\}$ is readily verified to
be an open subset of $\mathbb{R}$. So we have two open subsets of $\mathbb{R}$, which by (4.17) share
at most one point. They must hence be disjoint.

In case (iii), consider $C = K_2 - K_1$. Disjointness implies $0 \notin C$. Since $K_1$ is
compact, $C$ is closed. Thus there is an open, convex neighborhood $U$ of 0, disjoint
from $C$. Let $\tilde K_1 = K_1 + (1/2)U$ and $\tilde K_2 = K_2 - (1/2)U$. Then $\tilde K_1$ and $\tilde K_2$ are
disjoint, open, convex sets, and (ii) applies. Any closed hyperplane that strictly
separates $\tilde K_1$ and $\tilde K_2$ also strictly separates $K_1$ and $K_2$.
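
As a numerical aside, the Minkowski gauge (4.16) used in the proof of (i) can be approximated by bisection when membership in the convex set can be tested. The sketch below is only an illustration under stated assumptions: the set is taken to be an open ellipse in $\mathbb{R}^2$, and the names `minkowski_gauge` and `in_ellipse` are hypothetical helpers, not notation from the text. For this particular set the gauge has the closed form $\sqrt{(x/2)^2 + y^2}$, which gives something to compare against.

```python
import math

def minkowski_gauge(in_open_set, v, hi=1e6, tol=1e-12):
    """Approximate p_O(v) = inf{a > 0 : v/a in O} by bisection.

    Assumes O is convex and contains 0, so {a > 0 : v/a in O} is an
    interval (p_O(v), infinity). `in_open_set` is a membership test for O.
    """
    if not in_open_set(tuple(c / hi for c in v)):
        raise ValueError("v/hi is not in O; increase hi")
    lo = 0.0
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        if in_open_set(tuple(c / mid for c in v)):
            hi = mid   # v/mid lies in O, so the infimum is at most mid
        else:
            lo = mid   # v/mid lies outside O, so the infimum is at least mid
    return hi

# Illustrative choice: O = {(x, y) : (x/2)^2 + y^2 < 1}, an open ellipse.
def in_ellipse(pt):
    x, y = pt
    return (x / 2.0) ** 2 + y ** 2 < 1.0

v = (3.0, 4.0)
w = (-1.0, 2.0)
p_v = minkowski_gauge(in_ellipse, v)
p_w = minkowski_gauge(in_ellipse, w)
p_sum = minkowski_gauge(in_ellipse, (v[0] + w[0], v[1] + w[1]))
p_double = minkowski_gauge(in_ellipse, (2 * v[0], 2 * v[1]))
```

Here `p_double` and `p_sum` can be compared with `2 * p_v` and `p_v + p_w` to see the two properties in (4.15) at work on this example.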


Proposition 4.7 has the following important topological consequence. 


Proposition 4.8. Let $K$ be a closed, convex subset of the locally convex space $V$
(over $\mathbb{R}$). Then $K$ is weakly closed.


Proof. Suppose $v_\alpha \in K$ and $v_\alpha \to v$ weakly, that is, $\Omega(v_\alpha) \to \Omega(v)$ for all
$\Omega \in V'$. If $v \notin K$, this contradicts the conclusion of (iii) of Proposition 4.7 (with
$K_1 = \{v\}$, $K_2 = K$), so $v \in K$.


Note. If V is a linear space over C with a locally convex topology, its weak topol- 
ogy coincides with that produced by regarding V as a linear space over R. 


Proposition 4.8 interfaces as follows with Proposition 4.4. 


Proposition 4.9. Let $V$ be a reflexive Banach space and $K \subset V$ a closed,
bounded, convex set. Then $K$ is compact in the weak topology.


Proof. Proposition 4.4, with $V$ and $V'$ switched, implies that each closed ball
$B_R \subset V$ is compact in the weak topology (which coincides with the weak*
topology by reflexivity). The hypotheses imply $K \subset B_R$ for some $R$, and, by
Proposition 4.8, $K$ is a closed subset of $B_R$, in the weak topology.


Exercises 


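1. Let $\Gamma = \{\gamma\}$ be a directed set, as described at the end of the previous section, $V$ a
locally convex space, with dual $V'$, and $u_\gamma, u \in V'$. Show that

$$u_\gamma \to u \ \text{weak}^* \iff \langle v, u_\gamma\rangle \to \langle v, u\rangle, \quad \forall\, v \in V.$$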


2. Suppose $\{u_j : j \in \mathbb{Z}^+\}$ is an orthonormal set in a Hilbert space $H$. Show that $u_j \to 0$
in the weak* topology as $j \to \infty$.

3. In the setting of Exercise 2, suppose $H = L^2(X,\mu)$, and the $u_j$ also satisfy uniform
bounds: $|u_j(x)| \le M$. Show that $u_j \to 0$ in the weak* topology of $L^\infty(X,\mu)$, as the
dual to $L^1(X,\mu)$.

4. Deduce that if $f \in L^1(S^1)$, with Fourier coefficients $\hat f(n)$ given by (2.24), then
$\hat f(n) \to 0$ as $n \to \infty$.

5. Prove the assertion made in the text that, when $V$ is a separable Banach space, then
the unit ball $B$ in $V'$, with the weak* topology, is metrizable. (Hint: To show that
(4.12) defines a topology coinciding with the weak* topology, use the fact that if
$\varphi : X \to Y$ is continuous and bijective, with $X$ compact and $Y$ Hausdorff, then $\varphi$ is a
homeomorphism.)

6. On a Hilbert space $H$, suppose $f_j \to f$ weakly. Show that if

(4.18)  $\|f\| = \limsup_{j \to \infty} \|f_j\|,$

then $f_j \to f$ in norm. (Hint: Expand $(f - f_j, f - f_j)$.)

7. Extend Exercise 6 as follows. Let $V$ be a uniformly convex Banach space (cf. §1,
Exercise 6). Suppose $f_j, f \in V$ and $f_j \to f$ weakly. Show that if (4.18) holds, then $f_j \to f$
in norm. (Hint. Assume $\|f\| = 1$. Take $\omega \in V'$ such that $\|\omega\| = 1$ and $\langle f, \omega\rangle = 1$.
Investigate implications of

$$\Bigl\langle \frac{f + f_j}{2}, \omega \Bigr\rangle \to \langle f, \omega\rangle, \quad \text{as } j \to \infty,$$

in concert with (4.18).)

8. Suppose $X$ is a closed, linear subspace of a reflexive Banach space $V$. Show that $X$ is
reflexive. (Hint: Use the Hahn-Banach theorem. First show that $X' \approx V'/X^\perp$, where
$X^\perp = \{\omega \in V' : \omega(v) = 0,\ \forall\, v \in X\}$. Thus, a bounded linear functional on $X'$ gives
rise to a bounded linear functional on $V'$, annihilating $X^\perp$.)

9. Let $V$ be a $\mathbb{C}$-linear space, and let $\alpha : V \to \mathbb{R}$ be $\mathbb{R}$-linear. Define $\beta : V \to \mathbb{C}$ by
$\beta(v) = \alpha(v) - i\alpha(iv)$. Show that $\beta$ is $\mathbb{C}$-linear.

10. Suppose $V = H$ is a Hilbert space, $K \subset H$ a closed, convex subset, and $v \notin K$. As
an alternative to Proposition 4.7, use Proposition 2.1 to produce a closed hyperplane
strongly separating $K$ and $v$. Apply this to Propositions 4.8 and 4.9, in case $V$ is a
Hilbert space.


Let $V$ be a complex Banach space, $\Omega \subset \mathbb{C}$ an open set. A continuous function $F : \Omega \to
V$ is said to be weakly holomorphic if, for each $\omega \in V'$, the function $z \mapsto \langle F(z), \omega\rangle$ is
a complex-valued holomorphic function on $\Omega$. In the following exercises, assume $F$ is
weakly holomorphic on $\Omega$.

11. If $\mathcal{O}$ is a smoothly bounded open set and $\overline{\mathcal{O}} \subset \Omega$, prove the following version of the
Cauchy integral theorem,

$$\int_{\partial\mathcal{O}} F(\zeta)\, d\zeta = 0,$$

and the Cauchy integral formula,

$$F(z) = \frac{1}{2\pi i} \int_{\partial\mathcal{O}} F(\zeta)(\zeta - z)^{-1}\, d\zeta, \quad z \in \mathcal{O}.$$

12. Let $D_R(z_0)$ be the disk of radius $R$, centered at $z_0$, $\gamma$ its boundary. Assume
$\overline{D_R(z_0)} \subset \Omega$. Expand $(\zeta - z)^{-1}$ in a power series about $z_0$ and show that

$$F(z) = \sum_{k=0}^\infty G_k (z - z_0)^k, \quad \text{for } z \in D_R(z_0),$$

with

$$G_k = \frac{1}{2\pi i} \int_\gamma F(\zeta)(\zeta - z_0)^{-k-1}\, d\zeta \in V, \qquad \|G_k\| \le \sup_\gamma \|F(\zeta)\|\, R^{-k}.$$

13. Formulate the result that if a continuous $F : \Omega \to V$ is weakly holomorphic, then it is
holomorphic.

14. Let $V$ be a Banach space. Show that $V'$ separable $\Rightarrow$ $V$ separable.


5. Linear operators


If $V$ and $W$ are two Banach spaces, or more generally two locally convex spaces,
we denote by $\mathcal{L}(V, W)$ the space of continuous, linear transformations from $V$
to $W$. As in the derivation of (4.4), it is easy to see that, when $V$ and $W$ are
Banach spaces, a linear map $T : V \to W$ is continuous if and only if there exists
a constant $C < \infty$ such that

(5.1)  $\|Tv\| \le C\|v\|$

for all $v \in V$. Thus we call $T$ a bounded operator. The infimum of all the $C$'s for
which this holds is defined to be $\|T\|$; equivalently,

(5.2)  $\|T\| = \sup\{\|Tv\| : \|v\| \le 1\}.$


It is clear that $\mathcal{L}(V, W)$ is a linear space. If $V$ and $W$ are Banach spaces and
$T_j \in \mathcal{L}(V, W)$, then $\|T_1 + T_2\| \le \|T_1\| + \|T_2\|$; completeness is also easy to
establish in this case, so $\mathcal{L}(V, W)$ is also a Banach space. If $X$ is a third Banach
space and $S \in \mathcal{L}(W, X)$, it is clear that $ST \in \mathcal{L}(V, X)$, and

(5.3)  $\|ST\| \le \|S\| \cdot \|T\|.$

The space $\mathcal{L}(V) = \mathcal{L}(V, V)$, with norm (5.2), is a Banach algebra for any
Banach space $V$. Generally, a Banach algebra is defined to be a Banach space $B$
with the structure of an algebra, so that, for any $S, T \in B$, the inequality (5.3)
holds. We say a little more about Banach algebras at the end of this section.

If $V$ and $W$ are Banach spaces and $T \in \mathcal{L}(V, W)$, then the adjoint
$T' \in \mathcal{L}(W', V')$ is uniquely defined to satisfy

(5.4)  $\langle Tv, \omega\rangle = \langle v, T'\omega\rangle, \quad v \in V,\ \omega \in W'.$

Using the Hahn-Banach theorem, it is easy to see that

(5.5)  $\|T\| = \|T'\|,$

both norms being the sup of the absolute value of (5.4) over $\|v\| = 1$, $\|\omega\| = 1$.
When $V$ and $W$ are reflexive, it is clear that $T'' = T$. We remark that (5.4) also
defines $T'$ for general locally convex $V$ and $W$.

In case $V$ and $W$ are Hilbert spaces and $T \in \mathcal{L}(V, W)$, then we also have an
adjoint $T^* \in \mathcal{L}(W, V)$, given by

(5.6)  $(Tv, w) = (v, T^* w), \quad v \in V,\ w \in W,$

using the inner products on $W$ and $V$, respectively. As in (5.5) we have $\|T\| =
\|T^*\|$. Also it is clear that $T^{**} = T$.

When $H$ is a Hilbert space, the Banach algebra $\mathcal{L}(H)$ is a $C^*$-algebra.
Generally, a $C^*$-algebra $B$ is a Banach algebra, equipped with a conjugate linear
involution $T \mapsto T^*$, satisfying $\|T^*\| = \|T\|$ and

(5.7)  $\|T^* T\| = \|T\|^2.$

To see that (5.7) holds for $T \in \mathcal{L}(H)$, note that both sides are equal to the sup of
the absolute value, over $\|v_1\| \le 1$, $\|v_2\| \le 1$, of

(5.8)  $(T^* T v_1, v_2) = (T v_1, T v_2),$

such a supremum necessarily being obtained over the set of pairs satisfying
$v_1 = v_2$. Note that $C(X)$, considered above, is also a $C^*$-algebra. However, for a
general Banach space $V$, $\mathcal{L}(V)$ will not have the structure of a $C^*$-algebra.

We consider some simple examples of bounded linear operators. If $(X, \mu)$ is
a measure space, $f \in L^\infty(X, \mu)$, then the multiplication operator $M_f$, defined
by $M_f u = fu$, is bounded on $L^p(X, \mu)$ for each $p \in [1, \infty]$, with $\|M_f\| =
\|f\|_{L^\infty}$. If $X$ is a compact Hausdorff space and $f \in C(X)$, then $M_f \in \mathcal{L}(C(X))$,
with $\|M_f\| = \|f\|_{\sup}$. In case $X$ is a compact Riemannian manifold and $P$ is a
differential operator of order $k$ on $X$, with smooth coefficients, then $P$ does not
give a bounded operator on $C(X)$, but one has $P \in \mathcal{L}(C^k(X), C(X))$, and more
generally $P \in \mathcal{L}(C^{k+m}(X), C^m(X))$, for $m \ge 0$. For related results on Sobolev
spaces, see Chap. 4.

Another class of examples, a little more elaborate than those just mentioned, is
given by integral operators, of the form

(5.9)  $Ku(x) = \int_X k(x, y)\, u(y)\, d\mu(y),$

where $(X, \mu)$ is a measure space. We have the following result:


Proposition 5.1. Suppose $k$ is measurable on $X \times X$ and

(5.10)  $\int_X |k(x, y)|\, d\mu(x) \le C_1, \qquad \int_X |k(x, y)|\, d\mu(y) \le C_2,$

for all $y$ and for all $x$, respectively. Then (5.9) defines $K$ as a bounded operator
on $L^p(X, \mu)$, for each $p \in [1, \infty]$, with

(5.11)  $\|K\| \le C_1^{1/p} C_2^{1/q}, \qquad \frac{1}{p} + \frac{1}{q} = 1.$


Proof. For $p \in (1, \infty)$, we estimate

(5.12)  $\Bigl| \int_X \int_X k(x, y) f(y) g(x)\, d\mu(x)\, d\mu(y) \Bigr|$

via the estimate $ab \le a^p/p + b^q/q$ of (1.15), used to prove Hölder's inequality.
Apply this to $|f(y) g(x)|$. Then (5.12) is dominated by

(5.13)  $\frac{C_1}{p} \|f\|_{L^p}^p + \frac{C_2}{q} \|g\|_{L^q}^q,$

provided (5.10) holds. Replacing $f, g$ by $tf, t^{-1}g$, we see that (5.12) is dominated
by $(C_1 t^p/p)\|f\|_{L^p}^p + (C_2/q t^q)\|g\|_{L^q}^q$; minimizing over $t \in (0, \infty)$, via elementary
calculus, we see that (5.12) is dominated by

(5.14)  $C_1^{1/p} C_2^{1/q} \|f\|_{L^p} \|g\|_{L^q},$

proving the result. The exceptional cases $p = 1$ and $p = \infty$ are easily handled.
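
To see the bound (5.11) in the simplest possible setting, one can take $X$ to be a finite set with counting measure, so that $k$ is a matrix, $C_1$ is the largest absolute column sum, and $C_2$ is the largest absolute row sum. The sketch below is an illustration only: the matrix `K`, the sampling of test vectors, and the helper names are assumptions made for this example, and random sampling can only probe the inequality, not prove it.

```python
import random

def lp_norm(u, p):
    """l^p norm of a finite sequence; p may be float('inf')."""
    if p == float("inf"):
        return max(abs(x) for x in u)
    return sum(abs(x) ** p for x in u) ** (1.0 / p)

def apply_kernel(k, u):
    """(Ku)(x) = sum_y k[x][y] u[y], the counting-measure form of (5.9)."""
    return [sum(k[x][y] * u[y] for y in range(len(u))) for x in range(len(k))]

def schur_constants(k):
    """C1 bounds the sums over x (columns), C2 the sums over y (rows), as in (5.10)."""
    rows, cols = len(k), len(k[0])
    c1 = max(sum(abs(k[x][y]) for x in range(rows)) for y in range(cols))
    c2 = max(sum(abs(k[x][y]) for y in range(cols)) for x in range(rows))
    return c1, c2

def schur_bound(k, p):
    """Right side of (5.11): C1^(1/p) * C2^(1/q) with 1/p + 1/q = 1."""
    c1, c2 = schur_constants(k)
    inv_p = 0.0 if p == float("inf") else 1.0 / p
    return c1 ** inv_p * c2 ** (1.0 - inv_p)

K = [[1.0, -2.0, 0.0], [0.0, 1.0, 3.0], [4.0, 0.0, -1.0]]
rng = random.Random(0)
worst_ratio = {}
for p in (1.0, 1.5, 2.0, 3.0, float("inf")):
    ratios = []
    for _ in range(200):
        u = [rng.uniform(-1.0, 1.0) for _ in range(3)]
        ratios.append(lp_norm(apply_kernel(K, u), p) / lp_norm(u, p))
    worst_ratio[p] = max(ratios)   # largest observed ||Ku||_p / ||u||_p
```

For each sampled exponent, `worst_ratio[p]` should not exceed `schur_bound(K, p)`, in line with Proposition 5.1.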


We call $k(x, y)$ the integral kernel of $K$. Note that $K'$ is an integral operator,
with kernel $k'(x, y) = k(y, x)$. In the case of the Hilbert space $L^2(X, \mu)$, $K^*$ is
an integral operator, with kernel $k^*(x, y) = \overline{k(y, x)}$.

Chapter 7 includes a study of a much more subtle class of operators called
singular integral operators, or pseudodifferential operators of order zero;
$L^p$-estimates for this class are made in Chap. 13.

We next consider some results about linear transformations on Banach spaces 
which use the following general result about complete metric spaces, known as 
the Baire category theorem. 


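Proposition 5.2. Let $X$ be a complete metric space, and $X_j$, $j \in \mathbb{Z}^+$,
nowhere-dense subsets; that is, the closure $\overline{X}_j$ contains no nonempty open set. Then

$$\bigcup_j X_j \ne X.$$


Proof. The hypothesis on $X_1$ implies there is a closed ball $\overline{B}_{r_1}(p_1) \subset X \setminus \overline{X}_1$,
for some $p_1 \in X$, $r_1 > 0$. Then the hypothesis on $X_2$ gives a ball $\overline{B}_{r_2}(p_2) \subset
B_{r_1}(p_1) \setminus \overline{X}_2$, $0 < r_2 \le r_1/2$. Continue, getting balls

(5.15)  $\overline{B}_{r_j}(p_j) \subset B_{r_{j-1}}(p_{j-1}) \setminus \overline{X}_j, \qquad 0 < r_j \le 2^{-j+1} r_1.$

Then $(p_j)$ is Cauchy; it must converge to a point $p \notin \bigcup_j X_j$, as $p$ belongs to each
$\overline{B}_{r_j}(p_j)$.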


Our first application is to a result called the uniform boundedness principle. 


Proposition 5.3. Let $V, W$ be Banach spaces, $T_j \in \mathcal{L}(V, W)$, $j \in \mathbb{Z}^+$. Assume
that for each $v \in V$, $\{T_j v\}$ is bounded in $W$. Then $\{\|T_j\|\}$ is bounded.


Proof. Let $X = V$. Let $X_n = \{v \in X : \|T_j v\| \le n \text{ for all } j\}$. The hypothesis
implies $\bigcup_n X_n = X$. Clearly, each $X_n$ is closed. The Baire category theorem
implies that some $X_N$ has nonempty interior, so there exist $v_0$, $r > 0$ such that
$\|v\| \le r \Rightarrow \|T_j(v_0 + v)\| \le N$, for all $j$. Hence

(5.16)  $\|v\| \le r \Longrightarrow \|T_j v\| \le N + \|T_j v_0\| \le R, \quad \forall\, j,$

using the boundedness of $\{T_j v_0\}$. This implies $\|T_j\| \le R/r$, completing the
proof.


The next result is known as the open mapping theorem.


Proposition 5.4. If $V$ and $W$ are Banach spaces and $T \in \mathcal{L}(V, W)$ is onto, then
any neighborhood of 0 in $V$ is mapped onto a neighborhood of 0 in $W$.


Proof. Let $B_1$ denote the unit ball in $V$, $X_n = T(nB_1) = nT(B_1)$. The hypothesis
implies $\bigcup_{n \ge 1} X_n = W$. The Baire category theorem implies that some $\overline{X}_N$
has nonempty interior, hence contains a ball $B_r(w_0)$; symmetry under sign change
implies $\overline{X}_N$ also contains $B_r(-w_0)$. Hence $\overline{X}_{2N} = 2\overline{X}_N$ contains $B_{2r}(0)$. By
scaling, $\overline{X}_1$ contains a ball $B_\varepsilon(0)$. Our goal now is to show that $X_1$ itself contains
a ball. This will follow if we can show that $\overline{X}_1 \subset X_2$.

So let $y \in \overline{X}_1 = \overline{T(B_1)}$. Thus there is an $x_1 \in B_1$ such that $y - Tx_1 \in
B_{\varepsilon/2}(0) \subset \overline{X}_{1/2}$. For the same reason, there is an $x_2 \in B_{1/2}$ such that
$(y - Tx_1) - Tx_2 \in B_{\varepsilon/4}(0) \subset \overline{X}_{1/4}$. Continue, getting $x_n \in B_{2^{1-n}}$ such that

$$y - \sum_{j=1}^n T x_j \in B_{\varepsilon/2^n}(0).$$

Then $x = \sum_{j=1}^\infty x_j$ is in $B_2$ and $Tx = y$. This completes the proof.


Corollary 5.5. If $V$ and $W$ are Banach spaces and $T : V \to W$ is continuous
and bijective, then $T^{-1} : W \to V$ is continuous.


In such a situation, we say that $T$ is a topological isomorphism.

The third basic application of the Baire category theorem is called the
closed-graph theorem. For a given linear map $T : V \to W$, its graph is defined to be

(5.17)  $G_T = \{(v, Tv) \in V \oplus W : v \in V\}.$

It is easy to see that, whenever $V$ and $W$ are topological vector spaces, then if $T$
is continuous, $G_T$ is closed. The following is a converse.


Proposition 5.6. Let $V$ and $W$ be Banach spaces, $T : V \to W$ a linear map. If
$G_T$ is closed in $V \oplus W$, then $T$ is continuous.


Proof. The hypothesis implies that $G_T$ is a Banach space, with norm $\|(v, Tv)\|
= \|v\| + \|Tv\|$. Now the maps $J : G_T \to V$, $K : G_T \to W$, given by
$J(v, Tv) = v$, $K(v, Tv) = Tv$, are clearly continuous, and $J$ is bijective. Hence
$J^{-1}$ is continuous, and so $T = KJ^{-1}$ is also continuous.


Propositions 5.3-5.6 have extensions to Fréchet spaces, since they are also
complete metric spaces. For example, let $V$ be a Fréchet space in Proposition 5.3
(keep $W$ a Banach space). In this case, the hypothesis that $\{T_j v\}$ is bounded in
$W$ for each $v \in V$ implies that there exists a neighborhood $\mathcal{O}$ of the origin in $V$,
of the form (3.8), such that $v \in \mathcal{O} \Rightarrow \|T_j v\| \le 1$ for all $j$, that is, for some finite
sum $q$ of seminorms defining the Fréchet space structure of $V$,

(5.18)  $\|T_j v\| \le K\, q(v), \quad \text{for all } j,$

with $K$ independent of $j$.

Propositions 5.4-5.6 extend directly to the case where $V$ and $W$ are Fréchet
spaces, with only slight extra complications in the proofs.

We now give an important application of the open mapping theorem, to a result
known as the closed-range theorem. If $W$ is a Banach space and $L \subset W$ is a linear
subspace, we denote by $L^\perp$ the subspace of $W'$ consisting of linear functionals
on $W$ that annihilate $L$.


Proposition 5.7. If $V$ and $W$ are Banach spaces and $T \in \mathcal{L}(V, W)$, then

(5.19)  $\operatorname{Ker} T' = T(V)^\perp.$

If, in addition, $T(V)$ is closed in $W$, then $T'(W')$ is closed in $V'$ and

(5.20)  $T'(W') = (\operatorname{Ker} T)^\perp.$

Proof. For the first identity, by $\langle Tv, w'\rangle = \langle v, T'w'\rangle$, it is obvious that $T(V)^\perp =
\operatorname{Ker} T'$. This gives (5.19).

It takes more work to establish (5.20). As a preliminary, we note that the
identity $\langle v, T'w'\rangle = \langle Tv, w'\rangle$ readily implies

(5.21)  $T' : W' \longrightarrow (\operatorname{Ker} T)^\perp.$

From here, the argument proceeds by writing $T'$ as a composition of three
operators:

(5.22)  $\begin{array}{ccc} W' & \xrightarrow{\ T'\ } & (\operatorname{Ker} T)^\perp \\ {\scriptstyle j'}\big\downarrow & & \big\uparrow{\scriptstyle \pi'} \\ T(V)' & \xrightarrow{\ \tilde T'\ } & (V/\operatorname{Ker} T)' \end{array}$

These are gotten by taking the adjoint of

(5.23)  $T = j \circ \tilde T \circ \pi,$

where the factors are given as follows:

(5.24)  $\pi : V \longrightarrow V/\operatorname{Ker} T$

is the natural projection,

(5.25)  $\tilde T : V/\operatorname{Ker} T \longrightarrow T(V), \qquad \tilde T(v \bmod \operatorname{Ker} T) = Tv,$

and

(5.26)  $j : T(V) \longrightarrow W$

is the natural inclusion. The hypothesis that $T$ has closed range implies that the
maps in (5.25) and (5.26) are maps between Banach spaces. To verify (5.23),
note that $\tilde T(\pi v) = Tv$ by the definition of $\tilde T$, and $j$ is the identity map on $T(V)$. Hence
$T' = \pi' \circ \tilde T' \circ j'$.

Thus the proof of (5.20) will be accomplished if we establish the following:

(5.27)  $j' : W' \longrightarrow T(V)'$ is surjective,
(5.28)  $\tilde T' : T(V)' \xrightarrow{\ \approx\ } (V/\operatorname{Ker} T)'$,
(5.29)  $\pi' : (V/\operatorname{Ker} T)' \xrightarrow{\ \approx\ } (\operatorname{Ker} T)^\perp$.

We establish these results in more general settings. First, (5.27) follows from:


Lemma 5.8. If $X$ is a closed linear subspace of a Banach space $W$ and $j : X \to
W$ the inclusion, then $j' : W' \to X'$ is surjective.


Proof. Given $x' \in X'$, i.e., $x' : X \to \mathbb{C}$, the Hahn-Banach theorem provides an
extension $w' : W \to \mathbb{C}$, and then $x' = j'(w')$.


To establish that (5.28) holds, we note that

(5.30)  $\tilde T : V/\operatorname{Ker} T \longrightarrow T(V)$ is bijective,

and a continuous linear map between Banach spaces, so (5.28) follows from:


Lemma 5.9. If $X$ and $Y$ are Banach spaces and $A \in \mathcal{L}(X, Y)$ is bijective from
$X$ to $Y$, then $A' : Y' \to X'$ is an isomorphism.


Proof. It follows from the open mapping theorem that $B = A^{-1} : Y \to X$ is
continuous. Hence $B' \in \mathcal{L}(X', Y')$. Now we have $B'A' = I$ and $A'B' = I$, so
Lemma 5.9 is proven.


Finally, (5.29) follows from:


Lemma 5.10. If $X$ is a closed linear subspace of a Banach space $V$ and $\pi : V \to
V/X$ is the natural projection, then

(5.31)  $\pi' : (V/X)' \xrightarrow{\ \approx\ } X^\perp.$

Proof. The isomorphism (5.31) follows from the fact that continuous linear
functionals on $V/X$ correspond precisely with continuous linear functionals on $V$ that
annihilate $X$, which is the definition of $X^\perp$.


REMARK. In the Hilbert space case, we have the same result for $T^*$.


Since one frequently looks at equations $Tu = v$, it is important to consider
the notion of invertibility. An operator $T \in \mathcal{L}(V, W)$ is invertible if there is an
$S \in \mathcal{L}(W, V)$ such that $ST$ and $TS$ are identity operators. One useful fact is that
all operators close to the identity in $\mathcal{L}(V)$ are invertible.


Proposition 5.11. Let $V$ be a Banach space, $T \in \mathcal{L}(V)$, with $\|T\| < 1$. Then
$I - T$ is invertible.


Proof. The power series $\sum_{n=0}^\infty T^n$ converges to $(I - T)^{-1}$.
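
The series in this proof is easy to test on a small matrix. The following sketch is an illustration under assumptions of our own choosing: $V = \mathbb{R}^2$ with the sup norm (so the operator norm is the largest absolute row sum), a particular matrix `T` with norm $0.7$, and 200 terms of the series; the helper names are hypothetical.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def row_sum_norm(a):
    """Operator norm on (R^n, sup norm): the maximum absolute row sum."""
    return max(sum(abs(x) for x in row) for row in a)

def neumann_inverse(t, terms=200):
    """Partial sum I + T + ... + T^terms, approximating (I - T)^{-1} when ||T|| < 1."""
    n = len(t)
    total = identity(n)
    power = identity(n)
    for _ in range(terms):
        power = matmul(power, t)   # next power of T
        total = [[total[i][j] + power[i][j] for j in range(n)] for i in range(n)]
    return total

T = [[0.2, 0.5], [0.1, 0.3]]
S = neumann_inverse(T)
I_minus_T = [[(1.0 if i == j else 0.0) - T[i][j] for j in range(2)] for i in range(2)]
residual = matmul(I_minus_T, S)   # should be close to the identity matrix
```

The tail of the series is bounded by $\|T\|^{N+1}/(1 - \|T\|)$ after $N$ terms, which is why a fixed number of terms suffices here.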


When $V$ is a Banach space, we say $\zeta \in \mathbb{C}$ belongs to the resolvent set of
an operator $T \in \mathcal{L}(V)$ (denoted $\rho(T)$) provided $\zeta I - T$ is invertible; then the
resolvent of $T$ is

(5.32)  $R_\zeta = (\zeta I - T)^{-1}.$

It easily follows from the method of proof of Proposition 5.11 that the resolvent set
of any $T \in \mathcal{L}(V)$ is open in $\mathbb{C}$. Furthermore, $R_\zeta$ is a holomorphic function of
$\zeta \in \rho(T)$. In fact, if $\zeta_0 \in \rho(T)$, then we can write
$\zeta - T = (\zeta_0 - T)(I - (\zeta_0 - \zeta) R_{\zeta_0})$, and hence, for $\zeta$ close to $\zeta_0$,

$$R_\zeta = R_{\zeta_0} \sum_{n=0}^\infty R_{\zeta_0}^n (\zeta_0 - \zeta)^n.$$

It is also clear that $\zeta$ belongs to the resolvent set whenever $|\zeta| > \|T\|$, since

(5.33)  $\zeta - T = \zeta\,(I - \zeta^{-1} T).$

The complement of the resolvent set is called the spectrum of $T$. Thus, for
any $T \in \mathcal{L}(V)$, the spectrum of $T$ (denoted $\sigma(T)$) is a compact set in $\mathbb{C}$. By
(5.33), $\|R_\zeta\| \to 0$ as $|\zeta| \to \infty$. Since $R_\zeta$ is holomorphic on $\rho(T)$, it follows by
Liouville's theorem that, for any $T \in \mathcal{L}(V)$, $\rho(T)$ cannot be all of $\mathbb{C}$, so $\sigma(T)$ is
nonempty.

Using the resolvent as a tool, we now discuss a holomorphic functional
calculus for an operator $T \in \mathcal{L}(V)$, and applications to spectral theory. Let $\Omega$ be a
bounded region in $\mathbb{C}$, with smooth boundary, containing the spectrum $\sigma(T)$ in its
interior. If $f$ is holomorphic on a neighborhood of $\overline{\Omega}$, we set

(5.34)  $f(T) = \frac{1}{2\pi i} \int_\gamma f(\zeta)\,(\zeta - T)^{-1}\, d\zeta,$

where $\gamma = \partial\Omega$. Note that if $T$ were a complex number in $\Omega$, this would be
Cauchy's formula. Here are a couple of very basic facts.


Lemma 5.12. If $f(z) \equiv 1$, then $f(T) = I$, and if $f(z) = z$, then $f(T) = T$.
More generally, if $k \in \mathbb{N}$ and $f(z) = z^k$, then $f(T) = T^k$.


Proof. Deform $\gamma$ to be a large circle and use (5.33), plus

(5.35)  $(I - \zeta^{-1} T)^{-1} = I + \sum_{n=1}^\infty (\zeta^{-1} T)^n.$

The following allows us to represent $f(T)$ as a convergent power series in $T$
when $f$ is holomorphic on a disk $D_R(0)$ containing $\sigma(T)$. It also applies to much
more general situations.


Proposition 5.13. Take $T \in \mathcal{L}(V)$. Assume $f_k$ and $f$ are holomorphic on a
neighborhood $\mathcal{O}$ of $\sigma(T)$ and $f_k \to f$ uniformly on $\mathcal{O}$. Then $f_k(T) \to f(T)$
in norm.


Proof. Take a smoothly bounded open set $\Omega$ such that $\sigma(T) \subset \Omega \subset \overline{\Omega} \subset \mathcal{O}$. For
$g$ holomorphic on $\mathcal{O}$, (5.34) implies

(5.36)  $\|g(T)\| \le C \sup_{\zeta \in \partial\Omega} |g(\zeta)|,$

with

(5.37)  $C = \frac{L(\partial\Omega)}{2\pi} \sup_{\zeta \in \partial\Omega} \|R_\zeta\|.$

Applying this estimate to $g = f - f_k$ gives $\|f(T) - f_k(T)\| \to 0$.


We next derive a multiplicative property of this functional calculus, making
use of the following result, known as the resolvent identity.


Lemma 5.14. If $z, \zeta \in \rho(T)$, then

(5.38)  $R_z - R_\zeta = (\zeta - z)\, R_z R_\zeta.$

Proof. For any $\zeta \in \rho(T)$, $R_\zeta$ commutes with $\zeta - T$, hence with $T$, hence with
any $z - T$. If, in addition, $z \in \rho(T)$, we have both $R_\zeta R_z (z - T) = R_\zeta$ and
$R_z R_\zeta (z - T) = R_z (z - T) R_\zeta = R_\zeta$, hence

(5.39)  $R_z R_\zeta = R_\zeta R_z.$

Thus

$$R_z - R_\zeta = (\zeta - T) R_\zeta R_z - (z - T) R_z R_\zeta = (\zeta - z)\, R_\zeta R_z,$$

proving (5.38).
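
The resolvent identity can be checked directly for a $2 \times 2$ matrix, where the resolvent is given by the explicit inverse formula. In the sketch below the matrix `T` (with eigenvalues 2 and 3) and the two points `z` and `zeta` in the resolvent set are arbitrary choices made for illustration, and `inv2`, `matmul2`, `resolvent` are hypothetical helper names.

```python
def inv2(m):
    """Inverse of a 2x2 matrix via the adjugate formula."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def resolvent(t, zeta):
    """R_zeta = (zeta I - T)^{-1}, as in (5.32)."""
    return inv2([[zeta - t[0][0], -t[0][1]], [-t[1][0], zeta - t[1][1]]])

T = [[2.0, 1.0], [0.0, 3.0]]   # upper triangular, spectrum {2, 3}
z = 1.0 + 1.0j
zeta = -2.0j

Rz = resolvent(T, z)
Rzeta = resolvent(T, zeta)
product = matmul2(Rz, Rzeta)

# Left and right sides of (5.38), entry by entry.
lhs = [[Rz[i][j] - Rzeta[i][j] for j in range(2)] for i in range(2)]
rhs = [[(zeta - z) * product[i][j] for j in range(2)] for i in range(2)]

# The commutation relation (5.39) can be compared the same way.
reverse_product = matmul2(Rzeta, Rz)
```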


Now for our multiplicative property:


Proposition 5.15. If $f$ and $g$ are holomorphic on a neighborhood of $\overline{\Omega}$, then

(5.40)  $f(T)\, g(T) = (fg)(T).$


Proof. Let $\gamma = \partial\Omega$, as above, and let $\gamma_1$ be the boundary of a slightly larger
region, on which $f$ and $g$ are holomorphic. Write

$$g(T) = \frac{1}{2\pi i} \int_{\gamma_1} g(z)\,(z - T)^{-1}\, dz,$$

and hence, using (5.34), write $f(T) g(T)$ as a double integral. The product $R_\zeta R_z$
of resolvents of $T$ appears in the new integrand. Using the resolvent identity
(5.38), we obtain

(5.41)  $f(T) g(T) = -\frac{1}{4\pi^2} \int_{\gamma_1} \int_\gamma (\zeta - z)^{-1} f(\zeta)\, g(z)\, (R_z - R_\zeta)\, d\zeta\, dz.$

The term involving $R_z$ as a factor has $d\zeta$-integral equal to zero, by Cauchy's
theorem. Doing the $dz$-integral for the other term, using Cauchy's identity

$$g(\zeta) = \frac{1}{2\pi i} \int_{\gamma_1} (z - \zeta)^{-1} g(z)\, dz,$$

we obtain from (5.41)

(5.42)  $f(T) g(T) = \frac{1}{2\pi i} \int_\gamma f(\zeta)\, g(\zeta)\, R_\zeta\, d\zeta,$

which gives (5.40).


One interesting situation that frequently arises is the following. $\Omega$ can have
several connected components, $\Omega = \Omega_1 \cup \cdots \cup \Omega_K$, each $\Omega_j$ containing different
pieces of $\sigma(T)$. Taking a function equal to 1 on $\Omega_j$ and 0 on the other components
produces operators

(5.43)  $P_j = \frac{1}{2\pi i} \int_{\gamma_j} (\zeta - T)^{-1}\, d\zeta, \qquad \gamma_j = \partial\Omega_j.$

By (5.40) we see that

(5.44)  $P_j^2 = P_j, \qquad P_j P_k = 0 \ \text{ for } j \ne k,$

so $P_1, \dots, P_K$ are mutually disjoint projections. By Lemma 5.12,
$P_1 + \cdots + P_K = I$. It follows easily that if $T_j$ denotes the restriction of $T$ to the range of
$P_j$, then

(5.45)  $\sigma(T_j) = \sigma(T) \cap \Omega_j.$
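
For a matrix, the contour integral (5.43) can be approximated by the trapezoid rule on a circle, which gives a concrete view of the projections $P_j$. The sketch below rests on assumptions made only for illustration: $T$ is a $2 \times 2$ matrix with eigenvalues 1 and 3, the regions $\Omega_1$, $\Omega_2$ are unit disks about those eigenvalues, and `spectral_projection` is a hypothetical helper name. With $\zeta = c + re^{i\theta}$ one has $d\zeta = i\,re^{i\theta}\,d\theta$, so each quadrature node contributes $re^{i\theta} R_\zeta / N$.

```python
import cmath
import math

def inv2(m):
    """Inverse of a 2x2 matrix via the adjugate formula."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def spectral_projection(t, center, radius, nodes=64):
    """Trapezoid-rule approximation of (1/(2 pi i)) * integral of (zeta - T)^{-1} d zeta
    over the circle |zeta - center| = radius."""
    acc = [[0j, 0j], [0j, 0j]]
    for k in range(nodes):
        w = radius * cmath.exp(2j * math.pi * k / nodes)   # zeta - center
        zeta = center + w
        r = inv2([[zeta - t[0][0], -t[0][1]], [-t[1][0], zeta - t[1][1]]])
        for i in range(2):
            for j in range(2):
                acc[i][j] += r[i][j] * w / nodes
    return acc

T = [[1.0, 1.0], [0.0, 3.0]]                 # spectrum {1, 3}
P1 = spectral_projection(T, 1.0, 1.0)        # circle around the eigenvalue 1
P2 = spectral_projection(T, 3.0, 1.0)        # circle around the eigenvalue 3
P1_squared = matmul2(P1, P1)
P1_P2 = matmul2(P1, P2)
```

Comparing `P1_squared` with `P1`, `P1_P2` with zero, and `P1 + P2` with the identity illustrates (5.44) and the remark following it on this example.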


We next relate the spectrum of $f(T)$ to that of $T$.


Proposition 5.16. If $T \in \mathcal{L}(V)$, $f$ is holomorphic on a neighborhood of $\sigma(T)$,
and $z \notin f(\sigma(T))$, then $z - f(T)$ is invertible.


Proof. Set

(5.46)  $g_z(\zeta) = \frac{1}{z - f(\zeta)}$  (holomorphic in $\zeta$ on a neighborhood of $\sigma(T)$).

The multiplicativity gives

(5.47)  $g_z(T)\,(z - f(T)) = (z - f(T))\, g_z(T) = I.$


Another way to phrase Proposition 5.16 is that

(5.48)  $\sigma(f(T)) \subset f(\sigma(T)).$

This is completed by the following result, known as the spectral mapping theorem.


Proposition 5.17. In the setting of Proposition 5.16,

(5.49)  $\sigma(f(T)) = f(\sigma(T)).$

Proof. Say $f$ is holomorphic on a neighborhood $\Omega$ of $\sigma(T)$. Taking $\lambda \in \sigma(T)$,
we have

(5.50)  $f(T) - f(\lambda) = (T - \lambda)\, G_\lambda(T) = G_\lambda(T)\,(T - \lambda),$

where

(5.51)  $G_\lambda(\zeta) = \frac{f(\zeta) - f(\lambda)}{\zeta - \lambda},$

which is holomorphic in $\zeta \in \Omega$ (with a removable singularity at $\zeta = \lambda$). Clearly
if $\lambda \in \sigma(T)$, the right side of (5.50) is not invertible. This yields

(5.52)  $\lambda \in \sigma(T) \Longrightarrow f(\lambda) \in \sigma(f(T)),$

which together with (5.48) gives (5.49).
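
In finite dimensions the spectral mapping theorem says that the eigenvalues of $f(T)$ are the values of $f$ at the eigenvalues of $T$. The sketch below checks this for one example chosen purely for illustration: a symmetric $2 \times 2$ matrix and the polynomial $f(z) = z^2 + 1$, for which Lemma 5.12 gives $f(T) = T^2 + I$. The helper `eigenvalues2` uses the trace-determinant formula and is a hypothetical name.

```python
import cmath

def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def eigenvalues2(m):
    """Eigenvalues of a 2x2 matrix from its trace and determinant, sorted."""
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return sorted([(tr - disc) / 2, (tr + disc) / 2], key=lambda z: (z.real, z.imag))

def f_scalar(z):
    return z * z + 1

def f_operator(t):
    """f(T) = T^2 + I for the polynomial f(z) = z^2 + 1."""
    t2 = matmul2(t, t)
    return [[t2[i][j] + (1.0 if i == j else 0.0) for j in range(2)] for i in range(2)]

T = [[2.0, 1.0], [1.0, 2.0]]                      # spectrum {1, 3}
spec_T = eigenvalues2(T)
spec_fT = eigenvalues2(f_operator(T))             # spectrum of f(T)
image = sorted((f_scalar(lam) for lam in spec_T), # f applied to the spectrum of T
               key=lambda z: (z.real, z.imag))
```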


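A natural adjunct to the spectral mapping theorem is the following composition
identity.


Proposition 5.18. Given $T \in \mathcal{L}(V)$, $f$ holomorphic on a neighborhood of $\sigma(T)$,
and $h$ holomorphic on a neighborhood of $f(\sigma(T))$ (so $h \circ f$ is holomorphic on a
neighborhood of $\sigma(T)$), we have

(5.53)  $(h \circ f)(T) = h(f(T)).$


Proof. There is no loss in assuming $\sigma(T) \subset \Omega$, $f$ holomorphic on a
neighborhood of $\overline{\Omega}$, and $h$ holomorphic on a neighborhood of $f(\overline{\Omega})$.
First, for $\zeta \in \Omega$, $\tilde\gamma$ the boundary of some neighborhood of $f(\overline{\Omega})$, we have

(5.54)  $(h \circ f)(\zeta) = h(f(\zeta)) = \frac{1}{2\pi i} \int_{\tilde\gamma} h(z)\,(z - f(\zeta))^{-1}\, dz = \frac{1}{2\pi i} \int_{\tilde\gamma} h(z)\, g_z(\zeta)\, dz,$

where, as in (5.46),

(5.55)  $g_z(\zeta) = \frac{1}{z - f(\zeta)}.$

Hence

(5.56)  $(h \circ f)(T) = \frac{1}{2\pi i} \int_\gamma (h \circ f)(\zeta)\,(\zeta - T)^{-1}\, d\zeta = \frac{1}{(2\pi i)^2} \int_\gamma \int_{\tilde\gamma} h(z)\, g_z(\zeta)\,(\zeta - T)^{-1}\, dz\, d\zeta.$

Reversing the order of integration (doing the $d\zeta$-integral first) gives

(5.57)  $(h \circ f)(T) = \frac{1}{2\pi i} \int_{\tilde\gamma} h(z)\, g_z(T)\, dz = \frac{1}{2\pi i} \int_{\tilde\gamma} h(z)\,(z - f(T))^{-1}\, dz \ \ \text{(by (5.47))} \ = h(f(T)),$

as desired.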


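EXAMPLE. For $A \in \mathcal{L}(V)$, $e^A = \exp(A)$ is given (cf. Proposition 5.13) by the
power series

(5.58)  $e^A = \displaystyle\sum_{k=0}^\infty \frac{1}{k!}\,A^k.$

Assume that $T \in \mathcal{L}(V)$ and

(5.59)  $\sigma(T) \subset \Omega \subset \mathbb{C} \setminus 0$, open, simply connected.

Then there is a holomorphic function $L : \Omega \to \mathbb{C}$ that is an analytic continuation
of log, so $e^{L(\zeta)} = \zeta$, $\zeta \in \Omega$. Proposition 5.18 then implies that

(5.60)  $e^{L(T)} = T.$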
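
The identity (5.60) can be checked numerically in a simple finite-dimensional case. The following is a minimal sketch, not part of the text: it assumes $V = \mathbb{C}^2$, takes $\Omega$ to be the disk $|\zeta - 1| < 1$, and uses the Taylor series of log about 1 for $L$, which is legitimate only because the illustrative matrix `t` satisfies $\|T - I\| < 1$. All function names are hypothetical.

```python
# Sketch: verify exp(L(T)) = T for a 2x2 matrix with ||T - I|| < 1,
# where L is the branch of log given by its Taylor series about 1.
# The matrix t and all helper names are illustrative assumptions.

def mat_mul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_scale(s, a):
    return [[s * x for x in row] for row in a]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def log_series(t, terms=200):
    """Partial sum of sum_{k>=1} (-1)^(k+1) (t - I)^k / k."""
    n = len(t)
    shift = mat_add(t, mat_scale(-1.0, identity(n)))
    power = identity(n)
    total = [[0.0] * n for _ in range(n)]
    for k in range(1, terms + 1):
        power = mat_mul(power, shift)
        total = mat_add(total, mat_scale((-1.0) ** (k + 1) / k, power))
    return total

def exp_series(a, terms=40):
    """Partial sum of the series (5.58): sum_{k<terms} a^k / k!."""
    n = len(a)
    term = identity(n)
    total = identity(n)
    for k in range(1, terms):
        term = mat_scale(1.0 / k, mat_mul(term, a))
        total = mat_add(total, term)
    return total

t = [[1.2, 0.3], [0.1, 0.9]]
recovered = exp_series(log_series(t))
error = max(abs(recovered[i][j] - t[i][j]) for i in range(2) for j in range(2))
print("max entrywise error:", error)
```

The series for log is used here only for convenience; for a general $\Omega$ as in (5.59) one would need the contour-integral definition of $L(T)$.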


Banach algebras 


Much of the material presented above on the space $\mathcal{L}(V)$ of bounded linear
operators on a Banach space $V$ has a natural extension to a more abstract setting,
that of Banach algebras. We sketch some of the basics here. A Banach algebra $\mathcal{B}$ is a
Banach space (over $\mathbb{C}$), equipped with a product (making $\mathcal{B}$ an algebra over $\mathbb{C}$),
satisfying

(5.61)  $\|xy\| \le \|x\|\,\|y\|.$

We say $\mathcal{B}$ has a unit $I$ if $I \in \mathcal{B}$ satisfies

(5.62)  $Ix = xI = x, \quad \forall\, x \in \mathcal{B}, \qquad \|I\| = 1.$

Banach algebras arise in a variety of settings. Examples include $\mathcal{L}(V)$ (with
the operator norm), $C(X)$, the space of continuous functions on a compact space $X$
(with the sup norm), the space of functions on the circle $S^1$ with absolutely
summable Fourier series,

(5.63)  $A(S^1) = \Bigl\{ f \in C(S^1) : \sum_k |\hat{f}(k)| < \infty \Bigr\}, \qquad \|f\| = \sum_k |\hat{f}(k)|,$

where $\hat{f}(k)$ are the Fourier coefficients of $f$, and many others, such as algebras
of bounded holomorphic functions on a complex domain, or closed subalgebras
of such algebras as mentioned above, and also quotients of such algebras by closed
ideals (more on which below). It is useful to have a general theory that encompasses
such examples.

Here we merely mention some basic aspects of such a theory. One theme centers
on the question of when an element $x$ of a Banach algebra $\mathcal{B}$ is invertible, i.e.,
when there exists $x^{-1} \in \mathcal{B}$ such that $x x^{-1} = x^{-1} x = I$. As for $\mathcal{L}(V)$, we start
with the following simple observation. Let $y \in \mathcal{B}$. Then

(5.64)  $\|y\| < 1 \Longrightarrow I - y$ is invertible.

In fact,

(5.65)  $(I - y)^{-1} = \displaystyle\sum_{k=0}^\infty y^k.$
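
As an illustration of (5.65), the sketch below sums the Neumann series for a $2 \times 2$ matrix and checks that the partial sum nearly inverts $I - y$. It assumes the Banach algebra is that of $2 \times 2$ matrices with the maximum-row-sum norm (a submultiplicative norm with $\|I\| = 1$); the matrix `y` and the helper names are illustrative choices, not taken from the text.

```python
# Sketch: partial sums of the Neumann series (5.65) in the algebra of
# 2x2 matrices with the max-row-sum norm. Matrix y is an arbitrary example.

def mat_mul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_neg(a):
    return [[-x for x in row] for row in a]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def row_sum_norm(a):
    """Maximum absolute row sum: the operator norm for the sup norm on C^n."""
    return max(sum(abs(x) for x in row) for row in a)

def neumann_partial_sum(y, terms):
    """Return I + y + y^2 + ... + y^(terms-1)."""
    n = len(y)
    total = identity(n)
    power = identity(n)
    for _ in range(terms - 1):
        power = mat_mul(power, y)
        total = mat_add(total, power)
    return total

y = [[0.2, 0.3], [-0.1, 0.4]]
assert row_sum_norm(y) < 1          # hypothesis of (5.64)

approx_inverse = neumann_partial_sum(y, 60)
i_minus_y = mat_add(identity(2), mat_neg(y))
# (I - y) * S_60 - I equals -y^60, whose norm is at most 0.5**60.
residual = mat_add(mat_mul(i_minus_y, approx_inverse), mat_neg(identity(2)))
print("norm of residual:", row_sum_norm(residual))
```

The geometric decay of the residual is exactly the estimate $\|y^k\| \le \|y\|^k$ that makes the series converge.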


To proceed, we have the notion of the resolvent set $\rho(x)$ and the spectrum $\sigma(x)$ of
$x \in \mathcal{B}$, defined as follows. For $\zeta \in \mathbb{C}$,

(5.66)  $\zeta \in \rho(x) \Longleftrightarrow \zeta - x$ is invertible, $\qquad \sigma(x) = \mathbb{C} \setminus \rho(x).$

Just as for $\mathcal{L}(V)$, we have that $\rho(x)$ is open, and $R_\zeta = (\zeta - x)^{-1}$ is holomorphic
in $\zeta \in \rho(x)$, and that $\sigma(x)$ is closed, bounded, and nonempty. In fact,

(5.67)  $\sigma(x) \subset \{\zeta \in \mathbb{C} : |\zeta| \le \|x\|\}.$

More specifically, if $r(x) = \sup\{|\zeta| : \zeta \in \sigma(x)\}$ denotes the spectral radius of $x$,
there is a formula generalizing that given in Exercise 9 below.
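
The formula in question is $r(x) = \lim_{k\to\infty} \|x^k\|^{1/k}$. A small numerical sketch makes the gap between $\|x\|$ in (5.67) and $r(x)$ visible. It assumes the algebra of $2 \times 2$ matrices with the maximum-row-sum norm; the matrix `t` (spectral radius $0.5$, norm $10.5$) and the helper names are illustrative assumptions.

```python
# Sketch: ||t^k||^(1/k) approaches the spectral radius, here 0.5, even
# though ||t|| itself is 10.5. Matrix and names are illustrative.

def mat_mul(a, b):
    """Product of two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def row_sum_norm(a):
    """Maximum absolute row sum (a submultiplicative matrix norm)."""
    return max(sum(abs(x) for x in row) for row in a)

def norm_roots(t, exponents):
    """Return {k: ||t^k||^(1/k)} for each k in exponents."""
    wanted = set(exponents)
    n = len(t)
    power = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    roots = {}
    for k in range(1, max(wanted) + 1):
        power = mat_mul(power, t)
        if k in wanted:
            roots[k] = row_sum_norm(power) ** (1.0 / k)
    return roots

# Upper triangular, so the spectrum is {0.5}; the large off-diagonal
# entry makes the norm a poor estimate of the spectral radius.
t = [[0.5, 10.0], [0.0, 0.5]]
roots = norm_roots(t, [1, 10, 100, 400])
for k in sorted(roots):
    print(k, roots[k])
```

For this matrix $\|t^k\| = 0.5^k (1 + 20k)$, so the $k$-th roots decrease to $0.5$ but only at the slow rate $(1 + 20k)^{1/k}$.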

One interesting class of Banach algebras, advertised above, arises as follows.
Let $\mathcal{B}$ be a Banach algebra with unit, and let $\mathcal{I} \subset \mathcal{B}$ be a closed, 2-sided, proper
ideal. Then the quotient $\mathcal{B}/\mathcal{I}$ is a Banach space, with norm

(5.68)  $\|[x]\| = \inf\{\|x - z\| : z \in \mathcal{I}\}.$

It has a product:

(5.69)
$$\begin{aligned}
[x], [y] \in \mathcal{B}/\mathcal{I} \Longrightarrow [x][y] &= (x + \mathcal{I})(y + \mathcal{I}) \\
&= xy + x\mathcal{I} + \mathcal{I}y + \mathcal{I}^2 \\
&= [xy].
\end{aligned}$$

Furthermore, one readily verifies that $\|[I]\| = 1$ (via (5.64)) and

(5.70)  $\|[x][y]\| \le \|[x]\|\,\|[y]\|,$

so $\mathcal{B}/\mathcal{I}$ is a Banach algebra. An example of this, called the Calkin algebra, arises
in the study of Fredholm operators in §7.

In fact, our only use of abstract Banach algebras in this text is to the theory of 
Fredholm operators, via the Calkin algebra. We refer to [Yo], [Lo], and [Dou] for 
further material, such as the Gel’fand theory of commutative Banach algebras and 
applications to harmonic analysis. 


Exercises 


1. Extend the $p = 2$ case of Proposition 5.1 to the following result of Schur. Let $(X, \mu)$
and $(Y, \nu)$ be measure spaces, and let $k(x, y)$ be measurable on $(X \times Y, \mu \times \nu)$. Assume
that there are measurable functions $p(x)$, $q(y)$, positive a.e. on $X$ and $Y$, respectively,
such that

(5.71)  $\displaystyle\int_X |k(x,y)|\,p(x)\,d\mu(x) \le C_1\,q(y), \qquad \int_Y |k(x,y)|\,q(y)\,d\nu(y) \le C_2\,p(x).$

Show that $Ku(x) = \int_Y k(x,y)\,u(y)\,d\nu(y)$ defines a bounded operator

$K : L^2(Y, \nu) \to L^2(X, \mu), \qquad \|K\|^2 \le C_1 C_2.$

Give an appropriate modification of the hypothesis (5.71) in order to obtain an operator
bound $K : L^p(Y, \nu) \to L^p(X, \mu)$.

2. Show that $k(x, y)$ is the integral kernel of a bounded map $K : L^2(\mathbb{R}^n_+) \to L^2(\mathbb{R}^n_+)$
provided it has support in $\{x_1, y_1 \in [0, 1]\}$ and satisfies the estimate

(5.72)  $|k(x,y)| \le C\bigl(|x' - y'|^2 + x_1^2 + y_1^2\bigr)^{-n/2}, \qquad x = (x_1, x'),\ y = (y_1, y').$

(Hint: Construct $p(x)$ and $q(y)$ so that (5.71) holds. Here, $\mathbb{R}^n_+ = \{x \in \mathbb{R}^n : x_1 > 0\}$.)

3. Show that $k(x, y)$ is the integral kernel of a bounded map $K : L^p(\mathbb{R}^n_+) \to L^p(\mathbb{R}^n_+)$,
for $1 < p < \infty$, provided it has support in $\{x_1, y_1 \in [0,1]\}$ and satisfies the estimates

$|k(x,y)| \le C x_1 \bigl(|x_1 + y_1| + |x' - y'|\bigr)^{-(n+1)}$

and

$|k(x,y)| \le C y_1 \bigl(|x_1 + y_1| + |x' - y'|\bigr)^{-(n+1)}.$

4. Let $K$ be a closed, linear subspace of a Banach space $V$; consider the natural maps
$j : K \to V$ and $\pi : V \to V/K$. Show that $j' : V' \to K'$ is surjective and that
$\pi' : (V/K)' \to V'$ has range $K^\perp$.

5. Show that the set of invertible, bounded, linear maps on a Banach space $V$ is open in
$\mathcal{L}(V)$. (Hint: If $T^{-1}$ exists, write $T + R = T(I + T^{-1}R)$.)

6. Let $X$ be a compact metric space and $F : X \to X$ a continuous map. Define
$T : C(X) \to C(X)$ by $Tu(x) = u(F(x))$. Show that $T' : \mathcal{M}(X) \to \mathcal{M}(X)$ is given by
$(T'\mu)(E) = \mu(F^{-1}(E))$, for any Borel set $E \subset X$. Using Exercise 3 of §3, show that
there is a probability measure $\mu$ on $X$ such that $T'\mu = \mu$.


7. Let $V$ be a Banach space, $\Omega \subset \mathbb{C}$ an open set. A continuous function $F : \Omega \to \mathcal{L}(V)$ is
said to be weakly holomorphic if for each $v \in V$, $\omega \in V'$, the map $z \mapsto \langle F(z)v, \omega\rangle$ is
a complex-valued holomorphic function on $\Omega$. Produce analogues of Exercises 11-13
of §4 in this setting. In particular show that, if $\overline{D_R(z_0)} \subset \Omega$, with boundary $\gamma$, then

$F(z) = \displaystyle\sum_{k=0}^\infty G_k\,(z - z_0)^k, \quad \text{for } z \in D_R(z_0),$

$G_k = \dfrac{1}{2\pi i} \displaystyle\int_\gamma F(\zeta)\,(\zeta - z_0)^{-k-1}\,d\zeta \in \mathcal{L}(V), \qquad \|G_k\| \le \sup_\gamma \|F(\zeta)\|\,R^{-k}.$

8. Given $T \in \mathcal{L}(V)$, set $r(T) = \sup\{|z| : z \in \sigma(T)\}$, the spectral radius of $T$. Show that
$1/r(T)$ is the maximum value of $R$ such that $(I - zT)^{-1}$ is holomorphic on $D_R(0)$.
Deduce that $1/r(T)$ is the radius of convergence of the power series

$\displaystyle\sum_{k=0}^\infty z^k\,T^k.$

Then deduce that

$r(T) = \limsup\limits_{k \to \infty} \|T^k\|^{1/k}.$

9. In the setting of Exercise 8, show that also

$r(T) \le \inf\limits_{k \ge 1} \|T^k\|^{1/k},$

and deduce that

$r(T) = \lim\limits_{k \to \infty} \|T^k\|^{1/k}.$

Hint. From the identity $\zeta^k - T^k = (\zeta - T)(\zeta^{k-1} + \zeta^{k-2}T + \cdots + T^{k-1})$, get the
first implication in

$\zeta \in \sigma(T) \Longrightarrow \zeta^k \in \sigma(T^k) \Longrightarrow |\zeta^k| \le \|T^k\| \Longrightarrow |\zeta| \le \|T^k\|^{1/k}.$

10. Let $W$ be a finite-dimensional subspace of the Banach space $V$. Show that there exists
a continuous linear projection $P \in \mathcal{L}(V)$ of $V$ onto $W$.

Hint. Let $\{w_1, \dots, w_\ell\}$ be a basis of $W$, $\{\omega_1, \dots, \omega_\ell\} \subset W'$ its dual basis. Use Hahn-Banach
to extend each $\omega_j$ to $\beta_j \in V'$, and consider

$Pv = \displaystyle\sum_{j=1}^{\ell} \langle v, \beta_j\rangle\,w_j.$


6. Compact operators


Throughout this section we will restrict attention to operators on Banach spaces.
An operator $T \in \mathcal{L}(V,W)$ is said to be compact provided $T$ takes any bounded
subset of $V$ to a relatively compact subset of $W$, that is, a set with compact closure.
It suffices to assume that $T(B_1)$ is relatively compact in $W$, where $B_1$ is the
closed unit ball in $V$. We denote the space of compact operators by $\mathcal{K}(V, W)$. The
following proposition summarizes some elementary facts about $\mathcal{K}(V, W)$.


Proposition 6.1. $\mathcal{K}(V,W)$ is a closed, linear subspace of $\mathcal{L}(V,W)$. Any
$T$ in $\mathcal{L}(V,W)$ with finite-dimensional range is compact. Furthermore, if
$T \in \mathcal{K}(V, W)$, $S_1 \in \mathcal{L}(V_1, V)$, and $S_2 \in \mathcal{L}(W, W_2)$, then $S_2 T S_1 \in \mathcal{K}(V_1, W_2)$.

Most of these assertions are obvious. We show that if $T_j \in \mathcal{K}(V,W)$ is norm
convergent to $T$, then $T$ is compact. Given any sequence $(x_n)$ in $B_1$, one can pick
successive subsequences on which $T_1 x_n$ converges, then $T_2 x_n$ converges, and so
on, and by a diagonal argument produce a single subsequence (which we'll still
denote $(x_n)$) such that for each $j$, $T_j x_n$ converges as $n \to \infty$. It is then easy to
show that $T x_n$ converges, giving compactness of $T$.

A particular case of Proposition 6.1 is that K(V) = K(V, V) is a closed, two- 
sided ideal of L(V). 

The following gives a useful class of compact operators. 


Proposition 6.2. If $X$ is a compact metric space, then the natural inclusion

(6.1)  $\iota : \mathrm{Lip}(X) \to C(X)$

is compact.

Proof. It is easy to show that any compact metric space has a countable, dense
subset; let $\{x_j : j = 1, 2, 3, \dots\}$ be dense in $X$. Say $(f_n)$ is a bounded sequence
in $\mathrm{Lip}(X)$. We want to prove that a subsequence converges in $C(X)$. Since
bounded subsets of $\mathbb{C}$ are relatively compact, we can pick a subsequence of $(f_n)$
converging at $x_1$; then we can pick a further subsequence of this subsequence,
converging at $x_2$, and so forth. The standard diagonal argument then produces a
subsequence (which we continue to denote $(f_n)$) converging at each $x_j$. We claim
that $(f_n)$ converges uniformly on $X$, as a consequence of the uniform estimate

(6.2)  $|f_n(x) - f_n(y)| \le K\,d(x,y),$

with $K$ independent of $n$. Indeed, pick $\varepsilon > 0$. Then pick $\delta > 0$ such that
$K\delta < \varepsilon/3$. Since $X$ is compact, we can select from $\{x_j\}$ finitely many points,
say $\{x_1, \dots, x_N\}$, such that any $x \in X$ is of distance $\le \delta$ from one of these.
Then pick $M$ so large that $f_n(x_j)$ is within $\varepsilon/6$ of its limit for $1 \le j \le N$, for all
$n \ge M$, so that any two such values differ by at most $\varepsilon/3$. Now, for any $x \in X$,
picking $\ell \in \{1, \dots, N\}$ such that $d(x, x_\ell) \le \delta$, we have, for $k \ge 0$, $n \ge M$,

(6.3)
$$\begin{aligned}
|f_{n+k}(x) - f_n(x)| &\le |f_{n+k}(x) - f_{n+k}(x_\ell)| + |f_{n+k}(x_\ell) - f_n(x_\ell)| + |f_n(x_\ell) - f_n(x)| \\
&\le K\delta + \frac{\varepsilon}{3} + K\delta < \varepsilon,
\end{aligned}$$

proving the proposition.


The argument given above is easily modified to show that $\iota : \mathrm{Lip}^\alpha(X) \to C(X)$
is compact, for any $\alpha > 0$. Indeed, there is the following more general
result. Let $\omega : X \times X \to [0, \infty)$ be any continuous function, vanishing on the
diagonal $\Delta = \{(x,x) : x \in X\}$. Fix $K \in \mathbb{R}^+$. Let $\mathcal{F}$ be any subset of $C(X)$
satisfying

(6.4)  $|u(x)| \le K, \qquad |u(x) - u(y)| \le K\,\omega(x,y).$

The latter condition is called equicontinuity. Ascoli's theorem states that such a
set $\mathcal{F}$ is relatively compact in $C(X)$ whenever $X$ is a compact Hausdorff space.
The proof is a further extension of the argument given above.

We note another refinement of Proposition 6.2, namely that the inclusion
$\iota : \mathrm{Lip}^\alpha(X) \to \mathrm{Lip}^\beta(X)$ is compact whenever $0 < \beta < \alpha < 1$, $X$ a compact
metric space. Compare results on inclusions of Sobolev spaces given in Chap. 4.

We next look at persistence of compactness upon taking adjoints. 


Proposition 6.3. If $T \in \mathcal{K}(V,W)$, then $T'$ is also compact.

Proof. Let $(\omega_n)$ be a sequence in $B_1'$, the closed unit ball in $W'$. Consider $(\omega_n)$
as a sequence of continuous functions on the compact space $X = \overline{T(B_1)}$, $B_1$
being the unit ball in $V$. Ascoli's theorem, indeed its special case, Proposition
6.2, applies; there exists a subsequence $(\omega_{n_k})$ converging uniformly on $X$. Thus
$(T'\omega_{n_k})$ is a sequence in $V'$ converging uniformly on $B_1$, hence in the $V'$-norm.
This completes the proof.

The following provides a useful improvement over the a priori statement that,
for $T \in \mathcal{K}(V,W)$, the image $T(B_1)$ of the closed unit ball $B_1 \subset V$ is relatively
compact in $W$.

Proposition 6.4. Assume $V$ is separable and reflexive. If $T : V \to W$ is compact,
then the image of the closed unit ball $B_1 \subset V$ under $T$ is compact.


Proof. From Proposition 4.4 and the remark following its proof, $B_1$, with the
weak*-topology (the $\sigma(V, V')$-topology, since $V = V''$), is a compact metric
space, granted that $V'$ is also separable, which we now demonstrate.

Indeed, for any Banach space $Y$, it is a consequence of the Hahn-Banach theorem
that $Y$ is separable provided $Y'$ is separable; see Exercise 14 of §4. If $Y$ is
reflexive, this implication can be reversed. Consequently, for $V$ reflexive and separable,
given a sequence $v_n \in B_1$, possessing a subsequence $v_n^{(1)}$ such that $T v_n^{(1)}$
converges in $W$, say to $w$, you can pass to a further subsequence $v_n^{(2)}$, which is
weak*-convergent in $V$, with limit $v \in B_1$. It follows that $T v_n^{(2)}$ is weakly convergent
to $Tv$; for any $\omega \in W'$, $T v_n^{(2)}(\omega) = v_n^{(2)}(T'\omega) \to v(T'\omega) = (Tv)(\omega)$.
Hence $Tv = w$. This shows that $T(B_1)$ is closed in $W$, and hence completes the
proof.


Remark: It is possible to drop the assumption that V is separable, via an argument 
replacing sequences by nets in order to construct the weak* limit point v. 

We next derive some results on the spectral theory of a compact operator A on 
a Hilbert space H that is self-adjoint, so A = A*. For simplicity, we will assume 
that H is separable, though that hypothesis can easily be dropped. 


Proposition 6.5. If $A \in \mathcal{L}(H)$ is compact and self-adjoint, then either $\|A\|$ or
$-\|A\|$ is an eigenvalue of $A$, that is, there exists $u \ne 0$ in $H$ such that

(6.5)  $Au = \lambda u,$

with $\lambda = \pm\|A\|$.

Proof. By Proposition 6.4, we know that the image under $A$ of the closed unit
ball in $H$ is compact, so the norm assumes a maximum on this image. Thus there
exists $u \in H$ such that

(6.6)  $\|u\| = 1, \qquad \|Au\| = \|A\|.$

Pick any unit $w \perp u$. Self-adjointness implies $\|Ax\|^2 = (A^2 x, x)$, so we have,
for all real $s$,

(6.7)  $(A^2(u + sw), u + sw) \le \|A\|^2 (1 + s^2),$

equality holding at $s = 0$. Since the left side is equal to

$\|A\|^2 + 2s\,\mathrm{Re}\,(A^2 u, w) + s^2 \|Aw\|^2,$

this inequality for $s \to 0$ implies $\mathrm{Re}\,(A^2 u, w) = 0$; replacing $w$ by $iw$ gives
$(A^2 u, w) = 0$ whenever $w \perp u$. Thus $A^2 u$ is parallel to $u$, that is, $A^2 u = cu$
for some scalar $c$; (6.6) implies $c = \|A\|^2$. Now, assuming $A \ne 0$, set
$v = \|A\|u + Au$. If $v = 0$, then $u$ satisfies (6.5) with $\lambda = -\|A\|$. If $v \ne 0$, then $v$ is
an eigenvector of $A$ with eigenvalue $\lambda = \|A\|$.


The space of $u \in H$ satisfying (6.5) is called the $\lambda$-eigenspace of $A$. Clearly,
if $A$ is compact and $\lambda \ne 0$, such a $\lambda$-eigenspace must be finite-dimensional. If
$Au_j = \lambda_j u_j$, $A = A^*$, then

(6.8)  $\lambda_1 (u_1, u_2) = (Au_1, u_2) = (u_1, Au_2) = \overline{\lambda}_2 (u_1, u_2).$

With $\lambda_1 = \lambda_2$ and $u_1 = u_2$, this implies that each eigenvalue of $A = A^*$ is
real. With $\lambda_1 \ne \lambda_2$, it then yields $(u_1, u_2) = 0$, so any distinct eigenspaces
of $A = A^*$ are orthogonal. We also note that if $Au_1 = \lambda_1 u_1$ and $v \perp u_1$, then
$(u_1, Av) = (Au_1, v) = \lambda_1 (u_1, v) = 0$, so $A = A^*$ leaves invariant the orthogonal
complement of any of its eigenspaces.

Now if $A$ is compact and self-adjoint on $H$, we can apply Proposition 6.5,
restrict $A$ to the orthogonal complement of its $\pm\|A\|$-eigenspaces (where its
norm must be strictly smaller, as a consequence of Proposition 6.5), apply the
proposition again, to this restriction, and continue. In this fashion we arrive at
the following result, known as the spectral theorem for compact, self-adjoint
operators.

Proposition 6.6. If $A \in \mathcal{L}(H)$ is a compact, self-adjoint operator on a Hilbert
space $H$, then $H$ has an orthonormal basis $u_j$ of eigenvectors of $A$. With $Au_j =
\lambda_j u_j$, $(\lambda_j)$ is a sequence of real numbers with only 0 as an accumulation point.


The spectral theorem has a more elaborate formulation for general self-adjoint 
operators. It is proved in Chap. 8. 

We next give a result that will be useful in the study of spectral theory of 
compact operators that are not self-adjoint. It will also be useful in §7. Let V, W 
and Y be Banach spaces. 


Proposition 6.7. Let $T \in \mathcal{L}(V,W)$. Suppose $K \in \mathcal{K}(V,Y)$ and

(6.9)  $\|u\|_V \le C\|Tu\|_W + C\|Ku\|_Y,$

for all $u \in V$. Then $T$ has closed range.

Proof. Let $Tu_n \to f$ in $W$. We need $v \in V$ with $Tv = f$. Let $L = \mathrm{Ker}\,T$. We
divide the argument into two cases.

If $\mathrm{dist}(u_n, L) \le a < \infty$, take $v_n = u_n$ mod $L$, $\|v_n\| \le 2a$; then $Tv_n =
Tu_n \to f$. Passing to a subsequence, we have $Kv_n \to g$ in $Y$. Then (6.9), applied
to $u = v_n - v_m$, implies that $(v_n)$ is Cauchy, so $v_n \to v$ and $Tv = f$.

If $\mathrm{dist}(u_n, L) \to \infty$, we can assume that $\mathrm{dist}(u_n, L) \ge 2$ for all $n$. Pick
$v_n = u_n$ mod $L$ such that $\mathrm{dist}(u_n, L) \le \|v_n\| \le \mathrm{dist}(u_n, L) + 1$, and set
$w_n = v_n/\|v_n\|$. Note that $\mathrm{dist}(w_n, L) \ge 1/2$. Since $\|w_n\| = 1$, we can take
a subsequence and assume $Kw_n \to g$ in $Y$. Since $Tw_n \to 0$, (6.9) applied to
$w_n - w_m$ implies $(w_n)$ is Cauchy. Thus $w_n \to w$ in $V$, and we see that simultaneously
$\mathrm{dist}(w, L) \ge 1/2$ and $Tw = 0$, a contradiction. Hence this latter case is
impossible, and the proposition is proved.


Note that Proposition 6.7 applies to the case $V = W = Y$ and $T = \zeta I - K$, for
$K \in \mathcal{K}(V)$ and $\zeta$ a nonzero scalar. Such an operator therefore has closed range.
The next result is called the Fredholm alternative.

Proposition 6.8. For $\zeta \ne 0$, $K \in \mathcal{K}(V)$, the operator $T = \zeta I - K$ is surjective
if and only if it is injective.

Proof. Assume $T$ is injective. Then $T : V \to R(T)$ is bijective. By Proposition 6.7,
$R(T)$ is a Banach space, so the open mapping theorem implies that
$T : V \to R(T)$ is a topological isomorphism. If $R(T) = V_1$ is not all of
$V$, then $V_2 = T(V_1)$, $V_3 = T(V_2)$, and so on, form a strictly decreasing family
of closed subspaces. By Lemma 1.3, we can pick $v_n \in V_n$ with $\|v_n\| = 1$,
$\mathrm{dist}(v_n, V_{n+1}) \ge 1/2$. Thus, for $n > m$,

(6.10)
$$\begin{aligned}
Kv_m - Kv_n &= \zeta v_m + \bigl[-\zeta v_n - (Tv_m - Tv_n)\bigr] \\
&= \zeta v_m + w_{mn},
\end{aligned}$$

with $w_{mn} \in V_{m+1}$. Hence $\|Kv_n - Kv_m\| \ge |\zeta|/2$, contradicting compactness
of $K$. Consequently, $T$ is surjective if it is injective.

For the converse, we use Proposition 5.7. If $T$ is surjective, (5.19) implies
$T' = \zeta I - K'$ is injective on $V'$. Since $K'$ is compact, the argument above implies
$T'$ is surjective, and hence, by (5.20), $T$ is injective.


A substantial generalization of this last result will be contained in Proposition 
7.6 and Corollary 7.7. 

It follows that every $\zeta \ne 0$ in the spectrum of a compact $K$ is an eigenvalue of
$K$. We hence derive the following result on $\sigma(K)$.

Proposition 6.9. If $K \in \mathcal{K}(V)$, the spectrum $\sigma(K)$ has only 0 as an accumulation
point.

Proof. Suppose we have linearly independent $v_n \in V$, $\|v_n\| = 1$, with $Kv_n =
\lambda_n v_n$, $\lambda_n \to \lambda \ne 0$. Let $V_n$ be the linear span of $\{v_1, \dots, v_n\}$. By Lemma 1.3,
there exist $y_n \in V_n$, $\|y_n\| = 1$, such that $\mathrm{dist}(y_n, V_{n-1}) \ge 1/2$. With $T_\lambda =
\lambda I - K$, we have, for $n > m$,

(6.11)
$$\begin{aligned}
\lambda_n^{-1} K y_n - \lambda_m^{-1} K y_m &= y_n + \bigl[-y_m - \lambda_n^{-1} T_{\lambda_n} y_n + \lambda_m^{-1} T_{\lambda_m} y_m\bigr] \\
&= y_n + z_{nm},
\end{aligned}$$

where $z_{nm} \in V_{n-1}$ since $T_{\lambda_n} y_n \in V_{n-1}$. Hence $\|\lambda_n^{-1} K y_n - \lambda_m^{-1} K y_m\| \ge 1/2$,
which contradicts compactness of $K$.

Note that if $\lambda_j \ne 0$ is such an isolated point in the spectrum $\sigma(K)$ of a compact
operator $K$, and we take $\gamma_j$ to be a small circle enclosing $\lambda_j$ but no other points
of $\sigma(K)$, then, as in (5.43), the operator

$P_j = \dfrac{1}{2\pi i} \displaystyle\int_{\gamma_j} (\zeta - K)^{-1}\,d\zeta$

is a projection onto a closed subspace $V_j$ of $V$ with the property that the restriction
of $K$ to $V_j$ (equal to $P_j K P_j$) has spectrum consisting of the one point $\{\lambda_j\}$. Thus
$V_j$ must be finite-dimensional. $K|_{V_j}$ may perhaps not be scalar; it might have a
Jordan normal form with $\lambda_j$ down the diagonal and some ones directly above the
diagonal.

Having established a number of general facts about compact operators, we take
a look at an important class of compact operators on Hilbert spaces: the Hilbert-Schmidt
operators, defined as follows. Let $H_1$ and $H_2$ be separable Hilbert spaces,
with orthonormal bases $\{u_k\}$ and $\{v_j\}$, and let $A \in \mathcal{L}(H_1, H_2)$. We say $A$ is a
Hilbert-Schmidt operator (or HS operator), and write $A \in \mathrm{HS}(H_1, H_2)$, provided

(6.12)  $\|A\|_{HS}^2 = \displaystyle\sum_k \|Au_k\|^2 < \infty.$

In case $H_1 = H_2 = H$, we say $A \in \mathrm{HS}(H)$. Note that

(6.13)
$$\begin{aligned}
\|A\|_{HS}^2 = \sum_k \|Au_k\|^2 &= \sum_{j,k} |(Au_k, v_j)|^2 \\
&= \sum_{j,k} |(u_k, A^* v_j)|^2 = \sum_j \|A^* v_j\|^2 \\
&= \|A^*\|_{HS}^2.
\end{aligned}$$

We also write

(6.14)  $\|A\|_{HS}^2 = \displaystyle\sum_{j,k} |a_{jk}|^2, \qquad a_{jk} = (Au_k, v_j).$

The defining formula (6.12) implies $\|A\|_{HS}$ is independent of the choice of
orthonormal basis $\{v_j\}$, and the end of (6.13) shows that $\|A\|_{HS}$ is also independent
of the choice of orthonormal basis $\{u_k\}$. The identity (6.12) implies

$\|BA\|_{HS} \le \|B\| \cdot \|A\|_{HS}$

if $B \in \mathcal{L}(H_2)$, with equality if $B$ is unitary. Then (6.13) gives

$$\begin{aligned}
\|AC\|_{HS} &= \|C^* A^*\|_{HS} \\
&\le \|C^*\| \cdot \|A^*\|_{HS} \\
&= \|A\|_{HS}\,\|C\|,
\end{aligned}$$

for $C \in \mathcal{L}(H_1)$, again with equality if $C$ is unitary.

From (6.12) it follows that an HS operator $A$ is a norm limit of finite-rank
operators, hence compact. If $A = A^*$, and we choose an orthonormal basis of
eigenvectors of $A$, with eigenvalues $\mu_j$, then

(6.15)  $\displaystyle\sum_j |\mu_j|^2 = \|A\|_{HS}^2.$

A compact, self-adjoint operator $A$ is HS if and only if the left side of (6.15) is
finite.

The following classical result might be called the Hilbert-Schmidt kernel theorem.
In Chap. 4 it is used as an ingredient in the proof of the celebrated Schwartz
kernel theorem.

Proposition 6.10. If $T : L^2(X_1, \mu_1) \to L^2(X_2, \mu_2)$ is HS, then there exists a
function $K \in L^2(X_1 \times X_2, \mu_1 \times \mu_2)$ such that

(6.16)  $(Tu, v)_{L^2} = \displaystyle\iint K(x_1, x_2)\,u(x_1)\,\overline{v(x_2)}\,d\mu_1(x_1)\,d\mu_2(x_2).$

Proof. Pick orthonormal bases $\{f_j\}$ for $L^2(X_1)$ and $\{g_k\}$ for $L^2(X_2)$, and set

$K(x_1, x_2) = \displaystyle\sum_{j,k} a_{jk}\,\overline{f_j(x_1)}\,g_k(x_2),$

where $a_{jk} = (Tf_j, g_k)$. The hypothesis that $T$ is HS is precisely what is necessary
to guarantee that $K \in L^2(X_1 \times X_2)$, and then (6.16) is obvious. It is also clear
that

(6.17)  $\|T\|_{HS}^2 = \|K\|_{L^2}^2.$

Also of interest is the converse, proved simply by reversing the argument:

Proposition 6.11. If $K \in L^2(X_1 \times X_2, \mu_1 \times \mu_2)$, then (6.16) defines an HS
operator $T$, satisfying (6.17).


We note that the HS-square norm polarizes to a Hilbert space inner product on
$\mathrm{HS}(H_1, H_2)$:

(6.18)  $(A, B)_{HS} = \displaystyle\sum_{j,k} a_{jk}\,\overline{b_{jk}},$

if, parallel to (6.14), $b_{jk} = (Bu_k, v_j)$. Since the norm uniquely determines the
inner product, we have without further calculation the independence of $(A, B)_{HS}$
under change of orthonormal basis; more generally, $(A, B)_{HS} = (UAV, UBV)_{HS}$
for unitary $U$ and $V$ on $H_2$ and $H_1$.

Note that $\sum_k a_{jk}\,\overline{b_{\ell k}} = c_{j\ell}$ form the matrix coefficients of $C = AB^*$, and
(6.18) is the sum of the diagonal elements of $C$; we write

(6.19)  $(A, B)_{HS} = \mathrm{Tr}\,AB^*.$


Generally, we say an operator $C \in \mathcal{L}(H)$ is trace class if it can be written as a
product of two HS operators; call them $A$ and $B^*$, and then $\mathrm{Tr}\,C$ is defined to
be given by (6.19). It is not clear at first glance that TR, the set of trace class
operators, is a linear space, but this can be seen as follows. If $C_j = A_j B_j^*$, then

(6.20)  $C_1 + C_2 = \begin{pmatrix} A_1 & A_2 \end{pmatrix} \begin{pmatrix} B_1^* \\ B_2^* \end{pmatrix}.$

Note that a given $C \in \mathrm{TR}$ may be written as a product of two HS operators in
many different ways, but the computation of $\mathrm{Tr}\,C$ is unaffected, since as we have
already seen, the definition (6.19) leads to the computation

(6.21)  $\mathrm{Tr}\,C = \displaystyle\sum_j c_{jj}, \qquad c_{jj} = (Cu_j, u_j).$

This formula shows that $\mathrm{Tr} : \mathrm{TR} \to \mathbb{C}$ is a linear map. Furthermore, by our previous
remarks on $(\ ,\ )_{HS}$, the trace formula (6.21) is independent of the choice of
orthonormal basis of $H$.

There is an intrinsic characterization of trace class operators: 


Proposition 6.12. An operator $C \in \mathcal{L}(H)$ is trace class if and only if $C$ is compact
and the operator $(C^*C)^{1/2}$ has the property that its set of eigenvalues $\{\lambda_j\}$
is summable; $\sum \lambda_j < \infty$.

Proof. Given $C$ compact, let $\{u_j\}$ be an orthonormal basis of $H$ consisting of
eigenvectors of $C^*C$, which is compact and self-adjoint. Say $C^*C u_j = \lambda_j^2 u_j$,
$\lambda_j \ge 0$. Then the identity $(C^*C)^{1/2} u_j = \lambda_j u_j$ defines $(C^*C)^{1/2}$.

Note that, for all $v \in H$,

(6.22)  $\|(C^*C)^{1/2} v\|^2 = (C^*C v, v) = \|Cv\|^2.$

Thus $Cv \mapsto (C^*C)^{1/2} v$ extends to an isometric isomorphism between the ranges
of $C$ and of $(C^*C)^{1/2}$, yielding in turn operators $V$ and $W$ of norm 1 such that

(6.23)  $C = V (C^*C)^{1/2}, \qquad (C^*C)^{1/2} = WC.$

Now, if $\sum \lambda_j < \infty$, define $A \in \mathcal{L}(H)$ by $Au_j = \lambda_j^{1/2} u_j$. Hence $A$ is Hilbert-Schmidt,
and $C = VA \cdot A$, so $C$ is trace class. Conversely, if $C = AB^*$ with
$A, B \in \mathrm{HS}$, then $(C^*C)^{1/2} = WA \cdot B^*$ is a product of HS operators, hence of
trace class. The computation (6.21), using the basis of eigenvectors of $C^*C$, then
yields $\sum \lambda_j = \mathrm{Tr}\,(C^*C)^{1/2} < \infty$, and the proof is complete.


It is desirable to establish some results about TR as a linear space. Given $C \in
\mathrm{TR}$, we define

(6.24)  $\|C\|_{TR} = \inf\{\|A\|_{HS}\,\|B\|_{HS} : C = AB^*\}.$

This is a norm; in particular,

(6.25)  $\|C_1 + C_2\|_{TR} \le \|C_1\|_{TR} + \|C_2\|_{TR}.$

This can be seen by using (6.20), with $A_2$ replaced by $tA_2$ and $B_2^*$ by $t^{-1}B_2^*$, and
minimizing over $t \in (0, \infty)$ the quantity

$$\begin{aligned}
&\|(A_1, tA_2)\|_{HS}^2 \cdot \|(B_1, t^{-1}B_2)\|_{HS}^2 \\
&\qquad = \bigl(\|A_1\|_{HS}^2 + t^2 \|A_2\|_{HS}^2\bigr) \cdot \bigl(\|B_1\|_{HS}^2 + t^{-2} \|B_2\|_{HS}^2\bigr).
\end{aligned}$$

Next, we note that (6.24) easily yields

(6.26)  $\|C^*\|_{TR} = \|C\|_{TR}$

and, for bounded $S_j$,

(6.27)  $\|S_1 C S_2\|_{TR} \le \|S_1\| \cdot \|C\|_{TR} \cdot \|S_2\|,$

with equality if $S_1$ and $S_2$ are unitary. Also, using (6.23), we have

(6.28)  $\|C\|_{TR} = \|(C^*C)^{1/2}\|_{TR}.$

Using (6.24) with $C$ replaced by $D = (C^*C)^{1/2}$, the choice $A = B = D^{1/2}$
yields

(6.29)  $\|(C^*C)^{1/2}\|_{TR} \le \|(C^*C)^{1/4}\|_{HS}^2 = \mathrm{Tr}\,(C^*C)^{1/2}.$

On the other hand, we have, by (6.19) and Cauchy's inequality,

(6.30)  $|\mathrm{Tr}\,(AB^*)| \le \|A\|_{HS}\,\|B\|_{HS},$

and hence, for $C \in \mathrm{TR}$,

(6.31)  $|\mathrm{Tr}\,C| \le \|C\|_{TR}.$

If we apply this, with $C$ replaced by $(C^*C)^{1/2}$, and compare with (6.28)-(6.29),
we have

(6.32)  $\|C\|_{TR} = \mathrm{Tr}\,(C^*C)^{1/2}.$

Either directly or as a simple consequence of this, we have

(6.33)  $\|C\|_{TR} \ge \|C\|_{HS} \ge \|C\|.$
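
In finite dimensions the three norms in (6.33) are all expressed through the singular values $s_j$ of $C$ (the eigenvalues of $(C^*C)^{1/2}$): by (6.32) the trace norm is $\sum s_j$, the HS norm is $(\sum s_j^2)^{1/2}$, and the operator norm is $\max s_j$. The sketch below checks the chain of inequalities for a real $2 \times 2$ matrix, using the closed-form eigenvalues of the symmetric matrix $C^T C$. The example matrix and function names are illustrative assumptions.

```python
# Sketch: trace, Hilbert-Schmidt, and operator norms of a real 2x2 matrix
# via its singular values. The matrix below is an arbitrary example.
import math

def singular_values_2x2(m):
    """Singular values (largest first) of a real 2x2 matrix m."""
    (a, b), (c, d) = m
    # Entries of the symmetric matrix m^T m = [[p, r], [r, q]].
    p = a * a + c * c
    q = b * b + d * d
    r = a * b + c * d
    mean = (p + q) / 2.0
    radius = math.hypot((p - q) / 2.0, r)
    lam_max = mean + radius
    lam_min = max(mean - radius, 0.0)   # guard against rounding below zero
    return math.sqrt(lam_max), math.sqrt(lam_min)

def matrix_norms(m):
    """Return the trace norm, HS norm, and operator norm of m."""
    s1, s2 = singular_values_2x2(m)
    return {
        "trace": s1 + s2,                     # Tr (C*C)^(1/2), cf. (6.32)
        "hs": math.sqrt(s1 * s1 + s2 * s2),   # square root of sum |a_jk|^2
        "op": s1,                             # largest singular value
    }

norms = matrix_norms([[3.0, 0.0], [4.0, 5.0]])
print(norms)
```

For this matrix $C^T C$ has eigenvalues 45 and 5, so the singular values are $3\sqrt{5}$ and $\sqrt{5}$, and the HS norm agrees with $\sqrt{9 + 16 + 25} = \sqrt{50}$ computed directly from the entries as in (6.14).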


We can now establish:

Proposition 6.13. Given a Hilbert space $H$, the space TR of trace class operators
on $H$ is a Banach space, with norm (6.24).

Proof. It suffices to prove completeness. Thus let $(C_j)$ be Cauchy in TR. Passing
to a subsequence, we can assume $\|C_{j+1} - C_j\|_{TR} \le 8^{-j}$. Then write $C = \sum \tilde{C}_j$,
where $\tilde{C}_1 = C_1$ and, for $j \ge 2$, $\tilde{C}_j = C_j - C_{j-1}$. By (6.33), $C$ is a bounded
operator on $H$. Write

$\tilde{C}_j = A_j B_j^*, \qquad \|A_j\|_{HS},\ \|B_j\|_{HS} \le 2^{-j}.$

Then we can form

$A = A_1 \oplus A_2 \oplus \cdots, \qquad B = B_1 \oplus B_2 \oplus \cdots \in \mathcal{L}(\mathbb{H}, H),$

where $\mathbb{H} = H \oplus H \oplus \cdots$, check that $A$ and $B$ are Hilbert-Schmidt, and note that
$C = AB^*$. Hence $C \in \mathrm{TR}$ and $C_j \to C$ in TR-norm.

The classes HS and TR are the most important cases of a continuum of ideals
$\mathcal{I}_p \subset \mathcal{L}(H)$, $1 \le p < \infty$. One says $C \in \mathcal{K}(H)$ belongs to $\mathcal{I}_p$ if and only if
$(C^*C)^{p/2}$ is trace class. Then $\mathrm{TR} = \mathcal{I}_1$ and $\mathrm{HS} = \mathcal{I}_2$. For more on this topic,
see [Si].

We next discuss the trace of an integral operator. Let $A$ and $B$ be two HS
operators on $L^2(X, \mu)$, with integral kernels $K_A, K_B \in L^2(X \times X, \mu \times \mu)$. Then
$C = AB$ is given by

(6.34)  $Cu(x) = \displaystyle\iint K_A(x,z)\,K_B(z,y)\,u(y)\,d\mu(y)\,d\mu(z),$

and we have, by (6.17) and (6.19),

(6.35)  $\mathrm{Tr}\,C = \displaystyle\iint K_A(x,z)\,K_B(z,x)\,d\mu(z)\,d\mu(x).$

Now $C$ has an integral kernel $K_C \in L^2(X \times X, \mu \times \mu)$:

(6.36)  $K_C(x,y) = \displaystyle\int K_A(x,z)\,K_B(z,y)\,d\mu(z),$

which strongly suggests the trace formula

(6.37)  $\mathrm{Tr}\,C = \displaystyle\int K_C(x,x)\,d\mu(x).$

The only sticky point is that the diagonal $\{(x,x) : x \in X\}$ may have measure 0
in $X \times X$, so one needs to define $K_C(x,y)$ carefully. The formula (6.35) implies,
via Fubini's theorem, that

$K_C(x,x) = \displaystyle\int K_A(x,z)\,K_B(z,x)\,d\mu(z)$

exists for $\mu$-almost every $x \in X$, and for this function, the identity (6.37) holds. In
many cases of interest, $X$ is a locally compact space and $K_C(x,y)$ is continuous,
and then passing from (6.35) to (6.37) is straightforward.

We next give a treatment of the determinant of $I + A$, for trace class $A$. This
is particularly useful for results on trace formulas and the scattering phase, in
Chap. 9. Our treatment largely follows [Si]; another approach can be found in
Chap. 11 of [DS].

With $\Lambda^j C$ the operator induced by $C$ on $\Lambda^j H$, we define

(6.38)  $\det(I + C) = 1 + \displaystyle\sum_{j \ge 1} \mathrm{Tr}\,\Lambda^j C.$

It is not hard to show that if $C_j = \Lambda^j C$ and $D_j = (C_j^* C_j)^{1/2}$, then $D_j =
\Lambda^j (C^*C)^{1/2}$, so

(6.39)  $\|C_j\|_{TR} = \mathrm{Tr}\,D_j = \displaystyle\sum_{i_1 < \cdots < i_j} \mu_{i_1} \cdots \mu_{i_j},$

where $\mu_i$, $i \ge 1$, are the positive eigenvalues of the compact, positive operator
$(C^*C)^{1/2}$, counted with multiplicity. In particular,

(6.40)  $\|C_j\|_{TR} \le \dfrac{1}{j!}\,\|C\|_{TR}^j,$

so (6.38) is absolutely convergent for any $C \in \mathrm{TR}$. Note that in the finite-dimensional
case, (6.38) is simply the well-known expansion of the characteristic
polynomial. Replacing $C$ by $zC$, $z \in \mathbb{C}$, we obtain an entire holomorphic function
of $z$:

(6.41)  $\det(I + zC) = 1 + \displaystyle\sum_{j \ge 1} z^j\,\mathrm{Tr}\,\Lambda^j C.$

This replacement causes $D_j$ to be replaced by $|z|^j D_j$, and (6.39) implies

(6.42)  $|\det(I + zC)| \le \det(I + |z|D) = \displaystyle\prod_{i \ge 1} (1 + \mu_i |z|),$

the latter identity following by diagonalization of the compact, self-adjoint operator
$D = (C^*C)^{1/2}$. Note that since $1 + r \le e^r$, for $r \ge 0$,

(6.43)  $\displaystyle\prod_{i \ge \ell} (1 + \mu_i |z|) \le e^{\kappa_\ell |z|}, \qquad \kappa_\ell = \sum_{i \ge \ell} \mu_i.$

Taking $\ell = 1$, we have

(6.44)  $|\det(I + zC)| \le e^{|z|\,\|C\|_{TR}}.$

Also,

(6.45)  $|\det(I + zC)| \le \Bigl\{ \displaystyle\prod_{i=1}^{\ell - 1} (1 + \mu_i |z|) \Bigr\}\, e^{\kappa_\ell |z|}, \qquad \forall\, \ell.$

Hence, for any $C \in \mathrm{TR}$,

(6.46)  $|\det(I + zC)| \le C_\varepsilon\, e^{\varepsilon |z|}, \qquad \forall\, \varepsilon > 0.$
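
In the finite-dimensional case the expansion (6.38) can be verified by hand or by machine, since for an $n \times n$ matrix $\mathrm{Tr}\,\Lambda^j C$ is the sum of the principal $j \times j$ minors of $C$. The following sketch, with an arbitrary integer $3 \times 3$ matrix and hypothetical helper names, compares $1 + \sum_j \mathrm{Tr}\,\Lambda^j C$ with the ordinary determinant of $I + C$.

```python
# Sketch: check det(I + C) = 1 + sum_j Tr(Lambda^j C) for a 3x3 integer
# matrix, using Tr(Lambda^j C) = sum of principal j x j minors.
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    n = len(m)
    if n == 0:
        return 1
    if n == 1:
        return m[0][0]
    total = 0
    for j in range(n):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total

def trace_exterior_power(c, j):
    """Sum of the principal j x j minors of c, i.e. the trace of Lambda^j c."""
    n = len(c)
    return sum(det([[c[r][s] for s in idx] for r in idx])
               for idx in combinations(range(n), j))

def det_via_expansion(c):
    """Right side of (6.38); the sum stops at j = n in dimension n."""
    n = len(c)
    return 1 + sum(trace_exterior_power(c, j) for j in range(1, n + 1))

c = [[2, -1, 0], [3, 1, 4], [0, 5, -2]]
i_plus_c = [[c[i][j] + (1 if i == j else 0) for j in range(3)] for i in range(3)]
print(det(i_plus_c), det_via_expansion(c))
```

Here $\mathrm{Tr}\,C = 1$, the principal $2 \times 2$ minors sum to $-21$, and $\det C = -50$, so both sides equal $-69$.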
Proposition 6.14. We have a continuous map $F : \mathrm{TR} \to \mathbb{C}$, given by

$F(A) = \det(I + A).$

Proof. For fixed $C, D \in \mathrm{TR}$, $g(z) = F(C + zD)$ is holomorphic, as one sees
from (6.40) and (6.41). Now consider

(6.47)  $h(z) = F\bigl(\tfrac{1}{2}(A + B) + z(A - B)\bigr).$

Then

(6.48)  $|F(A) - F(B)| = \bigl|h(\tfrac{1}{2}) - h(-\tfrac{1}{2})\bigr| \le R^{-1} \sup\limits_{|z| \le R + 1/2} |h(z)|.$

In turn, we can estimate $|h(z)|$ using (6.45). If we take $R = \|A - B\|_{TR}^{-1}$, we get

(6.49)  $|F(A) - F(B)| \le \|A - B\|_{TR}\,\exp\{\|A\|_{TR} + \|B\|_{TR} + 1\},$

which proves the proposition.
One use of Proposition 6.14 is as a tool to prove the following.

Proposition 6.15. For each $A, B \in \mathrm{TR}$,

(6.50)  $\det\bigl((I + A)(I + B)\bigr) = \det(I + A) \cdot \det(I + B).$

Proof. By Proposition 6.14, it suffices to prove (6.50) when $A$ and $B$ are finite
rank operators, having matrix entries $a_{jk}$ and $b_{jk}$ that vanish unless $j, k \le N$, for
some $N$, in which case it is elementary.

The following is an important consequence of (6.50).

Proposition 6.16. Given $A \in \mathrm{TR}$, we have

(6.51)  $I + A$ invertible $\Longleftrightarrow \det(I + A) \ne 0.$

Proof. If $I + A$ is invertible, the inverse has the form

(6.52)  $(I + A)^{-1} = I + B, \qquad B = -A(I + A)^{-1} \in \mathrm{TR}.$

Hence (6.50) implies $\det(I + A)\,\det(I + B) = 1$, so $\det(I + A) \ne 0$.

For the converse, assume $I + A$ is not invertible, so $-1 \in \mathrm{Spec}(A)$. Since $A$
is compact, we can consider the associated spectral projection $P$ of $H$ onto the
generalized $(-1)$-eigenspace of $A$. Since $(AP)\,A(I - P) = 0$, we have

(6.53)  $\det(I + A) = \det(I + AP) \cdot \det(I + A(I - P)).$

It is elementary that $\det(I + AP) = 0$, so the proposition is proved.

As another application of (6.50), we can use the identity

(6.54)  $I + A + sB = (I + A)\bigl(I + s(I + A)^{-1}B\bigr)$

to show that

(6.55)  $\dfrac{d}{ds} \det(I + A(s)) = \det(I + A(s)) \cdot \mathrm{Tr}\bigl((I + A(s))^{-1} A'(s)\bigr),$

when $A(s)$ is a differentiable function of $s$ with values in TR.


Exercises 


1. If $A$ is a Hilbert-Schmidt operator, show that

$\|A\| \le \|A\|_{HS},$

where the left side denotes the operator norm. (Hint: Pick unit $u_1$ such that $\|Au_1\| \ge
\|A\| - \varepsilon$, and make that part of an orthonormal basis.)

2. Suppose $K \in L^2(X \times X, \mu \times \mu)$ satisfies $K(x,y) = \overline{K(y,x)}$. Show that

$K(x,y) = \displaystyle\sum_j c_j\,u_j(x)\,\overline{u_j(y)},$

with $\{u_j\}$ an orthonormal set in $L^2(X, \mu)$, $c_j \in \mathbb{R}$, and $\sum c_j^2 < \infty$.
(Hint: Apply the spectral theorem for compact, self-adjoint operators.)

3. Define $T : L^2(I) \to L^2(I)$, $I = [0,1]$, by

$Tf(x) = \displaystyle\int_0^x f(y)\,dy.$

Show that $T$ has range $R(T) \subset \{u \in C(I) : u(0) = 0\}$. Show that $T$ is compact, that
$T$ has no eigenvectors, and that $\sigma(T) = \{0\}$. Also, show that $T$ is HS, but not trace
class.

4. Let $K$ be a closed bounded subset of a Banach space $B$. Suppose $T_j$ are compact
operators on $B$ and $T_j x \to x$ for each $x \in B$. Show that $K$ is compact if and only if
$T_j \to I$ uniformly on $K$.

5. Prove the following result, also known as part of Ascoli's theorem. If $X$ is a compact
metric space, $B_j$ are Banach spaces, and $K : B_1 \to B_2$ is a compact operator, then
$\kappa f(x) = K(f(x))$ defines a compact map $\kappa : C^\alpha(X, B_1) \to C(X, B_2)$, for any
$\alpha > 0$.

6. Let $B$ be a bounded operator on a Hilbert space $H$, and let $A$ be trace class. Show that

$\mathrm{Tr}(AB) = \mathrm{Tr}(BA).$

(Hint: Write $A = A_1 A_2$ with $A_j \in \mathrm{HS}$.)

7. Given a Hilbert space $H$, define $\Lambda^j H$ as a Hilbert space and justify (6.39). Also, check
the finite rank case of (6.50).

8. Assume $\{u_j : j \ge 1\}$ is an orthonormal basis of the Hilbert space $H$, and let $P_n$ denote
the orthogonal projection of $H$ onto the span of $\{u_1, \dots, u_n\}$. Show that if $A \in \mathrm{TR}$,
then $P_n A P_n \to A$ in TR-norm. (This is used implicitly in the proof of Proposition
6.15.)


7. Fredholm operators


Again in this section we restrict attention to operators on Banach spaces. An
operator $T \in \mathcal{L}(V, W)$ is said to be Fredholm provided


(7.1) Ker T is finite-dimensional 
and 
(7.2) T(V) is closed in W, of finite codimension, 


that is, $W/T(V)$ is finite-dimensional. We say $T$ belongs to $\mathrm{Fred}(V, W)$. We define the index of $T$ to be

(7.3) $\operatorname{Ind} T = \dim \operatorname{Ker} T - \dim W/T(V)$,

the last term also denoted $\operatorname{Codim} T(V)$. The following results on $\operatorname{Ker} T$ and $T(V)$ for Fredholm $T$ will be of use.


Lemma 7.1. If (7.1) holds, there exists a continuous projection $P_0 \in \mathcal{L}(V)$ of $V$ onto $\operatorname{Ker} T$. Hence

$V = \operatorname{Ker} T \oplus V_1$,

with $V_1 = R(P_1) = R(I - P_0)$. If (7.2) holds, there exists a finite-dimensional linear space $W_0 \subset W$ such that

$W = T(V) \oplus W_0$.

Proof. The first part is a special case of Exercise 10 of §5. As for the second part, if $\ell = \dim W/T(V)$, take a basis of $W/T(V)$ and pick preimages $\{w_1, \dots, w_\ell\}$ in $W$. Then set $W_0 = \operatorname{Span}\{w_1, \dots, w_\ell\}$. We have the continuous bijection $T(V) \oplus W_0 \to W$, which, by the open mapping theorem, is a topological isomorphism.

We next relate Fredholm properties of $T$ to those of $T'$. So assume $T \in \mathrm{Fred}(V, W)$. Note the isomorphism $(W/T(V))' \approx T(V)^\perp$. By (5.19), $T(V)^\perp = \operatorname{Ker} T'$. Consequently,

(7.4) $\operatorname{Ind} T = \dim \operatorname{Ker} T - \dim \operatorname{Ker} T'$.

Furthermore, Proposition 5.7 implies $T'(W')$ is closed in $V'$ and $(\operatorname{Ker} T)^\perp = T'(W')$. Hence we have

$(\operatorname{Ker} T)' \approx V'/(\operatorname{Ker} T)^\perp = V'/T'(W')$,

the first isomorphism via the Hahn-Banach theorem. We deduce the following.

Proposition 7.2. If $T$ is Fredholm, $T' \in \mathcal{L}(W', V')$ is also Fredholm, and

(7.5) $\operatorname{Ind} T' = -\operatorname{Ind} T$.

The following is a useful characterization of Fredholm operators. 


Proposition 7.3. Let $T \in \mathcal{L}(V, W)$. Then $T$ is Fredholm if and only if there exist $S_j \in \mathcal{L}(W, V)$ such that

(7.6) $S_1 T = I + K_1$,

and

(7.7) $T S_2 = I + K_2$,

with $K_1$ and $K_2$ compact.


Proof. The identity (7.6) implies $\operatorname{Ker} T \subset \operatorname{Ker}(I + K_1)$, which is finite-dimensional. Also (7.6) implies $\|v\| \le \|S_1 T v\| + \|K_1 v\|$, so by Proposition 6.7, $T$ has closed range. On the other hand, (7.7) implies $T(V)$ contains the range of $I + K_2$, and we know from §6 that $X = R(I + K_2)$ is closed in $W$ and $X^\perp \approx \operatorname{Ker}(I + K_2')$, which is finite-dimensional.


For the converse, assume $T$ is Fredholm, and write the decompositions in Lemma 7.1 as

$V = \operatorname{Ker} T \oplus V_1 = R(P_0) \oplus R(P_1)$,
$W = T(V) \oplus W_0 = R(Q_1) \oplus R(Q_0)$,

where $P_j$ and $Q_j$ are the associated projections. We have the isomorphism

$\widetilde{T} = T|_{V_1} : V_1 \to T(V)$.

Hence we define $S : W \to V$ by

$S = \widetilde{T}^{-1} Q_1$.

This gives

$TS = Q_1 = I - Q_0$,

and

$ST = STP_1 = P_1 = I - P_0$,

since $TP_1$ has range in $R(Q_1)$. Thus we have (7.6)-(7.7), with $S_j = S$, $K_1 = -P_0$, $K_2 = -Q_0$.


The maps $S_j$ in Proposition 7.3 are called Fredholm inverses of $T$. Note that, by virtue of the identity

(7.8) $S_1(I + K_2) = S_1 T S_2 = (I + K_1) S_2$,

we see that whenever (7.6) and (7.7) hold, $S_1$ and $S_2$ must differ by a compact operator. Thus the Fredholm inverse is uniquely determined, up to the addition of a compact operator.

The following result is an immediate consequence of the characterization of 
the space Fred(V, W) by (7.6)-(7.7). 


Corollary 7.4. If $T \in \mathrm{Fred}(V, W)$ and $K : V \to W$ is compact, then $T + K \in \mathrm{Fred}(V, W)$. If also $T_2 \in \mathrm{Fred}(W, X)$, then $T_2 T \in \mathrm{Fred}(V, X)$.


Proposition 7.3 also makes it natural to consider the quotient space $Q(V) = \mathcal{L}(V)/\mathcal{K}(V)$. Recall that $\mathcal{K}(V)$ is a closed, two-sided ideal of $\mathcal{L}(V)$. Thus the quotient is a Banach space, and in fact a Banach algebra. It is called the Calkin algebra. One has the natural algebra homomorphism $\pi : \mathcal{L}(V) \to Q(V)$, and a consequence of Proposition 7.3 is that $T \in \mathcal{L}(V)$ is Fredholm if and only if $\pi(T)$ is invertible in $Q(V)$. For general $T \in \mathrm{Fred}(V, W)$, the operators $S_1 T$ and $T S_2$ in (7.6) and (7.7) project to the identity in $Q(V)$ and $Q(W)$, respectively. Now the argument made in §5 that the set of invertible elements of $\mathcal{L}(V)$ is open, via Proposition 5.11, applies equally well when $\mathcal{L}(V)$ is replaced by any Banach algebra with unit. Applying it to the Calkin algebra, we have the following:


Proposition 7.5. Fred(V, W) is open in L(V, W). 


We now establish a fundamental result about the index of Fredholm operators.

Proposition 7.6. The index map

(7.9) $\operatorname{Ind} : \mathrm{Fred}(V, W) \to \mathbb{Z}$

defined by (7.3) is constant on each connected component of $\mathrm{Fred}(V, W)$.

Proof. Let $T \in \mathrm{Fred}(V, W)$. It suffices to show that if $S \in \mathcal{L}(V, W)$ and if $\|T - S\|$ is small enough, then $\operatorname{Ind} S = \operatorname{Ind} T$. As in Lemma 7.1, we can pick a closed subspace $V_1 \subset V$, complementary to $\operatorname{Ker} T$, and a (finite-dimensional) $W_0 \subset W$, complementary to $T(V)$, so that

(7.10) $V = V_1 \oplus \operatorname{Ker} T, \quad W = T(V) \oplus W_0$.

Given $S \in \mathcal{L}(V, W)$, define

(7.11) $\tau_S : V_1 \oplus W_0 \to W, \quad \tau_S(v, w) = Sv + w$.

The map $\tau_T$ is an isomorphism of Banach spaces. Thus $\|T - S\|$ small implies $\tau_S$ is an isomorphism of $V_1 \oplus W_0$ onto $W$. We restrict attention to such $S$, lying in the same component of $\mathrm{Fred}(V, W)$ as $T$.

Note that $\tau_S(V_1)$ is closed in $W$, of codimension equal to $\dim W_0$; now $\tau_S(V_1) = S(V_1)$, so we have the semicontinuity property

(7.12) $\operatorname{Codim} S(V) \le \operatorname{Codim} T(V)$.

We also see that $\operatorname{Ker} S \cap V_1 = 0$. Thus we can write

$V = \operatorname{Ker} S \oplus Z \oplus V_1$,

for a finite-dimensional $Z \subset V$. $S$ is injective on $Z \oplus V_1$, taking it to $S(V) = S(Z) \oplus S(V_1)$, closed in $W$, of finite codimension. It follows that

(7.13) $\operatorname{Codim} S(V) = \operatorname{Codim} T(V) - \dim S(Z)$,

while

(7.14) $\dim \operatorname{Ker} S + \dim Z = \dim \operatorname{Ker} T$.

Since $S(Z)$ and $Z$ have the same dimension, this gives the desired identity, namely $\operatorname{Ind} S = \operatorname{Ind} T$.
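
As a sanity check on Proposition 7.6, consider the finite-dimensional case, where every linear map is Fredholm and $\mathcal{L}(V, W)$ is connected. The rank-nullity theorem then gives $\operatorname{Ind} T = \dim V - \dim W$ for every $T$, whatever its rank. The following Python sketch illustrates this with exact rational arithmetic; the helper names `rank` and `index` are ours, introduced only for this illustration.

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix (given as a list of rows) by exact Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_rows = len(rows)
    n_cols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(n_cols):
        pivot = next((i for i in range(r, n_rows) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, n_rows):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == n_rows:
            break
    return r

def index(matrix):
    """Ind T = dim Ker T - dim W/T(V), for T : V -> W given as an m x n matrix."""
    m, n = len(matrix), len(matrix[0])
    rk = rank(matrix)
    dim_ker = n - rk        # rank-nullity
    codim_range = m - rk    # dimension of W / T(V)
    return dim_ker - codim_range

T = [[1, 2, 3],
     [2, 4, 6]]            # a rank-one map from a 3-dimensional V to a 2-dimensional W
print(rank(T), index(T))   # prints "1 1": the index is dim V - dim W = 3 - 2
```

Changing the entries of `T` changes the rank, and hence both terms in (7.3), but never their difference.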


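Corollary 7.7. If $T \in \mathrm{Fred}(V, W)$ and $K \in \mathcal{K}(V, W)$, then $T + K$ and $T$ have the same index.

Proof. For $s \in [0, 1]$, $T + sK \in \mathrm{Fred}(V, W)$.

The next result rounds out a useful collection of tools in the study of index theory.

Proposition 7.8. If $T \in \mathrm{Fred}(V, W)$ and $S \in \mathrm{Fred}(W, X)$, then

(7.15) $\operatorname{Ind} ST = \operatorname{Ind} S + \operatorname{Ind} T$.

Proof. Consider the following family of operators in $\mathcal{L}(V \oplus W, W \oplus X)$:

(7.16) $\begin{pmatrix} I & 0 \\ 0 & S \end{pmatrix} \begin{pmatrix} \cos t & \sin t \\ -\sin t & \cos t \end{pmatrix} \begin{pmatrix} T & 0 \\ 0 & I \end{pmatrix}$,

the middle factor belonging to $\mathcal{L}(W \oplus W)$. For each $t \in \mathbb{R}$, this is Fredholm. For $t = 0$, it is

$\begin{pmatrix} T & 0 \\ 0 & S \end{pmatrix}$,

of index $\operatorname{Ind} T + \operatorname{Ind} S$, while for $t = -\pi/2$, it is

$\begin{pmatrix} 0 & -I \\ ST & 0 \end{pmatrix}$,

of index $\operatorname{Ind} ST$. The identity of these two quantities now follows from Proposition 7.6.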
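
For a concrete instance of (7.15), consider the unilateral shift on $\ell^2$, a standard example that is not taken from the text above; here $e_0 = (1, 0, 0, \dots)$ and $P_0$ denotes the orthogonal projection onto $\operatorname{Span}\{e_0\}$.

```latex
\[
S(x_0, x_1, x_2, \dots) = (0, x_0, x_1, \dots), \qquad
S^*(x_0, x_1, x_2, \dots) = (x_1, x_2, x_3, \dots).
\]
\[
\operatorname{Ker} S = 0,\quad \operatorname{Codim} S(\ell^2) = 1
\ \Longrightarrow\ \operatorname{Ind} S = -1,
\qquad
\operatorname{Ker} S^* = \operatorname{Span}\{e_0\},\quad S^*(\ell^2) = \ell^2
\ \Longrightarrow\ \operatorname{Ind} S^* = 1.
\]
\[
S^* S = I \ \Longrightarrow\ \operatorname{Ind}(S^* S) = 0 = \operatorname{Ind} S^* + \operatorname{Ind} S,
\qquad
S S^* = I - P_0 \ \Longrightarrow\ \operatorname{Ind}(S S^*) = 1 - 1 = 0.
\]
```

The same computation shows that $S^*$ is a Fredholm inverse of $S$ in the sense of (7.6)-(7.7), with $K_1 = 0$ and $K_2 = -P_0$.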


Analytic Fredholm theory 


The following analytic Fredholm theorem is of frequent use in analysis, in such areas as scattering theory and the theory of layer potentials. (See Proposition 7.4 in Chapter 9.) Let $U \subset \mathbb{C}$ be an open, connected set, and let $A(z)$ be a holomorphic family of linear operators on a Hilbert space $H$, of the form $I + K(z)$, where, for all $z \in U$, $K(z) \in \mathcal{K}(H)$. Assume there exists $z_0 \in U$ such that $A(z_0)$ is invertible. The result is that

(7.17) $S = \{z \in U : A(z) \text{ is not invertible}\}$

is a discrete subset of U.
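
A finite-dimensional illustration (our own, under the assumption $H = \mathbb{C}^N$, where every operator is compact) may help fix ideas. Take $U = \mathbb{C}$ and a fixed matrix $K \in M(N, \mathbb{C})$:

```latex
\[
A(z) = I + zK, \qquad A(0) = I \ \text{invertible},
\]
\[
S = \{ z \in \mathbb{C} : \det(I + zK) = 0 \}
  = \{ -1/\lambda : \lambda \ \text{a nonzero eigenvalue of } K \},
\]
```

so $S$ consists of at most $N$ points. In this case the set of non-invertibility is the zero set of the polynomial $\det(I + zK)$, which is not identically zero because it equals $1$ at $z = 0$.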

This can be extended to the following more flexible result. Namely, let $A(z)$ be a holomorphic function of $z \in U$ with values in $\mathcal{L}(H)$, and assume $A(z)$ is Fredholm for each $z \in U$. As long as $A(z_0)$ is invertible for some $z_0 \in U$, it again follows that $S$ is a discrete subset of $U$. As we will see below, a proof can be obtained by reduction to the previous case.

Here we not only establish such a result, but work in the setting of multidimensional analytic Fredholm theory. We establish the following.

Theorem 7.9. Let $U \subset \mathbb{C}^n$ be an open, connected set, and let $A(z)$ be a holomorphic family on $U$, with values in $\mathcal{L}(H)$, such that $A(z)$ is Fredholm for each $z \in U$. Assume $A(z_0)$ is invertible for some $z_0 \in U$, and let $S$ be given by (7.17). Then $S$ is either empty or a local complex-analytic subvariety of $U$, of complex codimension one.


The proof has two parts. The first is a reduction to the case where $A(z)$ has the form $I + K(z)$, with $K(z) \in \mathcal{K}(H)$, and the second is a proof in that case.

To begin the proof, we may as well assume $S \ne \emptyset$. Let $\mathcal{O} = U \setminus S$, which is open in $U$, and let $z_1 \in S$ be a boundary point of $\mathcal{O}$. It suffices to demonstrate the following.

Proposition 7.10. There exists a polydisk $B \subset U$, centered at $z_1$, such that $S \cap B$ is a complex-analytic subvariety of $B$, of complex codimension one.


To proceed, take a Fredholm inverse $B$ to $A(z_1)$, so $A(z_1)B - I$ and $BA(z_1) - I$ are compact. Since $A(z_1)$ is a norm limit of invertible operators, we know $A(z_1)$ has index 0, hence so does $B$. Adding a finite-rank operator to $B$ if necessary, we can arrange that $B$ is invertible. Now work with

(7.18) $C(z) = BA(z)$,

so $C(z_1) = I + K$, $K$ compact. Denote by $\widetilde{C}(z)$ the image of $C(z)$ under the natural projection

(7.19) $\pi : \mathcal{L}(H) \to \mathcal{L}(H)/\mathcal{K}(H)$.

We have $\widetilde{C}(z)$ holomorphic and invertible in $\mathcal{L}(H)/\mathcal{K}(H)$, with $\widetilde{C}(z_1) = I$, hence

(7.20) $\widetilde{C}(z) = I + D(z), \quad D(z_1) = 0$.

Hence, on some polydisk $B$ centered at $z_1$, we have a convergent power series

(7.21) $\widetilde{C}(z)^{-1} = I + \sum_{\alpha > 0} \widetilde{C}_\alpha (z - z_1)^\alpha, \quad \widetilde{C}_\alpha \in \mathcal{L}(H)/\mathcal{K}(H)$.

Here $\alpha = (\alpha_1, \dots, \alpha_n)$ is a multi-index and $w^\alpha = w_1^{\alpha_1} \cdots w_n^{\alpha_n}$. Now pick

(7.22) $C_\alpha \in \mathcal{L}(H), \quad \pi(C_\alpha) = \widetilde{C}_\alpha, \quad \|C_\alpha\| \le 2\|\widetilde{C}_\alpha\|$.

Then

(7.23) $E(z) = I + \sum_{\alpha > 0} C_\alpha (z - z_1)^\alpha$

is a convergent power series on $B$, with values in $\mathcal{L}(H)$. We have

(7.24) $\pi(E(z)C(z)) = \widetilde{C}(z)^{-1}\widetilde{C}(z) = I$ in $\mathcal{L}(H)/\mathcal{K}(H)$,

hence

(7.25) $E(z)C(z) = I + K(z), \quad K(z) \in \mathcal{K}(H)$.

Note that (7.23) guarantees $E(z)$ is invertible, at least on some smaller polydisk $B^0$, centered at $z_1$. Hence $A(z)$ and $I + K(z)$ are simultaneously invertible for each $z \in B^0$.

There remains the task of proving Proposition 7.10 in case

(7.26) $A(z) = I + K(z), \quad K(z) \in \mathcal{K}(H)$.

We do this by using an attack suggested (in the one-variable case) in [Mel] (p. 17, footnote 7). Write

(7.27) $K(z) = K_0 + \sum_{\alpha > 0} K_\alpha (z - z_1)^\alpha, \quad K_0 = K_0^\# + K_0^b$,

with

(7.28) $K_0^\#$ finite rank, $\|K_0^b\| \le \tfrac{1}{4}$.

Also pick $\delta > 0$ so small that $B_\delta(z_1) = \{z : |z - z_1| < \delta\} \subset B^0$ and

(7.29) $|z - z_1| < \delta \ \Longrightarrow\ \Bigl\| \sum_{\alpha > 0} K_\alpha (z - z_1)^\alpha \Bigr\| \le \tfrac{1}{4}$.

Then

(7.30) $L(z) = K_0^b + \sum_{\alpha > 0} K_\alpha (z - z_1)^\alpha \ \Longrightarrow\ \|L(z)\| \le \tfrac{1}{2}$,

for $|z - z_1| < \delta$, and we have

(7.31) $A(z) = I + L(z) + K_0^\# = (I + L(z))(I + K^\#(z))$,

where

(7.32) $K^\#(z) = (I + L(z))^{-1} K_0^\#$.

We see that, for $|z - z_1| < \delta$, $A(z)$ and $I + K^\#(z)$ are simultaneously invertible. Now since $K_0^\#$ has finite rank, $K^\#(z)$ is trace class, and

(7.33) $\vartheta(z) = \det(I + K^\#(z))$

is holomorphic in $|z - z_1| < \delta$. By Proposition 6.16 in this appendix, $I + K^\#(z)$ is invertible for such $z$ if and only if $\vartheta(z) \ne 0$. Hence

(7.34) $\{z : |z - z_1| < \delta\} \cap S = \{z \in B_\delta(z_1) : \vartheta(z) = 0\}$.

This proves Proposition 7.10, hence Theorem 7.9.


Benjamin Delarue has pointed out (personal communication) that one can take this argument a step further, and analyze $A(z)^{-1}$ as a meromorphic operator-valued function on $U$, with finite-rank singularities on $S$. Following his suggestion, we present such a derivation.

Take $z_1 \in S$. As seen above, we can alter $A(z)$ to have the form (7.26), with $K : \Omega \to \mathcal{K}(H)$, holomorphic on a neighborhood $\Omega$ of $z_1$ in $U$. We have $-1$ as an isolated point in $\sigma(K(z_1))$. Hence we can take $\delta > 0$ such that

(7.35) $D_\delta(-1) = \{\zeta \in \mathbb{C} : |\zeta + 1| \le \delta\}$

contains only $-1$ in $\sigma(K(z_1))$. Then we can take $a > 0$ such that

(7.36) $\gamma = \partial D_\delta(-1)$ is disjoint from $\sigma(K(z))$, for $z \in B_a(z_1)$.

Then we form the holomorphic family of projections

(7.37) $P(z) = \frac{1}{2\pi i} \int_\gamma (\zeta - K(z))^{-1}\, d\zeta, \quad z \in B_a(z_1)$.

We have direct sum decompositions (not necessarily orthogonal)

(7.38) $H = E_{0z} \oplus E_{1z}, \quad E_{0z} = R(P(z)), \quad E_{1z} = \operatorname{Ker} P(z)$,

yielding

(7.39) $A(z) = A_0(z) \oplus A_1(z), \quad A_j(z) = A(z)|_{E_{jz}}$,

the operators $A_j(z)$ depending holomorphically on $z$, and

(7.40) $A(z)^{-1} = A_0(z)^{-1} \oplus A_1(z)^{-1}, \quad z \in B_a(z_1) \setminus S$,

with $A_1(z)$ invertible for all $z \in B_a(z_1)$.

Now each space $E_{0z}$ has dimension $d = \dim \operatorname{Ker} A(z_1)$. If $\{v_1, \dots, v_d\}$ is a basis of $\operatorname{Ker} A(z_1)$, then, after perhaps shrinking $a$, we have that

(7.41) $\{v_j(z) = P(z)v_j : 1 \le j \le d\}$ is a basis of $E_{0z}$, for $z \in B_a(z_1)$.

We also denote by

(7.42) $A_0(z) \in M(d, \mathbb{C})$

the matrix of $A_0(z)$ with respect to this basis. Then $A_0(z)$ is invertible on $B_a(z_1) \setminus S$, and Cramer's formula gives

(7.43) $A_0(z)^{-1} = (\det A_0(z))^{-1} C_0(z)$,

with $C_0 : B_a(z_1) \to M(d, \mathbb{C})$ holomorphic. This presents

(7.44) $A(z)^{-1} = (\det A_0(z))^{-1} C_0(z) \oplus A_1(z)^{-1}, \quad z \in B_a(z_1) \setminus S$,

and establishes the following.

Proposition 7.11. In the setting of Theorem 7.9, each $z_1 \in S$ has a neighborhood $B_a(z_1) \subset U$ on which $A(z)^{-1}$ is the product of an invertible holomorphic operator-valued function and a meromorphic operator-valued function of the form (7.44), with $A_0$, $C_0$, and $A_1$ holomorphic on $B_a(z_1)$, $A_0(z) \in M(d, \mathbb{C})$ invertible on $B_a(z_1) \setminus S$, and $C_0(z)$ of rank $d = \dim \operatorname{Ker} A(z_1)$.


Exercises 


Exercises 1-4 may be compared to Exercises 3-7 in Chap. 4, §3. Let $H$ denote the subspace of $L^2(S^1)$ that is the range of the projection $P$:

$Pf(\theta) = \sum_{n=0}^{\infty} \hat{f}(n) e^{in\theta}$.

Given $\varphi \in C(S^1)$, define the "Toeplitz operator" $T_\varphi : H \to H$ by $T_\varphi u = P(\varphi u)$. Clearly, $\|T_\varphi\| \le \|\varphi\|_{\sup}$.

1. By explicit calculation, for $\varphi(\theta) = E_k(\theta) = e^{ik\theta}$, show that

$T_{E_k} T_{E_\ell} - T_{E_k E_\ell}$ is compact on $H$.

2. Show that, for any $\varphi, \psi \in C(S^1)$, $T_\varphi T_\psi - T_{\varphi\psi}$ is compact on $H$. (Hint: Approximate $\varphi$ and $\psi$ by linear combinations of exponentials.)

3. Show that if $\varphi \in C(S^1)$ is nowhere vanishing, then $T_\varphi : H \to H$ is Fredholm. (Hint: Show that a Fredholm inverse is given by $T_\psi$, $\psi(\theta) = \varphi(\theta)^{-1}$.)

4. A nowhere-vanishing $\varphi \in C(S^1)$ is said to have degree $k \in \mathbb{Z}$ if $\varphi$ is homotopic to $E_k(\theta) = e^{ik\theta}$, through continuous maps of $S^1$ to $\mathbb{C} \setminus 0$. Show that this implies

$\operatorname{Index} T_\varphi = \operatorname{Index} T_{E_k}$.

Compute this index by explicitly describing $\operatorname{Ker} T_{E_k}$ and $\operatorname{Ker} T_{E_k}^*$. Show that the calculation can be reduced to the case $k = 1$.


8. Unbounded operators


Here we consider unbounded linear operators on Banach spaces. Such an operator $T$ between Banach spaces $V$ and $W$ will not be defined on all of $V$, though for simplicity we write $T : V \to W$. The domain of $T$, denoted $D(T)$, will be some linear subspace of $V$. Generalizing (5.17), we consider the graph of $T$:

(8.1) $G_T = \{(v, Tv) \in V \oplus W : v \in D(T)\}$.

Then $G_T$ is a linear subspace of $V \oplus W$; if $G_T$ is closed in $V \oplus W$, we say $T$ is a closed operator. By the closed-graph theorem, if $T$ is closed and $D(T) = V$, then $T$ is bounded. If $T$ is a linear operator, the closure $\overline{G_T}$ of its graph may or may not be the graph of an operator. If it is, we write $\overline{G_T} = G_{\overline{T}}$ and call $\overline{T}$ the closure of $T$.

For a linear operator $T : V \to W$ with dense domain $D(T)$, we define the adjoint $T' : W' \to V'$ as follows. There is the identity

(8.2) $\langle Tv, w' \rangle = \langle v, T'w' \rangle$,

for $v \in D(T)$, $w' \in D(T') \subset W'$. We define $D(T')$ to be the set of $w' \in W'$ such that the map $v \mapsto \langle Tv, w' \rangle$ extends from $D(T) \to \mathbb{C}$ to a continuous, linear functional $V \to \mathbb{C}$. For such $w'$, the identity (8.2) uniquely determines $T'w' \in V'$.

It is useful to note the following relation between the graphs of $T$ and $T'$. The graph $G_T$ has annihilator $G_T^\perp \subset V' \oplus W'$ given by

(8.3) $G_T^\perp = \{(v', w') \in V' \oplus W' : \langle Tv, w' \rangle = -\langle v, v' \rangle \text{ for all } v \in D(T)\}$.

Comparing the definition of $T'$, we see that, with

$J : V' \oplus W' \to W' \oplus V', \quad J(v', w') = (w', -v')$,

we have

(8.4) $G_{T'} = J\, G_T^\perp$.

We remark that $D(T)$ is dense if and only if the right side of (8.4) is the graph of a (single-valued) transformation. Using $X^{\perp\perp} = X$ for a closed linear subspace of a reflexive Banach space, we have the following.

Proposition 8.1. A densely defined linear operator $T : V \to W$ between reflexive Banach spaces has a closure $\overline{T}$ if and only if $T'$ is densely defined. $T'$ is always closed, and $T'' = \overline{T}$.


If $H_0$ and $H_1$ are Hilbert spaces and $T : H_0 \to H_1$, with dense domain $D(T)$, we define the adjoint $T^* : H_1 \to H_0$ by replacing the dual pairings in (8.2) by the Hilbert space inner products. Parallel to (8.4), we have

(8.5) $G_{T^*} = J\, G_T^\perp$,

where $J : H_0 \oplus H_1 \to H_1 \oplus H_0$, $J(v, w) = (w, -v)$, and one takes Hilbert space orthogonal complements. Again, $T$ has a closure if and only if $T^*$ is densely defined, $T^*$ is always closed, and $T^{**} = \overline{T}$. Note that, generally, the range $R(T)$ of $T$ satisfies

(8.6) $R(T)^\perp = \operatorname{Ker} T^*$.


A densely defined operator $T : H \to H$ on a Hilbert space is said to be symmetric provided $T^*$ is an extension of $T$ (i.e., $D(T^*) \supset D(T)$ and $T = T^*$ on $D(T)$). An equivalent condition is that $D(T)$ is dense and

(8.7) $(Tu, v) = (u, Tv)$, for $u, v \in D(T)$.

If $T^* = T$ (so $D(T^*) = D(T)$), we say $T$ is self-adjoint. In light of (8.5), $T$ is self-adjoint if and only if $D(T)$ is dense and

(8.8) $G_T = J\, G_T^\perp$.

Note that if $T$ is symmetric and $D(T) = H$, then $T^*$ cannot be a proper extension of $T$, so we must have $T^* = T$; hence $T$ is closed. By the closed graph theorem, $T$ must be bounded in this case; this result is called the Hellinger-Toeplitz theorem.

For a bounded operator defined on all of $H$, being symmetric is equivalent to being self-adjoint; in the case of unbounded operators, self-adjointness is a stronger and much more useful property. We discuss some results on self-adjointness. In preparation for this, it will be useful to note that if $T : H_0 \to H_1$ has range $R(T)$, and if $T$ is injective on $D(T)$, then $T^{-1} : H_1 \to H_0$ is defined, with domain $D(T^{-1}) = R(T)$, and we have

(8.9) $G_{T^{-1}} = J\, G_{-T}$.

Since generally $R(T)^\perp = \operatorname{Ker} T^*$, the following is an immediate consequence.

Proposition 8.2. If $T$ is self-adjoint on $H$ and injective, then $T^{-1}$, with dense domain $R(T)$, is self-adjoint.


Here is one useful consequence of Proposition 8.2. 


Proposition 8.3. If $T : H \to H$ is symmetric and $R(T) = H$, then $T$ is self-adjoint.

Proof. The identity (8.6) implies $\operatorname{Ker} T = 0$ if $R(T) = H$, so $T^{-1}$ is defined. Writing $f, g \in H$ as $f = Tu$, $g = Tv$, and using

$(T^{-1}f, g) = (T^{-1}Tu, Tv) = (u, Tv) = (Tu, v) = (f, T^{-1}g)$,

we see that $T^{-1}$ is symmetric. Since $D(T^{-1}) = H$, the Hellinger-Toeplitz theorem implies that $T^{-1}$ is bounded and self-adjoint, so Proposition 8.2 applies to $T^{-1}$.


Whenever $T : H_0 \to H_1$ is a closed, densely defined operator between Hilbert spaces, the spaces $G_T$ and $J G_{T^*}$ provide an orthogonal decomposition of $H_0 \oplus H_1$; that is,

(8.10) $H_0 \oplus H_1 = \{(v, Tv) + (-T^*u, u) : v \in D(T),\ u \in D(T^*)\}$,

where the terms in the sum are mutually orthogonal. Using this observation, we will be able to prove the following important result, due to J. von Neumann.

Proposition 8.4. If $T : H_0 \to H_1$ is closed and densely defined, then $T^*T$ is self-adjoint, and $I + T^*T$ has a bounded inverse.


Proof. Pick $f \in H_0$. Applying the decomposition (8.10) to $(f, 0) \in H_0 \oplus H_1$, we obtain unique $v \in D(T)$, $u \in D(T^*)$, such that

(8.11) $f = v - T^*u, \quad u = -Tv$.

Hence

(8.12) $v \in D(T^*T)$ and $(I + T^*T)v = f$.

Consequently, $I + T^*T : D(T^*T) \to H_0$ is bijective, with inverse $(I + T^*T)^{-1} : H_0 \to H_0$ having range $D(T^*T)$. Now, with $u = (I + T^*T)^{-1}f$ and $v = (I + T^*T)^{-1}g$, we easily compute

(8.13) $(f, (I + T^*T)^{-1}g) = ((I + T^*T)u, v) = (u, v) + (Tu, Tv) = ((I + T^*T)^{-1}f, g)$,

so $(I + T^*T)^{-1}$ is a symmetric operator on $H_0$. Since its domain is $H_0$, we have $(I + T^*T)^{-1}$ bounded and self-adjoint, and thus Proposition 8.2 finishes the proof.


If $T$ is symmetric, note that

(8.14) $\|(T \pm i)u\|^2 = \|Tu\|^2 + \|u\|^2$, for $u \in D(T)$.

If $T$ is closed, it follows that the ranges $R(T \pm i)$ are closed. See Exercise 6 below.
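
For completeness, here is the short computation behind (8.14), written out under the assumption that the inner product is linear in the first argument and conjugate-linear in the second (with the opposite convention the two cross terms trade signs and still cancel):

```latex
\begin{align*}
\|(T \pm i)u\|^2
  &= (Tu, Tu) \pm (Tu, iu) \pm (iu, Tu) + (iu, iu) \\
  &= \|Tu\|^2 \mp i\,(Tu, u) \pm i\,(u, Tu) + \|u\|^2 \\
  &= \|Tu\|^2 + \|u\|^2 ,
\end{align*}
```

the last step using $(Tu, u) = (u, Tu)$ for $u \in D(T)$, which is the symmetry condition (8.7). In particular $\|(T \pm i)u\| \ge \|u\|$, which is the lower bound needed to apply Exercise 6.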
The following result provides an important criterion for self-adjointness. 


Proposition 8.5. Let $T : H \to H$ be symmetric. The following three conditions are equivalent:

(8.15) $T$ is self-adjoint,
(8.16) $T$ is closed and $\operatorname{Ker}(T^* \pm i) = 0$,
(8.17) $R(T \pm i) = H$.

Proof. Assume (8.17) holds, that is, both ranges are all of $H$. Let $u \in D(T^*)$; we want to show that $u \in D(T)$. $R(T - i) = H$ implies there exists $v \in D(T)$ such that $(T - i)v = (T^* - i)u$. Since $D(T) \subset D(T^*)$, this implies $u - v \in D(T^*)$ and $(T^* - i)(u - v) = 0$. Now the implication (8.17) $\Rightarrow$ $\operatorname{Ker}(T^* \pm i) = 0$ is clear from (8.6), so we have $u = v$; hence $u \in D(T)$, as desired. The other implications of the proposition are straightforward.


In particular, if $T$ is self-adjoint on $H$, $T \pm i : D(T) \to H$ bijectively. Hence

(8.18) $U = (T - i)(T + i)^{-1} : H \to H$,

bijectively. By (8.14) this map preserves norms; we say $U$ is unitary. The association of such a unitary operator (necessarily bounded) with any self-adjoint operator (perhaps unbounded) is J. von Neumann's unitary trick. Note that $I - U = 2i(T + i)^{-1}$, with range equal to $D(T)$. We can hence recover $T$ from $U$ as

(8.19) $T = i(I + U)(I - U)^{-1}$,

both sides having domain D(T).
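
The identities (8.18) and (8.19) can be checked numerically in the simplest setting, a Hermitian matrix on $\mathbb{C}^2$, where no domain questions arise. The sketch below is only an illustration under that assumption; the matrix `T` and all helper names are ours.

```python
def matmul(a, b):
    """Product of two 2 x 2 complex matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inverse(a):
    """Inverse of an invertible 2 x 2 complex matrix."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

def add(a, b, scale=1):
    """Entrywise a + scale * b."""
    return [[a[i][j] + scale * b[i][j] for j in range(2)] for i in range(2)]

def adjoint(a):
    """Conjugate transpose."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def max_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))

I2 = [[1 + 0j, 0j], [0j, 1 + 0j]]
iI = [[1j, 0j], [0j, 1j]]

# A Hermitian (hence self-adjoint) matrix, chosen arbitrarily.
T = [[2 + 0j, 1 - 1j],
     [1 + 1j, -1 + 0j]]

# (8.18): U = (T - i)(T + i)^{-1}
U = matmul(add(T, iI, -1), inverse(add(T, iI)))

# (8.19): T = i (I + U)(I - U)^{-1}
T_back = [[1j * x for x in row]
          for row in matmul(add(I2, U), inverse(add(I2, U, -1)))]

print(max_diff(matmul(adjoint(U), U), I2))  # size of U*U - I, at rounding level
print(max_diff(T_back, T))                  # error in recovering T from U
```

Both printed numbers should be at the level of floating-point rounding, reflecting that $U$ is unitary and that the transform is inverted by (8.19).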

We next give a construction of a self-adjoint operator due to K. O. Friedrichs, which is particularly useful in PDE. One begins with the following set-up. There are two Hilbert spaces $H_0$ and $H_1$, with inner products $(\ ,\ )_0$ and $(\ ,\ )_1$, respectively, and a continuous injection

(8.20) $J : H_1 \to H_0$,

with dense range. We think of $J$ as identifying $H_1$ with a dense linear subspace of $H_0$; given $v \in H_1$, we will often write $v$ for $Jv \in H_0$. A linear operator $A : H_0 \to H_0$ is defined by the identity

(8.21) $(Au, v)_0 = (u, v)_1$,

for all $v \in H_1$, with domain

(8.22) $D(A) = \{u \in H_1 \subset H_0 : v \mapsto (u, v)_1$ extends from $H_1 \to \mathbb{C}$ to a continuous, conjugate-linear functional $H_0 \to \mathbb{C}\}$.


Thus the graph of $A$ is described as

(8.23) $G_A = \{(u, w) \in H_0 \oplus H_0 : u \in H_1$ and $(u, v)_1 = (w, v)_0$ for all $v \in H_1\}$.

We claim that $G_A$ is closed in $H_0 \oplus H_0$; this comes down to establishing the following.

Lemma 8.6. If $(u_n, w_n) \in G_A$, $u_n \to u$, $w_n \to w$ in $H_0$, then $u \in H_1$ and $u_n \to u$ in $H_1$.


Proof. Let $u_{mn} = u_m - u_n$, $w_{mn} = w_m - w_n$. We know that $(u_{mn}, v)_1 = (w_{mn}, v)_0$, for each $v \in H_1$. Taking $v = u_{mn}$ gives $\|u_{mn}\|_1^2 = (w_{mn}, u_{mn})_0 \to 0$ as $m, n \to \infty$. This implies that $(u_n)$ is Cauchy in $H_1$, and the rest follows.


Actually, we could have avoided writing down this last short proof, as it will 
not be needed to establish our main result: 


Proposition 8.7. The operator $A$ defined above is a self-adjoint operator on $H_0$.

Proof. Consider the adjoint of $J$, $J^* : H_0 \to H_1$. This is also injective with dense range, and the operator $JJ^*$ is a bounded, self-adjoint operator on $H_0$ that is injective with dense range. To restate (8.22), $D(A)$ consists of elements $u = J\tilde{u}$ such that $v \mapsto (\tilde{u}, v)_1$ is continuous in $Jv$, in the $H_0$-norm, that is, there exists $w \in H_0$ such that $(\tilde{u}, v)_1 = (w, Jv)_0$, hence $\tilde{u} = J^*w$. We conclude that

(8.24) $D(A) = R(JJ^*)$

and, for $u \in H_0$, $v \in H_1$,

(8.25) $(AJJ^*u, Jv)_0 = (J^*u, v)_1 = (u, Jv)_0$.

It follows that

(8.26) $A = (JJ^*)^{-1}$,

and Proposition 8.2 finishes the proof.
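
A finite-dimensional sketch may make (8.26) concrete. The following is our own illustration, assuming $H_0 = H_1 = \mathbb{C}^N$ as vector spaces, $J$ the identity map, $(u, v)_0 = v^* u$, and $(u, v)_1 = v^* G u$ for a fixed Hermitian positive-definite matrix $G$:

```latex
\begin{align*}
(Au, v)_0 = (u, v)_1 \ \text{for all } v
  &\iff v^* A u = v^* G u \ \text{for all } v
  \iff A = G, \\
(Ju, w)_0 = (u, J^* w)_1 \ \text{for all } u
  &\iff w^* u = (J^* w)^* G u \ \text{for all } u
  \iff J^* w = G^{-1} w, \\
JJ^* = G^{-1}
  &\ \Longrightarrow\ (JJ^*)^{-1} = G = A .
\end{align*}
```

Here $A = G$ is visibly self-adjoint, in agreement with Proposition 8.7; the content of the proposition is that the same conclusion survives when $H_1$ is a proper dense subspace of $H_0$ and $A$ is unbounded.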


Alternative approach. The inclusion $J : H_1 \hookrightarrow H_0$ yields a further inclusion, $J' : H_0 \hookrightarrow H_{-1}$, where $H_{-1}$ denotes the dual of $H_1$. Then the inner product on $H_1$ yields a linear isomorphism

(8.27) $L : H_1 \to H_{-1}, \quad \langle u, Lv \rangle = (u, v)_1, \quad u, v \in H_1$.

To restate the characterization of $A$ in (8.21)-(8.22), we have

(8.28) $G = L^{-1}\big|_{H_0} : H_0 \xrightarrow{\ \approx\ } D(A) \subset H_1$,

and following this by $J : H_1 \to H_0$ yields

(8.29) $A^{-1} = JG$.

Now $f \in H_0$, $v = Gf \Rightarrow v \in D(A)$ and $Lv = f$, hence

(8.30) $(Ju, f)_0 = (u, Gf)_1$

for $u \in H_1$. But the left side is $= (u, J^*f)_1$, so

(8.31) $G = J^*$.

Hence

(8.32) $A^{-1} = JG = JJ^*$,

and we again have (8.26).


We remark that, given a closed, densely defined operator $T$ on $H_0$, one can make $D(T) = H_1$ a Hilbert space with inner product $(u, v)_1 = (Tu, Tv)_0 + (u, v)_0$. Thus Friedrichs' result, Proposition 8.7, contains von Neumann's result, Proposition 8.4. This construction of Friedrichs is used to good effect in Chapter 5, and also in Chapter 8.

We next discuss the resolvent and spectrum of a general closed, densely defined operator $T : V \to V$. By definition, $\zeta \in \mathbb{C}$ belongs to the resolvent set $\rho(T)$ if and only if $\zeta - T : D(T) \to V$, bijectively. Then the inverse

(8.33) $R_\zeta = (\zeta - T)^{-1} : V \to D(T) \subset V$

is called the resolvent of $T$; clearly, $R_\zeta \in \mathcal{L}(V)$. As in §5, the complement of $\rho(T)$ is called the spectrum of $T$ and denoted $\sigma(T)$.

Such an operator may have an empty resolvent set. For example, the unbounded operator on $L^2(\mathbb{R}^2)$ defined by multiplication by $x_1 + ix_2$, with domain consisting of all $u \in L^2(\mathbb{R}^2)$ such that $(x_1 + ix_2)u \in L^2(\mathbb{R}^2)$, has this property. There are also examples of closed, densely defined, unbounded operators with empty spectrum. See Exercise 14 below. Note that Proposition 8.5 implies that $\pm i \in \rho(T)$ whenever $T$ is self-adjoint. The same argument shows that any $\zeta \in \mathbb{C} \setminus \mathbb{R}$ belongs to $\rho(T)$, hence $\sigma(T)$ is contained in $\mathbb{R}$, when $T$ is self-adjoint.

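We note some relations between $\sigma(T)$ and $\sigma(R_\zeta)$, given that $\zeta \in \rho(T)$. Clearly, 0 belongs to $\rho(R_\zeta)$ if and only if $D(T) = V$. Since $R_\zeta$ is bounded, we know that its spectrum is a nonempty, compact subset of $\mathbb{C}$. If $\lambda \in \rho(R_\zeta)$, write $S_\lambda = (\lambda - R_\zeta)^{-1}$. It follows easily that $S_\lambda$ and $R_\zeta$ commute, and both preserve $D(T)$. A computation gives

(8.34) $I = (\lambda - R_\zeta)S_\lambda = \lambda(\zeta - T)S_\lambda(\zeta - T)^{-1} - S_\lambda(\zeta - T)^{-1} = \lambda(\zeta - \lambda^{-1} - T)S_\lambda(\zeta - T)^{-1}$ on $V$,

and similarly,

(8.35) $I = \lambda(\zeta - T)^{-1}S_\lambda(\zeta - T) - (\zeta - T)^{-1}S_\lambda = \lambda S_\lambda(\zeta - T)^{-1}(\zeta - \lambda^{-1} - T)$ on $D(T)$.

This establishes the following:

Proposition 8.8. Given $\zeta \in \rho(T)$, if $\lambda \in \rho(R_\zeta)$ and $\lambda \ne 0$, then $\zeta - \lambda^{-1} \in \rho(T)$. Hence $\rho(T)$ is open in $\mathbb{C}$. We have, for such $\lambda$,

(8.36) $(\zeta - \lambda^{-1} - T)^{-1} = \lambda(\lambda - R_\zeta)^{-1}(\zeta - T)^{-1}$.

The second assertion follows from the fact that $\lambda \in \rho(R_\zeta)$ provided $|\lambda| > \|R_\zeta\|$.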

If there exists $\zeta \in \rho(T)$ such that $R_\zeta$ is compact, we say $T$ has compact resolvent. By Proposition 8.8 it follows that when $T$ has compact resolvent, then $\sigma(T)$ is a discrete subset of $\mathbb{C}$. Every resolvent in (8.36) is compact in this case. If $T$ is self-adjoint on $H$ with compact resolvent, there exists $z \in \rho(T) \cap \mathbb{R}$, and $(z - T)^{-1}$ is a compact, self-adjoint operator, to which Proposition 6.6 applies. Thus $H$ has an orthonormal basis of eigenvectors of $T$:

(8.37) $v_j \in D(T), \quad Tv_j = \lambda_j v_j$,

where $\{\lambda_j\}$ is a sequence of real numbers with no finite accumulation point. Important examples of unbounded operators with compact resolvent arise amongst differential operators; cf. Chap. 5.


Exercises 


1. Let $(X, \mu)$ be a $\sigma$-finite measure space, $a : X \to \mathbb{C}$ measurable. Take $p \in [1, \infty)$. Define

$M_a : L^p(X, \mu) \to L^p(X, \mu), \quad D(M_a) = \{f \in L^p(X, \mu) : af \in L^p(X, \mu)\}$.

Show that $M_a$ is densely defined. Show that $M_a$ is closed, i.e.,

$f_\nu \in D(M_a),\ f_\nu \to f$ in $L^p$, $af_\nu \to g$ in $L^p \ \Longrightarrow\ g = af$, $\mu$-a.e.

Hint. Set $X_N = \{x \in X : |a(x)| \le N\}$, $\chi_N(x) = 1$ on $X_N$, 0 otherwise, $a_N = a\chi_N$. Show that $g = af$, $\mu$-a.e., on each $X_N$.

2. In the setting of Exercise 1, take $p = 2$, and show that $M_a^* = M_{\bar{a}}$. In particular,

$v \in D(M_a^*) \ \Longrightarrow\ v \in D(M_{\bar{a}}), \quad M_a^* v = \bar{a}v$.

Hint. Use the boundedness of $a_N$.

3. Let $T : V \to W$ be densely defined and closed, $B \in \mathcal{L}(V, W)$. Define $T + B$ by $D(T + B) = D(T)$, $(T + B)v = Tv + Bv$. Show that $T + B$ is closed.

4. In the setting of Exercise 3, show that

$D((T + B)') = D(T'), \quad (T + B)' = T' + B'$.


5. Let $\Omega \subset \mathbb{R}^n$ be open, $P$ a differential operator with $C^\infty$ coefficients. Take $p \in [1, \infty)$. Define

$P : L^p(\Omega) \to L^p(\Omega), \quad D(P) = \{f \in L^p(\Omega) : Pf \in L^p(\Omega)\}$,

where a priori $Pf \in \mathcal{D}'(\Omega)$. Show that $P$ is densely defined. Show that $P$ is closed, i.e.,

$f_\nu \in D(P),\ f_\nu \to f$ in $L^p$, $Pf_\nu \to g$ in $L^p \ \Longrightarrow\ f \in D(P),\ Pf = g$.

Hint. Show that the hypotheses imply $Pf_\nu \to Pf$ and $Pf_\nu \to g$ in $\mathcal{D}'(\Omega)$.

6. Let $T : V \to W$ be closed, with dense domain $D(T)$. Assume there exists $C > 0$ such that

$\|Tu\| \ge C\|u\|, \quad \forall\, u \in D(T)$.

Show that $T$ has closed range.
Hint. Show that if $f_\nu \in D(T)$, $Tf_\nu \to g \in W$, then $(f_\nu)$ is Cauchy.


7. Consider the following operator, which is densely defined on $L^2(\mathbb{R})$:

$Tf(x) = f(0)e^{-x^2}, \quad D(T) = C_0^\infty(\mathbb{R})$.

Show that $T$ is unbounded and also that $T$ has no closure.
In Exercises 8-12, let $I = [-1, 1]$,

$H^1(I) = \{f \in L^2(I) : f' \in L^2(I)\}, \quad H_0^1(I) = \{f \in H^1(I) : f(-1) = f(1) = 0\}$.

Define $A_0$ and $A_1$ by

$D(A_0) = H_0^1(I), \quad D(A_1) = H^1(I), \quad A_j f = \frac{1}{i}\frac{df}{dx}$.

8. Show that $A_0$ and $A_1$ are closed and $A_0^* = A_1$. Hence $A_0$ is symmetric, but $A_1$ is not.

9. Show that

$R(A_0 \pm i)^\perp = \operatorname{Ker}(A_1 \mp i) = \operatorname{Span}\{e^{\mp x}\}$.

10. Assume $B$ is a self-adjoint extension of $A_0$. Show that

$(B + i)^{-1} : L^2(I) \to D(B)$

extends $(A_0 + i)^{-1}$, which is defined on $R(A_0 + i)$. Recall that $(B - i)(B + i)^{-1}$ is unitary. Show that there exists $\alpha \in \mathbb{R}$ such that

$(B - i)(B + i)^{-1}e^{-x} = e^{i\alpha}e^{x}$,

hence

$-2i(B + i)^{-1}e^{-x} = e^{i\alpha}e^{x} - e^{-x}$,

hence

$f_\alpha(x) = e^{i\alpha}e^{x} - e^{-x} \in D(B)$.

Show that

$\frac{f_\alpha(1)}{f_\alpha(-1)} = \omega \in S^1$.

11. In the setting of Exercise 10, show that

$D(B) = \{f \in H^1(I) : f(1) = \omega f(-1)\}, \quad Bf = \frac{1}{i}\frac{df}{dx}$.

12. Conversely, show that for each $\omega \in S^1$, if $B$ is defined as in Exercise 11, then $B$ is self-adjoint. Compare Exercise 2 in §9.


In Exercises 13-14, let $I = [0, 1]$, and define $D_0$ and $D_1$ by

$D(D_0) = \{f \in H^1(I) : f(0) = 0\}, \quad D(D_1) = \{f \in H^1(I) : f(1) = 0\}, \quad D_j f = \frac{df}{dx}$.

13. Show that $D_0$ and $D_1$ are closed and $D_0^* = -D_1$.

14. Note that the ODE $u' - \zeta u = f$, $u(0) = 0$ is solved by

$u(x) = e^{\zeta x}\int_0^x e^{-\zeta y} f(y)\, dy$.

Show that each $\zeta \in \mathbb{C}$ belongs to the resolvent set of $D_0$, and

$(\zeta - D_0)^{-1}f(x) = -e^{\zeta x}\int_0^x e^{-\zeta y} f(y)\, dy$.

Deduce that $\sigma(D_0) = \emptyset$. Compare Exercise 3 in §9.
15. Return to the setting of $I$ and $H_0^1(I)$ as in Exercise 8. Apply the Friedrichs construction to

$H_0 = L^2(I), \quad H_1 = H_0^1(I), \quad (u, v)_1 = (u', v')_0$.

Show that the self-adjoint operator $A$ obtained by Proposition 8.7 is given by

$D(A) = \{f \in H_0^1(I) : f'' \in L^2(I)\}, \quad Af = -\frac{d^2 f}{dx^2}$.

See Chapter 5, §1 or Chapter 8, §2 for a substantial multi-dimensional generalization.


16. In the setting of the second proof of Proposition 8.7, write $A^{-1} = JL^{-1}J'$, and use the identity $L = L'$ to deduce that

$(A^{-1})^* = JL^{-1}J' = A^{-1}$,

again deducing, via Proposition 8.2, that $A$ is self-adjoint.


9. Semigroups 


If $V$ is a Banach space, a one-parameter semigroup of operators on $V$ is a set of bounded operators

(9.1) $P(t) : V \to V, \quad t \in [0, \infty)$,

satisfying

(9.2) $P(s + t) = P(s)P(t)$,

for all $s, t \in \mathbb{R}^+$, and

(9.3) $P(0) = I$.

We also require strong continuity, that is,

(9.4) $t_j \to t \ \Longrightarrow\ P(t_j)v \to P(t)v$,

for each $v \in V$, the convergence being in the $V$-norm. A semigroup of operators will by definition satisfy (9.1)-(9.4). If $P(t)$ is defined for all $t \in \mathbb{R}$ and satisfies these conditions, we say it is a one-parameter group of operators.
A simple example is the translation group

(9.5) $T_p(t) : L^p(\mathbb{R}) \to L^p(\mathbb{R}), \quad 1 \le p < \infty$,

defined by

(9.6) $T_p(t)f(x) = f(x - t)$.

The properties (9.1)-(9.3) are clear in this case. Note that $\|T_p(t)\| = 1$ for each $t$. Also, $\|T_p(t) - T_p(t')\| = 2$ if $t \ne t'$; to see this, apply the difference to a function $f$ with support in an interval of length $|t - t'|/2$. To verify the strong continuity (9.4), we make the following observation. As noted in §1, the space $C_{00}(\mathbb{R})$ of compactly supported, continuous functions on $\mathbb{R}$ is dense in $L^p(\mathbb{R})$ for $p \in [1, \infty)$. Now, if $f \in C_{00}(\mathbb{R})$, $t_j \to t$, then $T_p(t_j)f(x) = f(x - t_j)$ have support in a fixed compact set and converge uniformly to $f(x - t)$, so clearly we have convergence in (9.4) in $L^p$-norm for each $f \in C_{00}(\mathbb{R})$. The following simple but useful lemma completes the proof of (9.4) for $T_p$.


Lemma 9.1. Let $T_j \in \mathcal{L}(V, W)$ be uniformly bounded. Let $L$ be a dense, linear subspace of $V$, and suppose

(9.7) $T_j v \to T_0 v$, as $j \to \infty$,

in the $W$-norm, for each $v \in L$. Then (9.7) holds for all $v \in V$.

Proof. Given $v \in V$ and $\varepsilon > 0$, pick $w \in L$ such that $\|v - w\| < \varepsilon$. Suppose $\|T_j\| \le M$ for all $j$. Then

$\|T_j v - T_0 v\| \le \|T_j v - T_j w\| + \|T_j w - T_0 w\| + \|T_0 w - T_0 v\| \le \|T_j w - T_0 w\| + 2M\|v - w\|$.

Thus

$\limsup_{j \to \infty} \|T_j v - T_0 v\| \le 2M\varepsilon$,

which proves the lemma.


Many examples of semigroups appear in the main text, particularly in Chaps. 3, 
6, and 9, so we will not present further examples here. 
We note that a uniform bound on the norm

(9.8) $\|P(t)\| \le M$, for $|t| \le 1$,

for some $M \in [1, \infty)$, holds for any strongly continuous semigroup, as a consequence of the uniform boundedness principle. From (9.8) we deduce that, for all $t \in \mathbb{R}^+$,

(9.9) $\|P(t)\| \le Me^{Kt}$,

for some $K$; for a group, one would use $Me^{K|t|}$, $t \in \mathbb{R}$.

Of particular interest are unitary groups: strongly continuous groups of operators $U(t)$ on a Hilbert space $H$ such that

(9.10) $U(t)^* = U(t)^{-1} = U(-t)$.

Clearly, in this case $\|U(t)\| = 1$. The translation group $T_2$ on $L^2(\mathbb{R})$ is a simple example of a unitary group.

A one-parameter semigroup $P(t)$ of operators on $V$ has an infinitesimal generator $A$, which is an operator on $V$, often unbounded, defined by

(9.11) $Av = \lim_{h \searrow 0} h^{-1}(P(h)v - v)$,

on the domain

(9.12) $D(A) = \{v \in V : \lim_{h \searrow 0} h^{-1}(P(h)v - v) \text{ exists in } V\}$.

The following provides some basic information on the generator. 


Proposition 9.2. The infinitesimal generator $A$ of $P(t)$ is a closed, densely defined operator. We have

(9.13) $P(t)D(A) \subset D(A)$,

for all $t \in \mathbb{R}^+$, and

(9.14) $AP(t)v = P(t)Av = \frac{d}{dt}P(t)v$, for $v \in D(A)$.

If (9.9) holds and $\operatorname{Re} \zeta > K$, then $\zeta$ belongs to the resolvent set of $A$, and

(9.15) $(\zeta - A)^{-1}v = \int_0^\infty e^{-\zeta t}P(t)v\, dt, \quad v \in V$.


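Proof. First, if $v \in D(A)$, then for $t \in \mathbb{R}^+$,
(9.16) $h^{-1}(P(h)P(t)v - P(t)v) = P(t)\,h^{-1}(P(h)v - v)$,
which gives (9.13), and also the first part of (9.14). Furthermore,
$h^{-1}[P(t + h)v - P(t)v] = P(t)\,h^{-1}(P(h)v - v) \to P(t)Av$,
as $h \searrow 0$. To check the backwards difference quotient, write, for $0 < h \le t$,
$h^{-1}[P(t)v - P(t - h)v] = P(t - h)\,h^{-1}(P(h)v - v) \to P(t)Av$,

as $h \searrow 0$, since

$w(h) \to w$ in $V$-norm $\Longrightarrow$ $P(t - h)w(h) \to P(t)w$,


in $V$-norm. To show that $D(A)$ is dense in $V$, let $v \in V$, and consider


$v_\varepsilon = \varepsilon^{-1} \int_0^\varepsilon P(t)v\,dt$.


Then


$h^{-1}(P(h)v_\varepsilon - v_\varepsilon) = \varepsilon^{-1} \Bigl[ h^{-1} \int_\varepsilon^{\varepsilon + h} P(t)v\,dt - h^{-1} \int_0^h P(t)v\,dt \Bigr]$
$\to \varepsilon^{-1}(P(\varepsilon)v - v)$, as $h \searrow 0$,


so $v_\varepsilon \in D(A)$ for each $\varepsilon > 0$. But $v_\varepsilon \to v$ in $V$ as $\varepsilon \to 0$, by (9.4), so $D(A)$ is
dense in $V$.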

Next we prove (9.15). Denote the right side of (9.15) by $R_\zeta$, clearly a bounded
operator on $V$. First we show that


(9.17) $R_\zeta(\zeta - A)v = v$, for $v \in D(A)$.


In fact, by (9.14) we have


$R_\zeta(\zeta - A)v = \int_0^\infty e^{-\zeta t} P(t)(\zeta v - Av)\,dt$
$= \int_0^\infty \zeta e^{-\zeta t} P(t)v\,dt - \int_0^\infty e^{-\zeta t} \frac{d}{dt} P(t)v\,dt$,
and integrating the last term by parts gives (9.17). The same sort of argument
shows that $R_\zeta : V \to D(A)$, that $(\zeta - A)R_\zeta$ is bounded on $V$, and that
(9.18) $(\zeta - A)R_\zeta v = v$,


for $v \in D(A)$. Since $(\zeta - A)R_\zeta$ is bounded on $V$ and $D(A)$ is dense in $V$, (9.18)
holds for all $v \in V$. This proves (9.15). Finally, since the resolvent set of $A$ is
nonempty, and $(\zeta - A)^{-1}$, being continuous and everywhere defined, is closed,
so is $A$. The proof of the proposition is complete.
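
As a sanity check on the resolvent formula (9.15), the following minimal sketch treats the finite-dimensional case, where $P(t) = e^{tA}$ for a matrix $A$. The matrix, the value of $\zeta$, the truncation time `T` and the grid size `n` are illustrative assumptions, not taken from the text, and the function name `laplace_of_semigroup` is hypothetical.

```python
import numpy as np

def laplace_of_semigroup(A, zeta, T=40.0, n=20000):
    """Trapezoid approximation of the integral of e^{-zeta t} e^{tA} over [0, T].

    A is assumed symmetric, so e^{tA} = Q diag(e^{t w}) Q^T with (w, Q) from eigh.
    """
    w, Q = np.linalg.eigh(A)
    t = np.linspace(0.0, T, n + 1)
    vals = np.exp(np.outer(t, w - zeta))      # row k holds e^{(w_i - zeta) t_k}
    h = T / n
    integral = h * (vals.sum(axis=0) - 0.5 * (vals[0] + vals[-1]))
    return (Q * integral) @ Q.T               # Q diag(integral) Q^T

A = np.array([[-1.0, 0.5], [0.5, -2.0]])      # illustrative matrix, spectrum < 0
zeta = 1.0                                    # any zeta with Re zeta > K works
lhs = np.linalg.inv(zeta * np.eye(2) - A)     # (zeta - A)^{-1}
rhs = laplace_of_semigroup(A, zeta)
print(np.max(np.abs(lhs - rhs)))              # small quadrature error
```

The two sides agree up to the quadrature and truncation error, which is what (9.15) predicts when $\operatorname{Re} \zeta$ exceeds the growth exponent $K$ in (9.9).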


We write, symbolically, 
(9.19) $P(t) = e^{tA}$.


In view of the following proposition, the infinitesimal generator determines 
the one-parameter semigroup with which it is associated uniquely. Hence we are 
justified in saying “A generates P(t).” 


Proposition 9.3. If $P(t)$ and $Q(t)$ are one-parameter semigroups with the same
infinitesimal generator, then $P(t) = Q(t)$ for all $t \in \mathbb{R}^+$.


Proof. Let $v \in V$ and $w \in V'$. Then, for $\operatorname{Re} \zeta$ large enough,


(9.20) $\int_0^\infty e^{-\zeta t} \langle P(t)v, w \rangle\,dt = \langle (\zeta - A)^{-1}v, w \rangle = \int_0^\infty e^{-\zeta t} \langle Q(t)v, w \rangle\,dt$.


Uniqueness for the Laplace transform of a scalar function implies $\langle P(t)v, w \rangle =
\langle Q(t)v, w \rangle$ for all $t \in \mathbb{R}^+$ and for any $v \in V$ and $w \in V'$. Then the Hahn-Banach
theorem implies $P(t)v = Q(t)v$, as desired.


We note that if $P(t)$ is a semigroup satisfying (9.9) and if we have a function
$\varphi \in L^1(\mathbb{R}^+, e^{Kt}\,dt)$, we can define $P(\varphi) \in \mathcal{L}(V)$ by


(9.21) $P(\varphi)v = \int_0^\infty \varphi(t) P(t)v\,dt$.


In particular, this works if $\varphi \in C_0^\infty(0, \infty)$. In such a case, it is easy to verify that,
for all $v \in V$, $P(\varphi)v$ belongs to the domain of all powers of $A$ and


(9.22) $A^k P(\varphi)v = (-1)^k \int_0^\infty \varphi^{(k)}(t) P(t)v\,dt$.


This shows that all the domains $D(A^k)$ are dense in $V$, refining the proof of
denseness of $D(A)$ in $V$ given in Proposition 9.2.

A general characterization of generators of semigroups, due to Hille and
Yosida, is briefly discussed in the exercises. Here we mention two important special
cases, which follow from the spectral theorem, established in Chap. 8.

Proposition 9.4. If $A$ is self-adjoint and positive (i.e., $(Au, u) \ge 0$ for $u \in
D(A)$), then $-A$ generates a semigroup $P(t) = e^{-tA}$ consisting of positive, self-adjoint
operators of norm $\le 1$.


Proposition 9.5. If $A$ is self-adjoint, then $iA$ generates a unitary group,
$U(t) = e^{itA}$.


In both cases it is easy to show that the generator of such (semi)groups must be
of the form hypothesized. For example, if $U(t)$ is a unitary group and we denote
by $iA$ the generator, the identity


(9.23) $h^{-1}((U(h) - I)u, v) = h^{-1}(u, [U(-h) - I]v)$


shows that $A$ must be symmetric. By Proposition 9.2, all $\zeta \in \mathbb{C} \setminus \mathbb{R}$ belong to the
resolvent set of $A$, so by Proposition 8.5, $A$ is self-adjoint. If $A$ is self-adjoint, $iA$
is said to be skew-adjoint.

We now give a criterion for a symmetric operator to be essentially self-adjoint, 
that is, to have self-adjoint closure. This is quite useful in PDE; see Chap. 8 for 
some applications. 


Proposition 9.6. Let $A_0$ be a linear operator on a Hilbert space $H$, with domain
$D$, assumed dense in $H$. Let $U(t)$ be a unitary group, with infinitesimal generator
$iA$, so $A$ is self-adjoint, $U(t) = e^{itA}$. Suppose $D \subset D(A)$ and $A_0 u = Au$ for
$u \in D$, or equivalently


(9.24) $\lim_{h \to 0} h^{-1}(U(h)u - u) = iA_0 u$, for all $u \in D$.


Also suppose $D$ is invariant under $U(t)$:

(9.25) $U(t)D \subset D$.

Then $A_0$ is essentially self-adjoint, with closure $A$. Suppose, furthermore, that
(9.26) $A_0 : D \to D$.

Then $A_0^k$, with domain $D$, is essentially self-adjoint for each positive integer $k$.
Proof. It follows from Proposition 8.5 that $A_0$ is essentially self-adjoint if and
only if the range of $i + A_0$ and the range of $i - A_0$ are dense in $H$. So suppose
$v \in H$ and (for one choice of sign)


(9.27) $((i + A_0)u, v) = 0$, for all $u \in D$.


Using (9.25) together with the fact that $A_0 = A$ on $D$, we have


(9.28) $((i + A_0)u, U(t)v) = 0$, for all $t \in \mathbb{R}$, $u \in D$.

Consequently, $\int \rho(t) U(t)v\,dt$ is orthogonal to the range of $i + A_0$, for any $\rho \in
L^1(\mathbb{R}^+)$. Choosing $\rho \in C_0^\infty(0, \infty)$ an approximate identity, we can approximate
$v$ by elements of $D(A)$, indeed of $D(A^k)$ for all $k$. Thus we can suppose in (9.27)
that $v \in D(A)$. Hence, taking adjoints, we have


(9.29) $(u, (-i + A)v) = 0$, for all $u \in D$.


Since $D$ is dense in $H$ and $\operatorname{Ker}(-i + A) = 0$, this implies $v = 0$. This yields
the first part of the proposition. Granted (9.26), the same proof works with $A_0$
replaced by $A_0^k$ (but $U(t)$ unaltered), so the proposition is proved.


This result has an extension to general semigroups which is of interest. 


Proposition 9.7. Let $P(t)$ be a semigroup of operators on a Banach space $B$,
with generator $A$. Let $\mathcal{L} \subset D(A)$ be a dense, linear subspace of $B$, and suppose
$P(t)\mathcal{L} \subset \mathcal{L}$ for all $t \ge 0$. Then $A$ is the closure of its restriction to $\mathcal{L}$.


Proof. By Proposition 9.2, we can take $K$ such that $\|P(t)\| \le M e^{Kt}$, and then
if $\operatorname{Re} \lambda > K$, we have

$\lambda - A : D(A) \to B$,
a topological isomorphism if $D(A)$ is given the graph norm. Hence it suffices to
show that $(\lambda - A)(\mathcal{L})$ is dense in $B$. If $w \in B'$ annihilates this range and $w \neq 0$,
pick $u \in \mathcal{L}$ such that $\langle u, w \rangle \neq 0$. Now


$\frac{d}{dt} \langle P(t)u, w \rangle = \langle AP(t)u, w \rangle = \langle \lambda P(t)u, w \rangle$


since $P(t)u \in \mathcal{L}$. Thus $\langle P(t)u, w \rangle = e^{\lambda t} \langle u, w \rangle$. But if $\operatorname{Re} \lambda > K$ as above, this
is impossible unless $\langle u, w \rangle = 0$. This completes the proof.


We illustrate some of the preceding results by looking at the infinitesimal generator
$A_p$ of the group $T_p$, given by (9.5)-(9.6). By definition, $f \in L^p(\mathbb{R})$ belongs
to $D(A_p)$ if and only if


(9.30) $h^{-1}(f(x - h) - f(x))$


converges in $L^p$-norm as $h \to 0$, to some limit. Now the limit of (9.30) always
exists in the space of distributions $\mathcal{D}'(\mathbb{R})$ and is equal to $-(d/dx)f$, where $d/dx$
is applied in the sense of distributions. In fact, we have the following.


Proposition 9.8. For $p \in [1, \infty)$, the group $T_p$ given by (9.5)-(9.6) has infinitesimal
generator $A_p$ given by


(9.31) $A_p f = -\frac{df}{dx}$,


for $f \in D(A_p)$, with
(9.32) $D(A_p) = \{f \in L^p(\mathbb{R}) : f' \in L^p(\mathbb{R})\}$,
where $f' = df/dx$ is considered a priori as a distribution.

Proof. The argument above shows that (9.31) holds, with $D(A_p)$ contained in the
right side of (9.32). The reverse containment can be derived as a consequence of
the following simple result, taking $\mathcal{L} = C_0^\infty(\mathbb{R})$.
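
Lemma 9.9. Let $P(t)$ be a one-parameter semigroup on $B$, with infinitesimal
generator $A$. Let $\mathcal{L}$ be a weak*-dense, linear subspace of $B'$, and assume
$P(t)'\mathcal{L} \subset \mathcal{L}$. Suppose that $u, v \in B$ and that


(9.33) $\lim_{h \searrow 0} h^{-1} \langle P(h)u - u, w \rangle = \langle v, w \rangle$, $\forall\, w \in \mathcal{L}$.


Then $u \in D(A)$ and $Au = v$.


Proof. The hypothesis (9.33) implies that $\langle P(t)u, w \rangle$ is differentiable and that


$\frac{d}{dt} \langle P(t)u, w \rangle = \frac{d}{ds} \langle P(t)P(s)u, w \rangle \big|_{s=0} = \frac{d}{ds} \langle P(s)u, P(t)'w \rangle \big|_{s=0}$
$= \langle v, P(t)'w \rangle = \langle P(t)v, w \rangle$, $\forall\, w \in \mathcal{L}$.


Actually, this directly gives right differentiability. See Exercise 17 below to get
full differentiability. Hence


$\langle P(t)u - u, w \rangle = \int_0^t \langle P(s)v, w \rangle\,ds$,


for all $w \in \mathcal{L}$. The weak* denseness of $\mathcal{L}$ implies $P(t)u - u = \int_0^t P(s)v\,ds$, and
the convergence in the $B$-norm of


$h^{-1}(P(h)u - u) = h^{-1} \int_0^h P(s)v\,ds$


to $v$ as $h \searrow 0$ follows.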


The space (9.32) is the Sobolev space $H^{1,p}(\mathbb{R})$ studied in Chap. 13; in case $p = 2$,
it is the Sobolev space $H^1(\mathbb{R})$ introduced in Chap. 4.
Note that if we define


(9.34) $A_0 : C_0^\infty(\mathbb{R}) \to C_0^\infty(\mathbb{R})$, $A_0 f = -\frac{df}{dx}$,


then Proposition 9.7 applies to $T_p$, $p \in [1, \infty)$, with $B = L^p(\mathbb{R})$, $\mathcal{L} = C_0^\infty(\mathbb{R})$, to
show that, as a closed operator on $L^p(\mathbb{R})$,
(9.35) $A_p$ is the closure of $A_0$, for $p \in [1, \infty)$.


This amounts to saying that $C_0^\infty(\mathbb{R})$ is dense in $H^{1,p}(\mathbb{R})$ for $p \in [1, \infty)$, which
can easily be verified directly.

The fact that a semigroup P(t) satisfies the operator differential equation (9.14) 
is central. We now establish the following converse. 


Proposition 9.10. Let $A$ be the infinitesimal generator of a semigroup. If a function
$u \in C([0, T), D(A)) \cap C^1([0, T), V)$ satisfies


(9.36) $\frac{du}{dt} = Au$, $u(0) = f$,

then $u(t) = e^{tA}f$, for $t \in [0, T)$.

Proof. We have $e^{(t-s)A}u(s)$ differentiable in $s \in (0, t)$, and


$\frac{d}{ds}\, e^{(t-s)A}u(s) = -e^{(t-s)A}Au(s) + e^{(t-s)A}Au(s) = 0$,


hence $e^{(t-s)A}u(s)$ has the same value at $s = 0$ and at $s = t$, so $u(t) = e^{tA}f$.


We can thus deduce that, given $g \in C([0, T), D(A))$, $f \in D(A)$, the equation


(9.37) $\frac{du}{dt} = Au + g(t)$, $u(0) = f$,


has a unique solution $u \in C([0, T), D(A)) \cap C^1([0, T), V)$, and it is given by


(9.38) $u(t) = e^{tA}f + \int_0^t e^{(t-s)A} g(s)\,ds$.


This is a version of Duhamel's formula.
Indeed, parallel to the proof of Proposition 9.10, we obtain for (9.37) that


$\frac{d}{ds}\, e^{(t-s)A}u(s) = e^{(t-s)A} g(s)$, $0 < s < t$.
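
Duhamel's formula (9.38) can be tested numerically in finite dimensions. The sketch below is only an illustration under stated assumptions: $A$ is a small symmetric matrix, the forcing `g` and initial vector `f` are arbitrary choices, and the helper names `expA`, `duhamel` and `rk4` are hypothetical. It evaluates (9.38) by trapezoid quadrature and compares with a direct Runge-Kutta integration of (9.37).

```python
import numpy as np

A = np.array([[-1.0, 0.5], [0.5, -2.0]])   # illustrative symmetric matrix
w, Q = np.linalg.eigh(A)

def expA(t):
    """e^{tA}, computed from the eigendecomposition of the symmetric matrix A."""
    return (Q * np.exp(t * w)) @ Q.T

def g(t):
    """Illustrative forcing term g(t)."""
    return np.array([np.cos(t), 1.0])

def duhamel(f, t, n=2000):
    """u(t) = e^{tA} f + int_0^t e^{(t-s)A} g(s) ds, trapezoid rule in s."""
    s = np.linspace(0.0, t, n + 1)
    vals = np.array([expA(t - sk) @ g(sk) for sk in s])
    integral = (t / n) * (vals.sum(axis=0) - 0.5 * (vals[0] + vals[-1]))
    return expA(t) @ f + integral

def rk4(f, t, n=2000):
    """Classical fourth-order Runge-Kutta for u' = A u + g(t), u(0) = f."""
    h, u = t / n, f.astype(float).copy()
    rhs = lambda s, v: A @ v + g(s)
    for k in range(n):
        s = k * h
        k1 = rhs(s, u)
        k2 = rhs(s + h / 2, u + h / 2 * k1)
        k3 = rhs(s + h / 2, u + h / 2 * k2)
        k4 = rhs(s + h, u + h * k3)
        u = u + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
    return u

f = np.array([1.0, -1.0])
u_duhamel = duhamel(f, 1.0)
u_ode = rk4(f, 1.0)
print(np.max(np.abs(u_duhamel - u_ode)))
```

The two computations agree up to discretization error, consistent with the uniqueness statement preceding (9.38).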
We can also define a notion of a "weak solution" of (9.37) as follows. Assume
$V$ is reflexive. If $A$ generates a semigroup, then $D(A')$ is a dense, linear subspace
of $V'$. Let $u \in C([0, T), V)$. Suppose that
$\langle u(t), \psi \rangle \in C^1([0, T))$, $\forall\, \psi \in D(A')$.


If $f \in V$, $g \in C([0, T), V)$, and


(9.39) $\frac{d}{dt} \langle u(t), \psi \rangle = \langle u(t), A'\psi \rangle + \langle g(t), \psi \rangle$, $u(0) = f$,
we say $u(t)$ is a weak solution to (9.37).


Proposition 9.11. Given $f \in V$ and $g \in C([0, T), V)$, (9.37) has a unique weak
solution, given by (9.38).


Proof. First, consider (9.38), with $f \in V$, $g \in C(J, V)$, and $J = [0, T)$. Let
$f_j \to f$ in $V$ and $g_j \to g$ in $C(J, V)$, where $f_j \in D(A)$ and $g_j \in C^1(J, V) \cap
C(J, D(A))$. Then, by Proposition 9.10 and subsequent comments,


(9.40) $u_j(t) = e^{tA}f_j + \int_0^t e^{(t-s)A} g_j(s)\,ds$


is the unique solution in $C^1(J, V) \cap C(J, D(A))$ to


$\frac{\partial u_j}{\partial t} = Au_j + g_j$, $u_j(0) = f_j$.

Thus, for any $\psi \in D(A')$, $u_j$ solves (9.39), with $g$ and $f$ replaced by $g_j$ and $f_j$,
respectively, and hence


(9.41) $\langle u_j(t), \psi \rangle = \langle f_j, \psi \rangle + \int_0^t \langle u_j(s), A'\psi \rangle\,ds + \int_0^t \langle g_j(s), \psi \rangle\,ds$.


Passing to the limit, we have


(9.42) $\langle u(t), \psi \rangle = \langle f, \psi \rangle + \int_0^t \langle u(s), A'\psi \rangle\,ds + \int_0^t \langle g(s), \psi \rangle\,ds$,

which implies (9.39).
For the converse, suppose that $u \in C(J, V)$ is a weak solution, satisfying
(9.39), or equivalently, that (9.42) holds. Set $\varphi_j(t) = j$ for $0 \le t \le 1/j$, $0$ elsewhere,
and consider $P(\varphi_j)$, defined by (9.21). We see that


$\langle Av, P(\varphi_j)'\psi \rangle = \langle AP(\varphi_j)v, \psi \rangle$, for $v \in D(A)$.
Hence $P(\varphi_j)' : V' \to D(A')$, and also
$\langle v, A'P(\varphi_j)'\psi \rangle = \langle AP(\varphi_j)v, \psi \rangle$, for $v \in V$, $\psi \in V'$.
If you replace $\psi$ by $P(\varphi_j)'\psi$ in (9.42), then $u_j(t) = P(\varphi_j)u(t)$ satisfies (9.41),
with $f_j = P(\varphi_j)f$, $g_j(t) = P(\varphi_j)g(t)$; hence $u_j \in C^1(J, V) \cap C(J, D(A))$ is
given by (9.40), and passing to the limit gives (9.38) for $u$.


We close this section with a brief discussion of when we can deduce that, given 
a generator A of a semigroup and another operator B, then A + B also generates 
a semigroup. There are a number of results on this, to the effect that A + B 
works if B is “small” in some sense, compared to A. These results are part of the 
“perturbation theory” of semigroups. The following simple case is useful. 


Proposition 9.12. If $A$ generates a semigroup $e^{tA}$ on $V$ and $B$ is bounded on $V$,
then $A + B$ also generates a semigroup.


Proof. The idea is to solve the equation


(9.43) $\frac{du}{dt} = Au + Bu$, $u(0) = f$,


by solving the integral equation


(9.44) $u(t) = e^{tA}f + \int_0^t e^{(t-s)A} Bu(s)\,ds$.


In other words, we want to solve
(9.45) $(I - N)u(t) = e^{tA}f \in C([0, \infty), V)$,


where


(9.46) $Nu(t) = \int_0^t e^{(t-s)A} Bu(s)\,ds$, $N : C(\mathbb{R}^+, V) \to C(\mathbb{R}^+, V)$.


Note that


(9.47) $N^k u(t) = \int_0^t \int_0^{t_{k-1}} \cdots \int_0^{t_1} e^{(t - t_{k-1})A} B e^{(t_{k-1} - t_{k-2})A} B \cdots B e^{(t_1 - t_0)A} B u(t_0)\,dt_0 \cdots dt_{k-1}$.


Hence, if $e^{tA}$ satisfies the estimate (9.9),


(9.48) $\sup_{0 \le t \le T} \|N^k u(t)\| \le (M\|B\|)^k e^{KT} \cdot (\operatorname{vol} S_k^T) \cdot \sup_{0 \le t \le T} \|u(t)\|$,


where $\operatorname{vol} S_k^T$ is the volume of the $k$-simplex


$S_k^T = \{(t_0, \dots, t_{k-1}) : 0 \le t_0 \le \cdots \le t_{k-1} \le T\}$.


Looking at the case $A = 0$, $B = b$ (scalar) of (9.43), with solution $u(t) = e^{tb}f$,
we see that


(9.49) $\operatorname{vol} S_k^T = \frac{T^k}{k!}$.

It follows that
(9.50) $Sg(t) = g(t) + \sum_{k=1}^\infty N^k g(t)$
is convergent in $C(\mathbb{R}^+, V)$, given $g(t) \in C(\mathbb{R}^+, V)$. Now consider


(9.51) $Q(t)f = e^{tA}f + \sum_{k=1}^\infty N^k(e^{tA}f)$.

It is straightforward to verify that $Q(t)$ is a strongly continuous semigroup on $V$,
with generator $A + B$.
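
The series (9.51) is the limit of the Picard iteration for the integral equation (9.44), and this can be observed numerically in finite dimensions. The following sketch is an illustration only: it assumes $A$ is diagonal (so that $e^{tA}$ is explicit), uses an arbitrary small matrix $B$, and discretizes the integral in (9.46) by the trapezoid rule. The names `picard_semigroup` and `expm_series` are hypothetical helpers.

```python
import numpy as np

def picard_semigroup(a, B, f, T=1.0, n=200, iters=30):
    """Iterate u <- e^{tA} f + N u on a time grid, with A = diag(a)."""
    t = np.linspace(0.0, T, n + 1)
    h = T / n
    free = np.exp(np.outer(t, a)) * f            # row i holds e^{t_i A} f
    u = free.copy()
    for _ in range(iters):
        Bu = u @ B.T                             # row j holds B u(t_j)
        new = free.copy()
        for i in range(1, n + 1):
            # integrand e^{(t_i - s)A} B u(s) sampled at s = t_0, ..., t_i
            vals = np.exp(np.outer(t[i] - t[: i + 1], a)) * Bu[: i + 1]
            new[i] += h * (vals.sum(axis=0) - 0.5 * (vals[0] + vals[-1]))
        u = new
    return t, u

def expm_series(M, terms=40):
    """Truncated power series for e^M, adequate for matrices of small norm."""
    out = np.eye(M.shape[0])
    term = np.eye(M.shape[0])
    for k in range(1, terms):
        term = term @ M / k
        out = out + term
    return out

a = np.array([-1.0, -2.0])                       # A = diag(a), illustrative
B = np.array([[0.0, 0.5], [0.3, 0.1]])           # bounded perturbation, illustrative
f = np.array([1.0, 1.0])
t, u = picard_semigroup(a, B, f)
reference = expm_series(np.diag(a) + B) @ f      # e^{(A+B)} f at time T = 1
print(np.max(np.abs(u[-1] - reference)))
```

After enough iterations the grid values agree with $e^{t(A+B)}f$ up to quadrature error, in line with the factorial decay supplied by (9.48) and (9.49).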


An extension of Proposition 9.12, part of the perturbation theory of R. Phillips,
is given in the exercises. We mention another perturbation result, due to T. Kato.
A semigroup $P(t)$ is called a contraction semigroup on $V$ if $\|P(t)\| \le 1$ for all
$t \ge 0$.


Proposition 9.13. If $A$ generates a contraction semigroup on $V$, then $A + B$
generates a contraction semigroup, provided $D(B) \supset D(A)$, $B$ is "dissipative,"
and


(9.52) $\|Bf\| \le \vartheta \|Af\| + C_1 \|f\|$,


for some $C_1 < \infty$ and $\vartheta < 1/2$. If $V$ is a Hilbert space, we can allow any $\vartheta < 1$.
To say that B is dissipative means that if u € D(B) C V and u* € V’ satisfies 

(u,u*) = |u|, then 

(9.53) $\operatorname{Re} \langle Bu, u^* \rangle \le 0$.

If $V$ is a Hilbert space with inner product $(\ ,\ )$, this is equivalent to


(9.54) $\operatorname{Re} (Bu, u) \le 0$, for $u \in D(B)$.


Proofs of Proposition 9.13 typically use the Hille—Yosida characterization of 
which A generate a contraction semigroup. See the exercises for further discus- 
sion. 


Exercises 


In Exercises 1-3, define, for $I = (0, 1)$,


(9.55) $A_0 : C_0^\infty(I) \to C_0^\infty(I)$, $A_0 f = -\frac{df}{dx}$.

1. Given $f \in L^2(I)$, define $Ef$ on $\mathbb{R}$ to be equal to $f$ on $I$ and to be periodic of period
1, and define $U(t) : L^2(I) \to L^2(I)$ by
(9.56) $U(t)f(x) = (Ef)(x - t)\big|_I$.


Show that $U(t)$ is a unitary group whose generator $D$ is a skew-adjoint extension of
$A_0$. Describe the domain of $D$.


2. More generally, for $e^{i\theta} \in S^1$, define $Ef$ on $\mathbb{R}$ to equal $f$ on $I$ and to satisfy


$(Ef)(x + 1) = e^{i\theta}(Ef)(x)$.


Then define $U_\theta(t) : L^2(I) \to L^2(I)$ by (9.56), with this $E$. Show that $U_\theta(t)$ is a
unitary group whose generator $D_\theta$ is a skew-adjoint extension of $A_0$. Describe the
domain of $D_\theta$. Compare Exercises 8-12 of §8.


3. This time, define $Ef$ on $\mathbb{R}$ to equal $f$ on $I$ and zero elsewhere. For $t \ge 0$, define
$P(t) : L^2(I) \to L^2(I)$ by (9.56) with this $E$. Show that $P(t)$ is a strongly continuous
semigroup. Show that $P(t) = 0$ for $t \ge 1$. Show that the infinitesimal
generator $B$ of $P(t)$ is a closed extension of $A_0$ which has empty spectrum. Describe
the domain of $B$. Compare Exercises 13-14 of §8.


4. Let $P(t)$ be a strongly continuous semigroup on the Banach space $X$, with infinitesimal
generator $A$. Suppose $A$ has compact resolvent. If $K$ is a closed bounded subset
of $X$, show that $K$ is compact if and only if $P(t) \to I$ uniformly on $K$. (Hint: Let
$T_j = h^{-1} \int_0^h P(t)\,dt$, $h = 1/j$, and use Exercise 4 of §6.)


Exercises 5-8 deal with the case where $P(t)$ satisfies (9.1)-(9.3) but the strong continuity
of $P(t)$ is replaced by weak continuity, that is, convergence in (9.4) holds in the
$\sigma(V, V')$-topology on $V$. We restrict attention to the case where $V$ is reflexive.


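5. If $\varphi \in C_0^\infty(\mathbb{R}^+)$, show that $P(\varphi)v$ is well defined in $V$, satisfying


$\langle P(\varphi)v, w \rangle = \int_0^\infty \varphi(t) \langle P(t)v, w \rangle\,dt$, $v \in V$, $w \in V'$.


6. Show that $V_0 = \operatorname{span}\{P(\varphi)v : v \in V, \varphi \in C_0^\infty(\mathbb{R}^+)\}$ is dense in $V$. (Hint: Suppose
$w \in V'$ annihilates $V_0$.)


7. Show that $P(t_j)P(\varphi)v = P(\varphi_j)v$, where $\varphi_j(\tau) = \varphi(\tau - t_j)$ for $\tau \ge t_j$, $0$ for
$\tau < t_j$. Deduce that as $t_j \to t$,
$P(t_j)P(\varphi)v \to P(t)P(\varphi)v$, in $V$-norm,


for $v \in V$, $\varphi \in C_0^\infty(\mathbb{R}^+)$. (Hint: Estimate $\|P(\varphi_j - \varphi_0)v\|$, with $\varphi_0(\tau) = \varphi(\tau - t)$.
To do this, show that (9.9) continues to hold.)


8. Deduce that the hypotheses on $P(t)$ in Exercises 5-7 imply the strong continuity (9.4).
(Hint: Use Lemma 9.1.)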


9. If $P(t)$ is a strongly continuous semigroup on $V$, then $Q(t) = P(t)'$, acting on
$V'$, satisfies (9.1)-(9.3), with weak* continuity in place of (9.4). Deduce that if $V$
is reflexive, $Q(t)$ is a strongly continuous semigroup on $V'$. Give an example of $P(t)$
on a (nonreflexive) Banach space $V$ for which $P(t)'$ is not strongly continuous in
$t \in [0, \infty)$.

10. Extend Proposition 9.12 to show that if $A$ generates a semigroup $e^{tA}$ on $V$ and if
$D(B) \supset D(A)$ is such that $Be^{tA}$ is bounded for $t > 0$, satisfying


$\|Be^{tA}\|_{\mathcal{L}(V)} \le C_0 t^{-\alpha}$, $t \in (0, 1]$,


for some $\alpha < 1$, then $A + B$ also generates a semigroup.
(Hint: Show that (9.51) still works. Note that the integrand in the formula (9.47) for
$N^k(e^{tA}f)$ is of the form $\cdots Be^{(t_1 - t_0)A} Be^{t_0 A}f$.)

11. Recall that $P(t)$ is a contraction semigroup if it satisfies (9.1)-(9.4) and $\|P(t)\| \le 1$
for all $t \ge 0$. Show that the infinitesimal generator $A$ of a contraction semigroup has
the following property:


(9.57) $\lambda > 0 \Longrightarrow \lambda \in \rho(A)$, and $\|(\lambda - A)^{-1}\| \le \frac{1}{\lambda}$.

12. The Hille-Yosida theorem states that whenever $D(A)$ is dense in $V$ and there exist
$\lambda_j > 0$ such that


(9.58) $\lambda_j \nearrow +\infty$, $\lambda_j \in \rho(A)$, $\|(\lambda_j - A)^{-1}\| \le \frac{1}{\lambda_j}$,

then $A$ generates a contraction semigroup. Try to prove this. (Hint: With $\lambda = \lambda_j$, set
$A_\lambda = \lambda A(\lambda - A)^{-1}$, which is in $\mathcal{L}(V)$. Define $P_\lambda(t) = e^{tA_\lambda}$ by the power-series
expansion. Show that


(9.59) $\|P_\lambda(t)\| \le 1$, $\|(P_\lambda(t) - P_\mu(t))f\| \le t\|(A_\lambda - A_\mu)f\|$,


and construct $P(t)$ as the limit of $P_{\lambda_j}(t)$.)

13. If $P(t)$ satisfies (9.9), set $Q(t) = e^{-Kt}P(t)$, so $\|Q(t)\| \le M$ for $t \ge 0$. Show
that $|||f||| = \sup_{t \ge 0} \|Q(t)f\|$ defines an equivalent norm on $V$, for which $Q(t)$ is a
contraction semigroup. Then, using Exercises 11 and 12, produce a characterization
of generators of semigroups.

14. Show that if P(t) is a contraction semigroup, its generator A is dissipative, in the 
sense of (9.53). 

15. Show that if $D(A)$ is dense, if $\lambda_0 \in \rho(A)$ for some $\lambda_0$ such that $\operatorname{Re} \lambda_0 > 0$, and
if $A$ is dissipative, then $A$ generates a contraction semigroup. (Hint: First show that
the hypotheses imply $\lambda \in \rho(A)$ whenever $\operatorname{Re} \lambda > 0$. Then apply the Hille-Yosida
theorem.)

Deduce Propositions 9.4 and 9.5 from this result. 

16. Prove Proposition 9.13. (Hint: Show that $\lambda \in \rho(A + B)$ for some $\lambda > 0$, and apply
Exercise 15. To get this, show that when $A$ is dissipative and $\lambda > 0$, $\lambda \in \rho(A)$, then


$\|A(\lambda - A)^{-1}f\| \le \kappa \|f\|$,


where $\kappa = 2$ for a general Banach space $V$, while $\kappa = 1$ if $V$ is a Hilbert space.)

17. Let $f : I \to \mathbb{R}$ be continuous ($I = (a, b)$) and right-differentiable, i.e.,


$\lim_{h \searrow 0} h^{-1}[f(x + h) - f(x)] = D_+f(x)$, $\forall\, x \in I$.


Assume $D_+f = g \in C(I)$. Show that $f \in C^1(I)$ and $f' = g$. Discuss the relevance
to the proof of Lemma 9.9. Hint. Show that, for $f \in C(I)$, $D_+f = 0 \Rightarrow f$ constant.

18. Let $P(t) \in \mathcal{L}(V)$ satisfy (9.2)-(9.3). Assume $\|P(t)\| \le M$ for $t \in [0, 1]$ and
$P(h)v \to v$ as $h \searrow 0$, for all $v \in V$. Show that $P(t)$ is strongly continuous in
$t \in [0, \infty)$.
Hint. Use the identities $P(t + h)v - P(t)v = P(t)[P(h)v - v]$ and $P(t)v - P(t - h)v = P(t - h)[P(h)v - v]$.
B


Manifolds, Vector Bundles, and Lie 
Groups 


Introduction 


This appendix provides background material on manifolds, vector bundles, and 
Lie groups, which are used throughout the book. We begin with a section on 
metric spaces and topological spaces, defining some terms that are necessary for 
the concept of a manifold, defined in §2, and for that of a vector bundle, defined 
in §3. These sections contain mostly definitions; however, a few results about 
compactness are proved. 

In §4 we establish the easy case of a theorem of Sard, a useful result in manifold 
theory. This is used in the development of degree theory in Chap. 1, §20. 

In §5 we introduce the concept of a Lie group $G$ and its Lie algebra $\mathfrak{g}$ and
establish the correspondence between Lie subgroups of $G$ and Lie subalgebras of
$\mathfrak{g}$. We also define a Haar measure on a Lie group. In §6 we establish an important
relation between Lie groups and Lie algebras, known as the Campbell-Hausdorff
formula.

In §7 we discuss representations of a Lie group and associated representations 
of its Lie algebra. Some basic results on representations of compact Lie groups 
are given in §8, and in §9 we specialize to the groups SU(2) and SO(3) and to 
some related groups, such as SO(4). Material in §9 is useful in Chap. 8, Spectral 
Theory, particularly in its study of the simplest quantum mechanical model of the 
hydrogen atom. 


1. Metric spaces and topological spaces 


A metric space is a set $X$ together with a distance function $d : X \times X \to [0, \infty)$,
having the properties that

(1.1) $d(x, y) = 0 \iff x = y$,
      $d(x, y) = d(y, x)$,
      $d(x, y) \le d(x, z) + d(y, z)$.


The third of these properties is called the triangle inequality. An example of a
metric space is the set of rational numbers $\mathbb{Q}$, with $d(x, y) = |x - y|$. Another
example is $X = \mathbb{R}^n$, with $d(x, y) = \sqrt{(x_1 - y_1)^2 + \cdots + (x_n - y_n)^2}$.

If $(x_\nu)$ is a sequence in $X$, indexed by $\nu = 1, 2, 3, \dots$ (i.e., by $\nu \in \mathbb{Z}^+$), one
says $x_\nu \to y$ if $d(x_\nu, y) \to 0$, as $\nu \to \infty$. One says $(x_\nu)$ is a Cauchy sequence
if $d(x_\nu, x_\mu) \to 0$ as $\mu, \nu \to \infty$. One says $X$ is a complete metric space if every
Cauchy sequence converges to a limit in $X$. Some metric spaces are not complete;
for example, $\mathbb{Q}$ is not complete. One can take a sequence $(x_\nu)$ of rational numbers
such that $x_\nu \to \sqrt{2}$, which is not rational. Then $(x_\nu)$ is Cauchy in $\mathbb{Q}$, but it has
no limit in $\mathbb{Q}$.

If a metric space $X$ is not complete, one can construct its completion $\hat{X}$ as follows.
Let an element $\xi$ of $\hat{X}$ consist of an equivalence class of Cauchy sequences
in $X$, where we say $(x_\nu) \sim (y_\nu)$, provided $d(x_\nu, y_\nu) \to 0$. We write the equivalence
class containing $(x_\nu)$ as $[x_\nu]$. If $\xi = [x_\nu]$ and $\eta = [y_\nu]$, we can set
$\hat{d}(\xi, \eta) = \lim_{\nu \to \infty} d(x_\nu, y_\nu)$ and verify that this is well defined and makes $\hat{X}$
a complete metric space.

If the completion of $\mathbb{Q}$ is constructed by this process, you get $\mathbb{R}$, the set of real
numbers.
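
To make the incompleteness of $\mathbb{Q}$ concrete, here is a small sketch that produces a Cauchy sequence of rationals approaching $\sqrt{2}$ using exact rational arithmetic. The choice of Newton's iteration $x_{n+1} = (x_n + 2/x_n)/2$ and the function name `newton_sqrt2` are assumptions made for illustration; the text only asserts that such a sequence exists.

```python
from fractions import Fraction

def newton_sqrt2(steps):
    """Rational Newton iterates x_{n+1} = (x_n + 2/x_n)/2, starting at x_0 = 1."""
    xs = [Fraction(1)]
    for _ in range(steps):
        x = xs[-1]
        xs.append((x + 2 / x) / 2)
    return xs

xs = newton_sqrt2(6)
# Successive gaps |x_{n+1} - x_n| shrink rapidly, as expected of a Cauchy sequence.
gaps = [abs(xs[i + 1] - xs[i]) for i in range(1, len(xs) - 1)]
# No iterate squares to 2 exactly, since every iterate is rational.
residuals = [abs(x * x - 2) for x in xs]
print(xs[:4])
```

Every term is an exact rational, the gaps decrease, and $x_n^2 - 2$ tends to zero without ever vanishing, so the sequence is Cauchy in $\mathbb{Q}$ but has no limit there.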

There are a number of concepts related to the notion of closeness, which we 
now discuss. First, if p is a point in a metric space X and r € (0,00), the set 


(1.2) $B_r(p) = \{x \in X : d(x, p) < r\}$


is called the open ball of radius r, centered at p. Generally, a neighborhood of 
p € X is a set containing such a ball, for some r > 0. 

A set U C X is called open if it contains a neighborhood of each of its points. 
The complement of an open set is said to be closed. The following result charac- 
terizes closed sets. 


Proposition 1.1. A subset $K$ of a metric space $X$ is closed if and only if
(1.3) $x_j \in K$, $x_j \to p \in X \Longrightarrow p \in K$.


Proof. Assume $K$ is closed, $x_j \in K$, $x_j \to p$. If $p \notin K$, then $p \in X \setminus K$, which
is open, so some $B_\varepsilon(p) \subset X \setminus K$, and $d(x_j, p) \ge \varepsilon$ for all $j$. This contradiction
implies $p \in K$.

Conversely, assume (1.3) holds, and let $q \in U = X \setminus K$. If $B_{1/n}(q)$ is not
contained in $U$ for any $n$, then there exist $x_n \in K \cap B_{1/n}(q)$, hence $x_n \to q$,
contradicting (1.3). This completes the proof.


The following records some straightforward observations. 


Proposition 1.2. If $U_\alpha$ is a family of open sets in $X$, then $\bigcup_\alpha U_\alpha$ is open. If $K_\alpha$ is
a family of closed subsets of $X$, then $\bigcap_\alpha K_\alpha$ is closed.

Given $S \subset X$, we denote by $\overline{S}$ (the closure of $S$) the smallest closed subset of
$X$ containing $S$, i.e., the intersection of all the closed sets $K$ containing $S$. The
following is an exercise.


Proposition 1.3. Given $S \subset X$, we have $p \in \overline{S}$ if and only if there exist $x_j \in S$
such that $x_j \to p$.


Given $S \subset X$, $p \in X$, we say $p$ is an accumulation point of $S$ if and only
if, for each $\varepsilon > 0$, there exists $q \in S \cap B_\varepsilon(p)$, $q \neq p$. It follows that $p$ is an
accumulation point of $S$ if and only if each $B_\varepsilon(p)$, $\varepsilon > 0$, contains infinitely
many points of $S$. One straightforward observation is that all points of $\overline{S} \setminus S$ are
accumulation points of $S$.

The interior of a set $S \subset X$ is the largest open set contained in $S$, i.e., the
union of all the open sets contained in $S$. Note that the complement of the interior
of $S$ is equal to the closure of $X \setminus S$.


Compactness 


We turn now to the notion of compactness. We say a metric space $X$ is compact
provided $X \neq \emptyset$ and the following property holds:


(1.4) Each sequence $(x_k)$ in $X$ has a convergent subsequence.


We will examine basic properties of compact metric spaces and provide some
equivalent characterizations. For example, it is readily seen that (1.4) is equivalent
to:


(1.5) Each infinite subset $S \subset X$ has an accumulation point.

The following property is known as total boundedness.


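Proposition 1.4. If $X$ is a compact metric space, then


(1.6) Given $\varepsilon > 0$, $\exists$ a finite set $\{x_1, \dots, x_N\}$
such that $B_\varepsilon(x_1), \dots, B_\varepsilon(x_N)$ covers $X$.

Proof. Take $\varepsilon > 0$ and pick $x_1 \in X$. If $B_\varepsilon(x_1) = X$, we are done. If not,
pick $x_2 \in X \setminus B_\varepsilon(x_1)$. If $B_\varepsilon(x_1) \cup B_\varepsilon(x_2) = X$, we are done. If not, pick
$x_3 \in X \setminus [B_\varepsilon(x_1) \cup B_\varepsilon(x_2)]$. Continue, taking $x_{k+1} \in X \setminus [B_\varepsilon(x_1) \cup \cdots \cup B_\varepsilon(x_k)]$,
if $B_\varepsilon(x_1) \cup \cdots \cup B_\varepsilon(x_k) \neq X$. Note that, for $1 \le i, j \le k$,

$i \neq j \Longrightarrow d(x_i, x_j) \ge \varepsilon$.


If one never covers $X$ this way, consider $S = \{x_j : j \in \mathbb{N}\}$. This is an infinite set
with no accumulation point, so property (1.5) is contradicted.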
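
The selection procedure in this proof is a greedy algorithm, and it can be run on a finite sample of a bounded set. The sketch below is an illustration under assumptions: the points are random samples of the unit square, the metric is Euclidean, and the names `greedy_eps_net`, `sample` and `centers` are hypothetical.

```python
import math
import random

def greedy_eps_net(points, eps, dist):
    """Keep a point as a new center only if it lies outside every ball
    B_eps(c) around the centers chosen so far, as in the proof above."""
    centers = []
    for p in points:
        if all(dist(p, c) >= eps for c in centers):
            centers.append(p)
    return centers

rng = random.Random(0)                           # fixed seed for reproducibility
sample = [(rng.random(), rng.random()) for _ in range(2000)]
eps = 0.25
centers = greedy_eps_net(sample, eps, math.dist)
print(len(centers))
```

By construction the centers are pairwise at distance at least $\varepsilon$, and every sample point lies in some $B_\varepsilon(x_j)$. In the unit square this separation forces the number of centers to stay bounded, which is the finite-dimensional shadow of the contradiction with (1.5).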


Corollary 1.5. If $X$ is a compact metric space, it has a countable dense subset.


Proof. Given $\varepsilon = 2^{-n}$, let $S_n$ be a finite set of points $x_j$ such that $\{B_\varepsilon(x_j)\}$
covers $X$. Then $C = \bigcup_n S_n$ is a countable dense subset of $X$.

The following is a preliminary version of a more general result, to be established
in (1.10) below.


Proposition 1.6. Let $X$ be a compact metric space. Assume $K_1 \supset K_2 \supset K_3 \supset
\cdots$ form a decreasing sequence of closed subsets of $X$. If each $K_n \neq \emptyset$, then
$\bigcap_n K_n \neq \emptyset$.


Proof. Pick $x_n \in K_n$. If (1.4) holds, $(x_n)$ has a convergent subsequence, $x_{n_k} \to
y$. Since $\{x_{n_k} : k \ge \ell\} \subset K_{n_\ell}$, which is closed, we have $y \in \bigcap_n K_n$.


Corollary 1.7. Let $X$ be a compact metric space. Assume $U_1 \subset U_2 \subset U_3 \subset \cdots$
form an increasing sequence of open subsets of $X$. If $\bigcup_n U_n = X$, then $U_N = X$
for some $N$.


Proof. Consider $K_n = X \setminus U_n$.


We next establish the following important extension of Corollary 1.7. 


Proposition 1.8. If $X$ is a compact metric space, then it has the property


(1.7) Every open cover $\{U_\alpha : \alpha \in A\}$ of $X$ has a finite subcover.


Proof. Each $U_\alpha$ is a union of open balls, so it suffices to show that (1.4) implies
the following:


(1.8) Every open cover $\{B_\alpha : \alpha \in A\}$ of $X$
by open balls has a finite subcover.


Now let $C = \{z_j : j \in \mathbb{N}\} \subset X$ be a countable dense subset of $X$, as in Corollary
1.5. Each $B_\alpha$ is a union of balls $B_{r_j}(z_j)$, with $z_j \in C \cap B_\alpha$, $r_j$ rational. Thus it
suffices to show that


(1.9) Every countable cover $\{B_j : j \in \mathbb{N}\}$ of $X$
by open balls has a finite subcover.


For this, we set $U_n = B_1 \cup \cdots \cup B_n$ and apply Corollary 1.7.
The following is a convenient alternative to (1.7):


(1.10) If $K_\alpha \subset X$ are closed and $\bigcap_\alpha K_\alpha = \emptyset$,
then some finite intersection is empty.


In fact, considering $U_\alpha = X \setminus K_\alpha$, we see that
(1.7) $\Longleftrightarrow$ (1.10).


The following result, known as the Heine-Borel theorem, completes Proposition
1.8.
Theorem 1.9. For a metric space $X$,


(1.4) $\Longleftrightarrow$ (1.7).


Proof. By Proposition 1.8, (1.4) $\Rightarrow$ (1.7). To prove the converse, it will suffice to
show that (1.7) $\Rightarrow$ (1.5).

So assume (1.7) holds and let $S \subset X$. If $p$ is not an accumulation point of $S$,
then there is an open set $O_p \ni p$ that contains at most one point of $S$. If $S$ has no
accumulation points, then $\{O_p : p \in X\}$ is an open cover of $X$, so by (1.7) it has
a finite subcover,

$\{O_{p_j} : 1 \le j \le N\}$.


Thus $S$ must be finite.


If $X_j$, $1 \le j \le m$, is a finite collection of metric spaces, with metrics $d_j$, we
can define a product metric space


(1.11) $X = \prod_{j=1}^m X_j$, $d(x, y) = d_1(x_1, y_1) + \cdots + d_m(x_m, y_m)$.


Another choice of metric is $\delta(x, y) = \sqrt{d_1(x_1, y_1)^2 + \cdots + d_m(x_m, y_m)^2}$. The
metrics $d$ and $\delta$ are equivalent; that is, there exist constants $C_0, C_1 \in (0, \infty)$ such
that


(1.12) $C_0\,\delta(x, y) \le d(x, y) \le C_1\,\delta(x, y)$, $\forall\, x, y \in X$.
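
For the product $X = \mathbb{R}^m$ with each factor carrying $d_j(s, t) = |s - t|$, one admissible choice in (1.12) is $C_0 = 1$ and $C_1 = \sqrt{m}$ (the upper bound is the Cauchy-Schwarz inequality). These constants are not stated in the text; the sketch below simply spot-checks them on random points, with hypothetical helper names `d_sum` and `d_euclid`.

```python
import math
import random

def d_sum(x, y):
    """Product metric d of (1.11): the sum of the coordinate distances."""
    return sum(abs(a - b) for a, b in zip(x, y))

def d_euclid(x, y):
    """Alternative metric delta: the root of the sum of squared distances."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

m = 5
rng = random.Random(1)                      # fixed seed for reproducibility
pairs = [([rng.uniform(-1, 1) for _ in range(m)],
          [rng.uniform(-1, 1) for _ in range(m)]) for _ in range(1000)]
ok = all(d_euclid(x, y) <= d_sum(x, y) + 1e-12
         and d_sum(x, y) <= math.sqrt(m) * d_euclid(x, y) + 1e-12
         for x, y in pairs)
print(ok)
```

Because the two metrics bound each other, they determine the same convergent sequences, which is all that the compactness arguments in this section use.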
We describe some useful classes of compact spaces. 


Proposition 1.10. If $X_j$ are compact metric spaces, $1 \le j \le m$, so is the product
space $X = \prod_{j=1}^m X_j$.


Proof. Suppose $(x_\nu)$ is an infinite sequence of points in $X$; let us write $x_\nu =
(x_{1\nu}, \dots, x_{m\nu})$. Pick a convergent subsequence $(x_{1\nu})$ in $X_1$, and consider the
corresponding subsequence of $(x_\nu)$, which we relabel $(x_\nu)$. Using this, pick a
convergent subsequence $(x_{2\nu})$ in $X_2$. Continue. Having a subsequence such that
$x_{j\nu} \to y_j$ in $X_j$ for each $j = 1, \dots, m$, we then have a convergent subsequence
in $X$.


The following result is called the Bolzano-Weierstrass theorem:


Proposition 1.11. If $K$ is a nonempty, closed bounded subset of $\mathbb{R}^n$, then $K$ is
compact.


Proof. The discussion above reduces the problem to showing that any closed
interval $I = [a, b]$ in $\mathbb{R}$ is compact. Suppose $S$ is a subset of $I$ with infinitely
many elements. Divide $I$ into two equal subintervals, $I_1 = [a, b_1]$, $I_2 = [b_1, b]$,
$b_1 = (a + b)/2$. Then either $I_1$ or $I_2$ must contain infinitely many elements of $S$.
Say $I_j$ does. Let $x_1$ be any element of $S$ lying in $I_j$. Now divide $I_j$ in two equal
pieces, $I_j = I_{j1} \cup I_{j2}$. One of these intervals (say $I_{jk}$) contains infinitely many
points of $S$. Pick $x_2 \in I_{jk}$ to be one such point (different from $x_1$). Then subdivide
$I_{jk}$ into two equal subintervals, and continue. We get an infinite sequence of
distinct points $x_\nu \in S$, and $|x_\nu - x_{\nu+k}| \le 2^{-\nu}(b - a)$, for $k \ge 1$. Since $\mathbb{R}$ is
complete, $(x_\nu)$ converges, say to $y \in I$. Any neighborhood of $y$ contains infinitely
many points in $S$, so we are done.


If $X$ and $Y$ are metric spaces, a function $f : X \to Y$ is said to be continuous
provided $x_\nu \to x$ in $X$ implies $f(x_\nu) \to f(x)$ in $Y$.


Proposition 1.12. If $X$ and $Y$ are metric spaces, $f : X \to Y$ continuous, and
$K \subset X$ compact, then $f(K)$ is a compact subset of $Y$.


Proof. If $(y_\nu)$ is an infinite sequence of points in $f(K)$, pick $x_\nu \in K$ such that
$f(x_\nu) = y_\nu$. If $K$ is compact, we have a subsequence $x_{\nu_j} \to p$ in $X$, and then


$y_{\nu_j} \to f(p)$ in $Y$.


If $f : X \to \mathbb{R}$ is continuous, we say $f \in C(X)$. A corollary of Proposition
1.12 is the following:


Proposition 1.13. If $X$ is a compact metric space and $f \in C(X)$, then $f$ assumes
a maximum and a minimum value on $X$.


A function $f \in C(X)$ is said to be uniformly continuous provided that, for any
$\varepsilon > 0$, there exists $\delta > 0$ such that


(1.13) $x, y \in X$, $d(x, y) \le \delta \Longrightarrow |f(x) - f(y)| \le \varepsilon$.


An equivalent condition is that $f$ have a modulus of continuity, in other words, a
monotonic function $\omega : [0, 1) \to [0, \infty)$ such that $\delta \searrow 0 \Rightarrow \omega(\delta) \searrow 0$ and such
that


(1.14) $x, y \in X$, $d(x, y) \le \delta < 1 \Longrightarrow |f(x) - f(y)| \le \omega(\delta)$.


Not all continuous functions are uniformly continuous. For example, if $X =
(0, 1) \subset \mathbb{R}$, then $f(x) = \sin(1/x)$ is continuous, but not uniformly continuous,
on $X$. There is a case where continuity implies uniform continuity:


Proposition 1.14. If X is a compact metric space and f € C(X), then f is 
uniformly continuous. 


Proof. If not, there exist $x_\nu, y_\nu \in X$ and $\varepsilon > 0$ such that $d(x_\nu, y_\nu) \le 2^{-\nu}$ but


(1.15) $|f(x_\nu) - f(y_\nu)| \ge \varepsilon$.


Taking a convergent subsequence $x_{\nu_j} \to p$, we also have $y_{\nu_j} \to p$. Now continuity
of $f$ at $p$ implies $f(x_{\nu_j}) \to f(p)$ and $f(y_{\nu_j}) \to f(p)$, contradicting (1.15).
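
The failure of uniform continuity for $\sin(1/x)$ on the noncompact interval $(0, 1)$, mentioned before Proposition 1.14, can be exhibited with explicit pairs of points. The particular points below, where $1/x$ equals $\pi/2 + 2\pi n$ and $3\pi/2 + 2\pi n$, are an illustrative choice and not taken from the text; `witness_pair` is a hypothetical helper name.

```python
import math

def f(x):
    return math.sin(1.0 / x)

def witness_pair(n):
    """Two points of (0, 1) at which sin(1/x) equals +1 and -1, respectively."""
    x = 1.0 / (math.pi / 2 + 2 * math.pi * n)
    y = 1.0 / (3 * math.pi / 2 + 2 * math.pi * n)
    return x, y

pairs = [witness_pair(n) for n in range(1, 6)]
distances = [abs(x - y) for x, y in pairs]
jumps = [abs(f(x) - f(y)) for x, y in pairs]
print(distances, jumps)
```

The distances $|x_n - y_n|$ shrink toward zero while $|f(x_n) - f(y_n)|$ stays close to 2, so no single $\delta$ can serve in (1.13) for $\varepsilon < 2$. On a compact space such pairs would have a common limit point, which is exactly what the proof above exploits.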


If $X$ and $Y$ are metric spaces, the space $C(X, Y)$ of continuous maps $f : X \to
Y$ has a natural metric structure, under some additional hypotheses. We use
(1.16) $D(f, g) = \sup_{x \in X} d(f(x), g(x))$.

This sup exists provided $f(X)$ and $g(X)$ are bounded subsets of $Y$, where to say
$B \subset Y$ is bounded is to say $d : B \times B \to [0, \infty)$ has bounded image. In particular,
this supremum exists if $X$ is compact. The following result is useful in the proof
of the fundamental local existence theorem for ODE, in Chap. 1.


Proposition 1.15. If $X$ is a compact metric space and $Y$ is a complete metric
space, then $C(X,Y)$, with the metric (1.16), is complete.


We leave the proof as an exercise. 
The following extension of Proposition 1.10 is a special case of Tychonov’s 
theorem. 


Proposition 1.16. If $\{X_j : j \in \mathbb{Z}^+\}$ are compact metric spaces, so is the product
$X = \prod_{j=1}^{\infty} X_j$.


Here, we can make $X$ a metric space by setting


(1.17)  $d(x,y) = \sum_{j=1}^{\infty} 2^{-j}\, \dfrac{d_j(p_j(x), p_j(y))}{1 + d_j(p_j(x), p_j(y))},$


where $p_j : X \to X_j$ is the projection onto the $j$th factor. It is easy to verify that if
$x_\nu \in X$, then $x_\nu \to y$ in $X$, as $\nu \to \infty$, if and only if, for each $j$, $p_j(x_\nu) \to p_j(y)$
in $X_j$.


Proof. Following the argument in Proposition 1.10, if $(x_\nu)$ is an infinite sequence
of points in $X$, we obtain a nested family of subsequences


(1.18)  $(x_\nu) \supset (x^1_\nu) \supset (x^2_\nu) \supset \cdots \supset (x^j_\nu) \supset \cdots$


such that $p_\ell(x^j_\nu)$ converges in $X_\ell$, for $1 \le \ell \le j$. The next step is a “diagonal
construction.” We set


(1.19)  $\xi_\nu = x^\nu_\nu \in X.$


Then, for each $j$, after throwing away a finite number $N(j)$ of elements, one
obtains from $(\xi_\nu)$ a subsequence of the sequence $(x^j_\nu)$ in (1.18), so $p_\ell(\xi_\nu)$ converges
in $X_\ell$ for all $\ell$. Hence $(\xi_\nu)$ is a convergent subsequence of $(x_\nu)$.
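
The product metric (1.17) and the statement that convergence in $X$ is coordinatewise convergence can be illustrated with a short sketch. It assumes, purely for illustration, that $X_j = [0, j] \subset \mathbb{R}$ with $d_j(s,t) = |s - t|$, represents a point of the product as a function `j -> coordinate`, and truncates the series after 60 terms (which changes the value by at most $2^{-60}$). The names `product_distance`, `x_nu`, and `limit` are hypothetical.

```python
def product_distance(x, y, terms=60):
    """Truncated version of (1.17) for factors X_j in R with d_j(s, t) = |s - t|."""
    total = 0.0
    for j in range(1, terms + 1):
        dj = abs(x(j) - y(j))
        total += 2.0 ** (-j) * dj / (1.0 + dj)
    return total

def x_nu(nu):
    # nu-th point of a sequence in the product: its j-th coordinate is j / nu.
    return lambda j: j / nu

def limit(j):
    # The point with all coordinates equal to 0.
    return 0.0

for nu in (1, 10, 100, 1000):
    print(nu, product_distance(x_nu(nu), limit))
```

Each coordinate $j/\nu$ tends to 0 as $\nu \to \infty$, though not uniformly in $j$; the distance (1.17) to the limit point still tends to 0, being bounded by $\sum_j 2^{-j} j/\nu = 2/\nu$.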


Topological spaces 


We turn now to the notion of a topological space. This is a set $X$, together with
a family $\mathcal{O}$ of subsets, called “open,” satisfying the following conditions:


(1.20)  $X, \emptyset$ open,
        $U_j$ open, $1 \le j \le N \implies \bigcap_{j=1}^{N} U_j$ open,
        $U_\alpha$ open, $\alpha \in A \implies \bigcup_{\alpha \in A} U_\alpha$ open,


where $A$ is any index set. It is obvious that the collection of open subsets of a
metric space, defined above, satisfies these conditions. As before, a set $S \subset X$ is
closed provided $X \setminus S$ is open. Also, we say a subset $N \subset X$ containing $p$ is a
neighborhood of $p$ provided $N$ contains an open set $U$ that in turn contains $p$.

If $X$ is a topological space and $S$ is a subset, $S$ gets a topology as follows. For
each $U$ open in $X$, $U \cap S$ is declared to be open in $S$. This is called the induced
topology.

A topological space $X$ is said to be Hausdorff provided that any distinct
$p, q \in X$ have disjoint neighborhoods. Clearly, any metric space is Hausdorff.
Most important topological spaces are Hausdorff. 

A Hausdorff topological space is said to be compact provided the following
condition holds. If $\{U_\alpha : \alpha \in A\}$ is any family of open subsets of $X$, covering
$X$ (i.e., $X = \bigcup_{\alpha \in A} U_\alpha$), then there is a finite subcover, that is, a finite subset
$\{U_{\alpha_1}, \dots, U_{\alpha_N} : \alpha_j \in A\}$ such that $X = U_{\alpha_1} \cup \cdots \cup U_{\alpha_N}$. An equivalent
formulation is the following, known as the finite intersection property. Let $\{S_\alpha :
\alpha \in A\}$ be any collection of closed subsets of $X$. If each finite collection of these
closed sets has nonempty intersection, then the complete intersection $\bigcap_{\alpha \in A} S_\alpha$
is nonempty. It is not hard to show that any compact metric space satisfies this
condition.

Any closed subset of a compact space is compact. Furthermore, any compact 
subset of a Hausdorff space is necessarily closed. 

Most of the propositions stated above for compact metric spaces have exten- 
sions to compact Hausdorff spaces. We mention one nontrivial result, which is the 
general form of Tychonov’s theorem; for a proof, see [Dug]. 


Theorem 1.17. If $S$ is any nonempty set (possibly uncountable) and if, for any
$\alpha \in S$, $X_\alpha$ is a compact Hausdorff space, then so is $X = \prod_{\alpha \in S} X_\alpha$.


A Hausdorff space $X$ is said to be locally compact provided every $p \in X$ has
a neighborhood $N$ that is compact (with the induced topology).

A Hausdorff space is said to be paracompact provided every open cover $\{U_\alpha :
\alpha \in A\}$ has a locally finite refinement, that is, an open cover $\{V_\beta : \beta \in B\}$ such
that each $V_\beta$ is contained in some $U_\alpha$ and each $p \in X$ has a neighborhood $N_p$
such that $N_p \cap V_\beta$ is nonempty for only finitely many $\beta \in B$. A typical example
of a paracompact space is a locally compact Hausdorff space $X$ that is also
$\sigma$-compact (i.e., $X = \bigcup_{n=1}^{\infty} X_n$ with $X_n$ compact). Paracompactness is a natural
condition under which to construct partitions of unity, as will be illustrated in the
next two sections.

A map $F : X \to Y$ between two topological spaces is said to be continuous
provided $F^{-1}(U)$ is open in $X$ whenever $U$ is open in $Y$. If $F : X \to Y$ is
one-to-one and onto, and both $F$ and $F^{-1}$ are continuous, $F$ is said to be a
homeomorphism. For a bijective map $F : X \to Y$, the continuity of $F^{-1}$ is equivalent
to the statement that $F(V)$ is open in $Y$ whenever $V$ is open in $X$; another
equivalent statement is that $F(S)$ is closed in $Y$ whenever $S$ is closed in $X$.

If $X$ and $Y$ are Hausdorff, and $F : X \to Y$ is continuous, then $F(K)$ is
compact in $Y$ whenever $K$ is compact in $X$. In view of the discussion above, there
arises the following useful sufficient condition for a continuous map $F : X \to Y$
to be a homeomorphism. Namely, if $X$ is compact, $Y$ is Hausdorff, and $F$ is
one-to-one and onto, then $F$ is a homeomorphism.


2. Manifolds 


A manifold is a Hausdorff topological space with an “atlas,” that is, a covering
by open sets $U_j$ together with homeomorphisms $\varphi_j : U_j \to V_j$, $V_j$ open in $\mathbb{R}^n$.
The number $n$ is called the dimension of $M$. We say that $M$ is a smooth manifold
provided the atlas has the following property. If $U_{jk} = U_j \cap U_k \ne \emptyset$, then the map


(2.1)  $\psi_{jk} : \varphi_j(U_{jk}) \to \varphi_k(U_{jk}),$


given by $\varphi_k \circ \varphi_j^{-1}$, is a smooth diffeomorphism from the open set $\varphi_j(U_{jk})$ to the
open set $\varphi_k(U_{jk})$ in $\mathbb{R}^n$. By this, we mean that $\psi_{jk}$ is $C^\infty$, with a $C^\infty$-inverse.
If the $\psi_{jk}$ are all $C^\ell$-smooth, $M$ is said to be $C^\ell$-smooth. The pairs $(U_j, \varphi_j)$ are
called local coordinate charts.

A continuous map from $M$ to another smooth manifold $N$ is said to be smooth
if it is smooth in local coordinates. Two different atlases on $M$, giving a priori
two structures of $M$ as a smooth manifold, are said to be equivalent if the identity
map on $M$ is smooth from each one of these two manifolds to the other. Actually,
a smooth manifold is considered to be defined by equivalence classes of such
atlases, under this equivalence relation.

One way manifolds arise is the following. Let $f_1, \dots, f_k$ be smooth functions
on an open set $U \subset \mathbb{R}^n$. Let $M = \{x \in U : f_j(x) = c_j\}$, for a given
$(c_1, \dots, c_k) \in \mathbb{R}^k$. Suppose that $M \ne \emptyset$ and, for each $x \in M$, the gradients $\nabla f_j$
are linearly independent at $x$. It follows easily from the implicit function theorem
that $M$ has a natural structure of a smooth manifold of dimension $n - k$. We say
$M$ is a submanifold of $U$. More generally, let $F : X \to Y$ be a smooth map
between smooth manifolds, $c \in Y$, $M = F^{-1}(c)$, and assume that $M \ne \emptyset$ and
that, at each point $x \in M$, there is a coordinate neighborhood $U$ of $x$ and $V$ of $c$
such that the derivative $DF$ at $x$ has rank $k$. More pedantically, $(U, \varphi)$ and $(V, \psi)$
are the coordinate charts, and we assume the derivative of $\psi \circ F \circ \varphi^{-1}$ has rank $k$
at $\varphi(x)$; there is a natural notion of $DF(x) : T_xX \to T_cY$, which will be defined
in the next section. In such a case, again the implicit function theorem gives $M$
the structure of a smooth manifold.

We mention a couple of other methods for producing manifolds. For one, given
any connected smooth manifold $M$, its universal covering space $\tilde{M}$ has the natural
structure of a smooth manifold. $\tilde{M}$ can be described as follows. Pick a base point
$p \in M$. For $x \in M$, consider smooth paths from $p$ to $x$, $\gamma : [0,1] \to M$. We
say two such paths $\gamma_0$ and $\gamma_1$ are equivalent if they are homotopic, that is, if
there is a smooth map $\sigma : I \times I \to M$ ($I = [0,1]$) such that $\sigma(0,t) = \gamma_0(t)$,
$\sigma(1,t) = \gamma_1(t)$, $\sigma(s,0) = p$, and $\sigma(s,1) = x$. Points in $\tilde{M}$ lying over any given
$x \in M$ consist of such equivalence classes.

Another construction produces quotient manifolds. In this situation, we have a
smooth manifold $M$ and a discrete group $\Gamma$ of diffeomorphisms on $M$. The quotient
space $\Gamma \backslash M$ consists of equivalence classes of points of $M$, where we set
$x \sim \gamma(x)$ for each $x \in M$, $\gamma \in \Gamma$. If we assume that each $x \in M$ has a neighborhood
$U$ containing no $\gamma(x)$, for $\gamma \ne e$, the identity element of $\Gamma$, then $\Gamma \backslash M$ has
a natural smooth manifold structure.

We next discuss partitions of unity. Suppose $M$ is paracompact. In this case,
using a locally finite covering of $M$ by coordinate neighborhoods, we can construct
$\psi_j \in C_0^\infty(M)$ such that, for any compact $K \subset M$, only finitely many $\psi_j$
are nonzero on $K$ (we say the sequence $\psi_j$ is locally finite) and such that, for any
$p \in M$, some $\psi_j(p) \ne 0$. Then


(2.2)  $\varphi_j(x) = \Big( \sum_k \psi_k(x)^2 \Big)^{-1} \psi_j(x)^2$


is a locally finite sequence of functions in $C_0^\infty(M)$, satisfying $\sum_j \varphi_j(x) = 1$.
Such a sequence is called a partition of unity. It has many uses.
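
The normalization (2.2) can be tried out on $M = \mathbb{R}$. The sketch below assumes one particular choice of cut-offs, namely integer translates of the standard bump $e^{-1/(1-t^2)}$, so that $\psi_j$ is supported in $[j-1, j+1]$; the function names `bump`, `psi`, and `phi` are illustrative only.

```python
import math

def bump(t):
    """Smooth function on R, positive on (-1, 1) and zero elsewhere."""
    return math.exp(-1.0 / (1.0 - t * t)) if abs(t) < 1.0 else 0.0

def psi(j, x):
    # psi_j is supported in [j - 1, j + 1]; these supports are locally finite.
    return bump(x - j)

def phi(j, x):
    # (2.2): phi_j = psi_j^2 / sum_k psi_k^2. Only indices k with |x - k| < 1
    # contribute, so a short range around floor(x) covers every nonzero term.
    k0 = math.floor(x)
    denom = sum(psi(k, x) ** 2 for k in range(k0 - 1, k0 + 3))
    return psi(j, x) ** 2 / denom

x = 0.3
print([phi(j, x) for j in range(-1, 3)], sum(phi(j, x) for j in range(-1, 3)))
```

At every point some $\psi_k$ is nonzero, so the denominator never vanishes, and the $\varphi_j$ sum to 1 by construction.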

Using local coordinates plus such cut-offs as appear in (2.2), one can easily
prove that any smooth, compact manifold $M$ can be smoothly imbedded in some
Euclidean space $\mathbb{R}^N$, though one does not obtain so easily Whitney’s optimal
value of $N$ ($N = 2 \dim M + 1$, valid for paracompact $M$, not just compact $M$),
proved in [Wh].

A more general notion than manifold is that of a smooth manifold with boundary.
In this case, $M$ is again a Hausdorff topological space, and there are two
types of coordinate charts $(U_j, \varphi_j)$. Either $\varphi_j$ takes $U_j$ to an open subset $V_j$
of $\mathbb{R}^n$ as before, or $\varphi_j$ maps $U_j$ homeomorphically onto an open subset of
$\mathbb{R}^n_+ = \{(x_1, \dots, x_n) \in \mathbb{R}^n : x_1 \ge 0\}$. Again appropriate transition maps are
required to be smooth. In case $M$ is paracompact, there is again the construction
of partitions of unity. For one simple but effective application of this construction,
see the proof of the Stokes formula in §13 of Chap. 1.


3. Vector bundles 


We begin with an intrinsic definition of a tangent vector to a smooth manifold $M$,
at a point $p \in M$. It is an equivalence class of smooth curves through $p$, that is,
of smooth maps $\gamma : I \to M$, $I$ an interval containing 0, such that $\gamma(0) = p$. The
equivalence relation is $\gamma \sim \gamma_1$ provided that, for some coordinate chart $(U, \varphi)$
about $p$, $\varphi : U \to V \subset \mathbb{R}^n$, we have


(3.1)  $\dfrac{d}{dt}(\varphi \circ \gamma)(0) = \dfrac{d}{dt}(\varphi \circ \gamma_1)(0).$


This equivalence is independent of the choice of coordinate chart about $p$.

If $V \subset \mathbb{R}^n$ is open, we have a natural identification of the set of tangent vectors
to $V$ at $p \in V$ with $\mathbb{R}^n$. In general, the set of tangent vectors to $M$ at $p$ is denoted
$T_pM$. A coordinate cover of $M$ induces a coordinate cover of $TM$, the disjoint
union of $T_pM$ as $p$ runs over $M$, making $TM$ a smooth manifold. $TM$ is called
the tangent bundle of $M$. Note that each $T_pM$ has the natural structure of a vector
space of dimension $n$, if $n$ is the dimension of $M$. If $F : X \to M$ is a smooth map
between manifolds, $x \in X$, there is a natural linear map $DF(x) : T_xX \to T_pM$,
$p = F(x)$, which agrees with the derivative as defined in §1 of Chap. 1, in local
coordinates. $DF(x)$ takes the equivalence class of a smooth curve $\gamma$ through $x$ to
that of the curve $F \circ \gamma$ through $p$.

The tangent bundle $TM$ of a smooth manifold $M$ is a special case of a vector
bundle. Generally, a smooth vector bundle $E \to M$ is a smooth manifold $E$,
together with a smooth map $\pi : E \to M$ with the following properties. For
each $p \in M$, the “fiber” $E_p = \pi^{-1}(p)$ has the structure of a vector space, of
dimension $k$, independent of $p$. Furthermore, there exists a cover of $M$ by open
sets $U_j$, and diffeomorphisms $\Phi_j : \pi^{-1}(U_j) \to U_j \times \mathbb{R}^k$ with the property that,
for each $p \in U_j$, $\Phi_j : E_p \to \{p\} \times \mathbb{R}^k \to \mathbb{R}^k$ is a linear isomorphism, and if
$U_{j\ell} = U_j \cap U_\ell \ne \emptyset$, we have smooth “transition functions”


(3.2)  $\Phi_j \circ \Phi_\ell^{-1} = \Psi_{j\ell} : U_{j\ell} \times \mathbb{R}^k \to U_{j\ell} \times \mathbb{R}^k,$


which are the identity on the first factor and such that for each $p \in U_{j\ell}$, $\Psi_{j\ell}(p)$ is
a linear isomorphism on $\mathbb{R}^k$. In the case of complex vector bundles, we systematically
replace $\mathbb{R}^k$ by $\mathbb{C}^k$ in the discussion above.

The structure above arises for the tangent bundle as follows. Let $(U_j, \varphi_j)$ be a
coordinate cover of $M$, $\varphi_j : U_j \to V_j \subset \mathbb{R}^n$. Then $\Phi_j : TU_j \to U_j \times \mathbb{R}^n$ takes
the equivalence class of smooth curves through $p \in U_j$ containing an element $\gamma$
to the pair $(p, (\varphi_j \circ \gamma)'(0)) \in U_j \times \mathbb{R}^n$.

A section of a vector bundle $E \to M$ is a smooth map $\beta : M \to E$ such
that $\pi(\beta(p)) = p$ for all $p \in M$. For example, a section of the tangent bundle
$TM \to M$ is a vector field on $M$. If $X$ is a vector field on $M$, generating a flow
$\mathcal{F}^t$, then $X(p) \in T_pM$ coincides with the equivalence class of $\gamma(t) = \mathcal{F}^t p$.

Any smooth vector bundle $E \to M$ has associated a vector bundle $E^* \to M$,
the “dual bundle” with the property that there is a natural duality of $E_p$ and $E_p^*$
for each $p \in M$. In case $E$ is the tangent bundle $TM$, this dual bundle is called
the cotangent bundle and is denoted $T^*M$.

More generally, given a vector bundle $E \to M$, other natural constructions
involving vector spaces yield other vector bundles over $M$, such as tensor bundles
$\otimes^j E \to M$ with fiber $\otimes^j E_p$, mixed tensor bundles with fiber $(\otimes^j E_p) \otimes (\otimes^k E_p^*)$,
exterior algebra bundles with fiber $\Lambda E_p$, and so forth. Note that a $k$-form, as
defined in Chap. 1, is a section of $\Lambda^k T^*M$. A section of $(\otimes^j T) \otimes (\otimes^k T^*)M$
is called a tensor field of type $(j,k)$.

A Riemannian metric tensor on a smooth manifold $M$ is a smooth, symmetric
section $g$ of $\otimes^2 T^*M$ that is positive-definite at each point $p \in M$; that is,
$g_p(X,X) > 0$ for each nonzero $X \in T_pM$. For any fixed $p \in M$, using a local
coordinate patch $(U, \varphi)$ containing $p$, one can construct a positive, symmetric section
of $\otimes^2 T^*U$. Using a partition of unity, we can hence construct a Riemannian
metric tensor on any smooth, paracompact manifold $M$. If we define the length of
a path $\gamma : [0,1] \to M$ to be


$L(\gamma) = \int_0^1 g\big(\gamma'(t), \gamma'(t)\big)^{1/2}\, dt,$

then


(3.3)  $d(p,q) = \inf\{L(\gamma) : \gamma(0) = p,\ \gamma(1) = q\}$


is a distance function making $M$ a metric space, provided $M$ is connected.
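
A small numerical sketch of the length functional $L(\gamma)$: the metric is supplied in a single coordinate chart as a function returning the matrix $(g_{ab})$ at a point, and the integral is approximated by the midpoint rule. The example metric (the Euclidean metric of the punctured plane written in polar coordinates $(r,\theta)$, $g = dr^2 + r^2\, d\theta^2$) and all function names are assumptions made for illustration; computing the infimum in (3.3) is not attempted.

```python
import math

def path_length(metric, gamma, dgamma, steps=2000):
    """Midpoint-rule approximation of L(gamma) = int_0^1 g(gamma', gamma')^(1/2) dt."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        p, v = gamma(t), dgamma(t)
        g = metric(p)
        n = len(v)
        speed_sq = sum(g[a][b] * v[a] * v[b] for a in range(n) for b in range(n))
        total += math.sqrt(speed_sq) * h
    return total

def polar_metric(p):
    # Matrix of g = dr^2 + r^2 dtheta^2 at the point p = (r, theta).
    r, _theta = p
    return [[1.0, 0.0], [0.0, r * r]]

def circle(t):
    # Circle of radius 2, traversed once, in polar coordinates.
    return (2.0, 2.0 * math.pi * t)

def circle_velocity(t):
    return (0.0, 2.0 * math.pi)

print(path_length(polar_metric, circle, circle_velocity), 4.0 * math.pi)
```

The computed length agrees with the circumference $4\pi$ of a circle of radius 2, as it must, since the length is independent of the coordinate description.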

The notion of vector bundle often aids in making intrinsic definitions of important
mathematical concepts. As an illustration, we note the following intrinsic
characterization of the contact form $\kappa$ on $T^*M$, which was specified in local
coordinates in (15.17) of Chap. 1. Let $z \in T^*M$; if $\pi : T^*M \to M$ is the natural
projection, let $p = \pi(z)$, so $z \in T_p^*M$. To define $\kappa$ at $z$, as $\kappa(z) \in T_z^*(T^*M)$, we
specify how it acts on a tangent vector $v \in T_z(T^*M)$. The specification is


(3.4)  $\langle v, \kappa(z) \rangle = \langle (D\pi)v, z \rangle,$


where $D\pi : T_z(T^*M) \to T_pM$ is the derivative of $\pi$, and the right side of (3.4)
is defined by the usual dual pairing of $T_pM$ and $T_p^*M$. It is routine to check
that this agrees with (15.17) of Chap. 1 in any coordinate system on $M$. This
establishes again the result of §14 of Chap. 1, that the symplectic form $\sigma = d\kappa$ is
well defined on a cotangent bundle $T^*M$.


4. Sard’s theorem 


Let $F : \Omega \to \mathbb{R}^n$ be a $C^1$-map, with $\Omega$ open in $\mathbb{R}^n$. If $p \in \Omega$ and $DF(p) : \mathbb{R}^n \to
\mathbb{R}^n$ is not surjective, then $p$ is said to be a critical point and $F(p)$ a critical value.
The set $C$ of critical points can be a large subset of $\Omega$, even all of it, but the set of
critical values $F(C)$ must be small in $\mathbb{R}^n$. This is part of Sard’s theorem.


Theorem 4.1. If $F : \Omega \to \mathbb{R}^n$ is a $C^1$-map, then the set of critical values of $F$
has measure 0 in $\mathbb{R}^n$.


Proof. If $K \subset \Omega$ is compact, cover $K \cap C$ with $n$-dimensional cubes $Q_j$, with
disjoint interiors, of side $\delta_j$. Pick $p_j \in C \cap Q_j$, so $L_j = DF(p_j)$ has rank $\le n - 1$.
Then, for $p_j + x \in Q_j$,

$F(p_j + x) = F(p_j) + L_j x + R_j(x), \quad \|R_j(x)\| \le \rho_j = \eta_j \delta_j,$


where $\eta_j \to 0$ as $\delta_j \to 0$. Now $L_j(Q_j)$ is certainly contained in an $(n-1)$-dimensional
cube of side $C_0 \delta_j$, where $C_0$ is an upper bound for $\sqrt{n}\,\|DF\|$ on $K$.
Since all points of $F(Q_j)$ are a distance $\le \rho_j$ from (a translate of) $L_j(Q_j)$, this
implies

$\operatorname{meas} F(Q_j) \le 2\rho_j (C_0 \delta_j + 2\rho_j)^{n-1} \le C_1 \eta_j \delta_j^n,$


provided $\delta_j$ is sufficiently small that $\rho_j \le \delta_j$. Now $\sum_j \delta_j^n$ is the volume of the
cover of $K \cap C$. For fixed $K$, this can be assumed to be bounded. Hence


$\operatorname{meas} F(C \cap K) \le C_K\, \eta,$


where $\eta = \max\{\eta_j\}$. Picking a cover by small cubes, we make $\eta$ arbitrarily small,
so $\operatorname{meas} F(C \cap K) = 0$. Letting $K_j \nearrow \Omega$, we complete the proof.


Sard’s theorem also treats the more difficult case when $\Omega$ is open in $\mathbb{R}^m$, $m >
n$. Then a more elaborate argument is needed, and one requires more differentiability,
namely that $F$ is class $C^k$, with $k \ge m - n + 1$. A proof can be found in
[Stb]. The theorem also clearly extends to smooth mappings between separable
manifolds.

Theorem 4.1 is applied in Chap. 1, in the study of degree theory. We give
another application of Theorem 4.1, to the existence of lots of Morse functions.
This application gives the typical flavor of how one uses Sard’s theorem, and it is
used in a Morse theory argument in Appendix C. The proof here is adapted from
one in [GP]. We begin with a special case:


Proposition 4.2. Let $\Omega \subset \mathbb{R}^n$ be open, $f \in C^\infty(\Omega)$. For $a \in \mathbb{R}^n$, set $f_a(x) =
f(x) - a \cdot x$. Then, for almost every $a \in \mathbb{R}^n$, $f_a$ is a Morse function, that is, it has
only nondegenerate critical points.


Proof. Consider $F(x) = \nabla f(x)$; $F : \Omega \to \mathbb{R}^n$. A point $x \in \Omega$ is a critical
point of $f_a$ if and only if $F(x) = a$, and this critical point is degenerate only if,
in addition, $a$ is a critical value of $F$. Hence the desired conclusion holds for all
$a \in \mathbb{R}^n$ that are not critical values of $F$.
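
A one-dimensional instance makes the argument concrete. Take, as an illustrative assumption, $\Omega = \mathbb{R}$ and $f(x) = x^3$, so $F(x) = f'(x) = 3x^2$ and $f_a(x) = x^3 - ax$. The only critical point of $F$ is $x = 0$, with critical value $F(0) = 0$, so the proof predicts that $f_a$ is a Morse function for every $a \ne 0$. The helper names below are hypothetical.

```python
import math

def critical_points(a):
    """Real critical points of f_a(x) = x**3 - a*x, i.e. solutions of 3x^2 = a."""
    if a < 0:
        return []
    if a == 0:
        return [0.0]
    r = math.sqrt(a / 3.0)
    return [-r, r]

def is_morse(a):
    # f_a''(x) = 6x; a critical point is nondegenerate when this is nonzero.
    return all(6.0 * x != 0.0 for x in critical_points(a))

for a in (-1.0, 0.0, 0.5, 2.0):
    print(a, critical_points(a), is_morse(a))
```

The exceptional set $\{0\}$ is exactly the set of critical values of $F$, which has measure zero, in agreement with Theorem 4.1.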


Now for the result on manifolds: 


Proposition 4.3. Let $M$ be an $n$-dimensional manifold, imbedded in $\mathbb{R}^K$. Let $f \in
C^\infty(M)$, and, for $a \in \mathbb{R}^K$, let $f_a(x) = f(x) - a \cdot x$, for $x \in M \subset \mathbb{R}^K$. Then, for
almost all $a \in \mathbb{R}^K$, $f_a$ is a Morse function.


Proof. Each $p \in M$ has a neighborhood $\Omega_p$ such that some $n$ of the coordinates
$x_\nu$ on $\mathbb{R}^K$ produce coordinates on $\Omega_p$. Let’s say $x_1, \dots, x_n$ do it. Let
$(a_{n+1}, \dots, a_K)$ be fixed, but arbitrary. Then, by Proposition 4.2, for almost every
$(a_1, \dots, a_n) \in \mathbb{R}^n$, $f_a$ has only nondegenerate critical points on $\Omega_p$. By Fubini’s
theorem, we deduce that, for almost every $a \in \mathbb{R}^K$, $f_a$ has only nondegenerate
critical points on $\Omega_p$. (The set of bad $a \in \mathbb{R}^K$ is readily seen to be a countable
union of closed sets, hence measurable.) Covering $M$ by a countable family of
such sets $\Omega_p$, we finish the proof.


5. Lie groups 


A Lie group $G$ is a group that is also a smooth manifold, such that the group
operations $G \times G \to G$ and $G \to G$ given by $(g,h) \mapsto gh$ and $g \mapsto g^{-1}$ are
smooth maps. Let $e$ denote the identity element of $G$. For each $g \in G$, we have
left and right translations, $L_g$ and $R_g$, diffeomorphisms on $G$, defined by


(5.1)  $L_g(h) = gh, \quad R_g(h) = hg.$

The set of left-invariant vector fields $X$ on $G$, that is, vector fields satisfying

(5.2)  $(DL_g)X(h) = X(gh),$


is called the Lie algebra of $G$, and is denoted $\mathfrak{g}$. If $X, Y \in \mathfrak{g}$, then the Lie bracket
$[X,Y]$ belongs to $\mathfrak{g}$. Evaluation of $X \in \mathfrak{g}$ at $e$ provides a linear isomorphism of $\mathfrak{g}$
with $T_eG$.

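A vector field $X$ on $G$ belongs to $\mathfrak{g}$ if and only if the flow $\mathcal{F}^t_X$ it generates
commutes with $L_g$ for all $g \in G$, that is, $g(\mathcal{F}^t_X h) = \mathcal{F}^t_X(gh)$ for all $g, h \in G$. If
we set


(5.3)  $\gamma_X(t) = \mathcal{F}^t_X e,$

we obtain $\gamma_X(t+s) = \mathcal{F}^s_X\big((\mathcal{F}^t_X e) \cdot e\big) = (\mathcal{F}^t_X e)(\mathcal{F}^s_X e)$, and hence

(5.4)  $\gamma_X(s+t) = \gamma_X(s)\gamma_X(t),$

for $s, t \in \mathbb{R}$; we say $\gamma_X$ is a smooth, one-parameter subgroup of $G$. Clearly,

(5.5)  $\gamma_X'(0) = X(e).$


Conversely, if $\gamma$ is any smooth, one-parameter group satisfying $\gamma'(0) = X(e)$,
then $\mathcal{F}^t g = g \cdot \gamma(t)$ defines a flow generated by the vector field $X \in \mathfrak{g}$ coinciding
with $X(e)$ at $e$.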
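
For a matrix group the one-parameter subgroup through a matrix $X$ is $\gamma_X(t) = e^{tX}$, a standard fact that the sketch below takes for granted. It evaluates the matrix exponential by its power series (adequate for the small matrices used here; the helpers `mat_mul`, `mat_exp`, and `gamma` are illustrative, not a library API) and checks the group law (5.4) for the generator of rotations of the plane.

```python
import math

def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_exp(a, terms=40):
    """Partial sum of the series sum_k a^k / k!, for small matrices of modest norm."""
    n = len(a)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

def gamma(x, t):
    # One-parameter subgroup gamma_X(t) = exp(tX) of a matrix group.
    return mat_exp([[t * v for v in row] for row in x])

X = [[0.0, -1.0], [1.0, 0.0]]  # generator of rotations of the plane
lhs = gamma(X, 0.3 + 0.5)
rhs = mat_mul(gamma(X, 0.3), gamma(X, 0.5))
print(lhs)
print(rhs)
```

Here $\gamma_X(t)$ is rotation through the angle $t$, so (5.4) reduces to the addition formulas for sine and cosine.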

The exponential map


(5.6)  $\operatorname{Exp} : \mathfrak{g} \to G$

is defined by

(5.7)  $\operatorname{Exp}(X) = \gamma_X(1).$


Note that $\gamma_{sX}(t) = \gamma_X(st)$, so $\operatorname{Exp}(tX) = \gamma_X(t)$. In particular, under the
identification $\mathfrak{g} \approx T_eG$,

(5.8)  $D\operatorname{Exp}(0) : T_eG \to T_eG$ is the identity map.


The fact that each element $X \in \mathfrak{g}$ generates a one-parameter group has the
following generalization, to a fundamental result of S. Lie. Let $\mathfrak{h} \subset \mathfrak{g}$ be a Lie
subalgebra, that is, $\mathfrak{h}$ is a linear subspace and $X_j \in \mathfrak{h} \Rightarrow [X_1, X_2] \in \mathfrak{h}$. By
Frobenius’s theorem (established in §9 of Chap. 1), through each point $p$ of $G$ there
is a smooth manifold $M_p$ of dimension $k = \dim \mathfrak{h}$, which is an integral manifold
for $\mathfrak{h}$ (i.e., $\mathfrak{h}$ spans the tangent space of $M_p$ at each $q \in M_p$). We can take $M_p$
to be the maximal such (connected) manifold, and then it is unique. Let $H$ be the
maximal integral manifold of $\mathfrak{h}$ containing the identity element $e$.


Proposition 5.1. H is a subgroup of G. 


Proof. Take $h_0 \in H$ and consider $H_0 = h_0^{-1}H$; clearly, $e \in H_0$. By left
invariance, $H_0$ is also an integral manifold of $\mathfrak{h}$, so $H_0 = H$. This shows that
$h_0, h_1 \in H \Rightarrow h_0^{-1}h_1 \in H$, so $H$ is a group.


In addition to left-invariant vector fields on $G$, one can consider all left-invariant
differential operators on $G$. This is an algebra, isomorphic to the “universal
enveloping algebra” $\mathfrak{U}(\mathfrak{g})$, which can be defined as


(5.9)  $\mathfrak{U}(\mathfrak{g}) = \bigotimes \mathfrak{g}_{\mathbb{C}} \big/ J,$


where $\mathfrak{g}_{\mathbb{C}}$ is the complexification of $\mathfrak{g}$ and $J$ is the two-sided ideal in the tensor
algebra $\bigotimes \mathfrak{g}_{\mathbb{C}}$ generated by $\{XY - YX - [X,Y] : X, Y \in \mathfrak{g}\}$.

There are other classes of objects whose left-invariant elements are of par- 
ticular interest, such as tensor fields (particularly metric tensors) and differential 
forms. 

Given any $\alpha_0 \in \Lambda^k T_e^*G$, there is a unique $k$-form $\alpha$ on $G$, invariant under $L_g$,
that is, satisfying $L_g^*\alpha = \alpha$ for all $g \in G$, equal to $\alpha_0$ at $e$. In case $k = n =
\dim G$, if $\omega_0$ is a nonzero element of $\Lambda^n T_e^*G$, the corresponding left-invariant
$n$-form $\omega$ on $G$ defines also an orientation on $G$, and hence a left-invariant volume
form on $G$, called a (left) Haar measure. It is uniquely defined up to a constant
multiple. Similarly one has a right Haar measure. It is very important to be able
to integrate over a Lie group using Haar measure.

In many but not all cases left Haar measure is also right Haar measure; then $G$
is said to be unimodular. Note that if $\omega \in \Lambda^n(G)$ gives a left Haar measure, then,
for each $g \in G$, $R_g^*\omega$ is also a left Haar measure, so we must have


(5.10)  $R_g^*\omega = \mu(g)\omega, \quad \mu : G \to (0,\infty).$


Furthermore, $\mu(gg') = \mu(g)\mu(g')$. If $G$ is compact, this implies $\mu(g) = 1$ for all
$g$, so all compact Lie groups are unimodular.

There are some particular Lie groups that we want to mention. Let $n \in \mathbb{Z}^+$
and $F = \mathbb{R}$ or $\mathbb{C}$. Then $Gl(n,F)$ is the group of all invertible $n \times n$ matrices with
entries in $F$. We set


(5.11)  $Sl(n,F) = \{A \in Gl(n,F) : \det A = 1\}.$

We also set

(5.12)  $O(n) = \{A \in Gl(n,\mathbb{R}) : A^t = A^{-1}\},$
        $SO(n) = \{A \in O(n) : \det A = 1\},$

and

(5.13)  $U(n) = \{A \in Gl(n,\mathbb{C}) : A^* = A^{-1}\},$
        $SU(n) = \{A \in U(n) : \det A = 1\}.$


The Lie algebras of the groups listed above also have special names. We have
$gl(n,F) = M(n,F)$, the set of $n \times n$ matrices with entries in $F$. Also,

(5.14)  $sl(n,F) = \{A \in M(n,F) : \operatorname{Tr} A = 0\},$
        $o(n) = so(n) = \{A \in M(n,\mathbb{R}) : A^t = -A\},$
        $u(n) = \{A \in M(n,\mathbb{C}) : A^* = -A\},$
        $su(n) = \{A \in u(n) : \operatorname{Tr} A = 0\}.$
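
The correspondence between (5.14) and (5.11)-(5.13) can be spot-checked numerically: exponentiating a skew-symmetric real matrix should give an orthogonal matrix, and exponentiating a trace-zero matrix should give a matrix of determinant 1. The sketch below uses a power-series matrix exponential written out by hand; the helper names and the sample matrices are illustrative assumptions.

```python
def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def transpose(a):
    return [list(col) for col in zip(*a)]

def mat_exp(a, terms=40):
    """Partial sum of the series sum_k a^k / k!, for small matrices of modest norm."""
    n = len(a)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

# An element of so(3): A^t = -A.
A_skew = [[0.0, -0.3, 0.2], [0.3, 0.0, -0.5], [-0.2, 0.5, 0.0]]
R = mat_exp(A_skew)
RtR = mat_mul(transpose(R), R)        # should be the identity, so R lies in O(3)

# An element of sl(2, R): Tr B = 0.
B = [[1.0, 2.0], [3.0, -1.0]]
E = mat_exp(B)
det_E = E[0][0] * E[1][1] - E[0][1] * E[1][0]   # should equal 1

print(RtR)
print(det_E)
```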


There are many other important matrix Lie groups and Lie algebras with
special names, but we will not list any more here. See [Helg, T], or [Var1] for
such lists.


6. The Campbell-Hausdorff formula


The Campbell-Hausdorff formula has the form

(6.1)  $\operatorname{Exp}(X)\operatorname{Exp}(Y) = \operatorname{Exp}(C(X,Y)),$


where $G$ is any Lie group, with Lie algebra $\mathfrak{g}$, and $\operatorname{Exp} : \mathfrak{g} \to G$ is the exponential
map defined by (5.7); $X$ and $Y$ are elements of $\mathfrak{g}$ in a sufficiently small neighborhood
$U$ of zero. The map $C : U \times U \to \mathfrak{g}$ has a universal form, independent of $\mathfrak{g}$.
We give a demonstration similar to one in [HS], which was also independently
discovered by [Str].

We begin with the case $G = Gl(n,\mathbb{C})$ and produce an explicit formula for the
matrix-valued analytic function $X(s)$ of $s$ in the identity


(6.2)  $e^{X(s)} = e^X e^{sY}$

near $s = 0$. Note that this function satisfies the ODE


(6.3)  $\dfrac{d}{ds} e^{X(s)} = e^{X(s)} Y.$


We can produce an ODE for $X(s)$ by using the following formula, derived in
Exercises 7-10 of §4, Chap. 1:

(6.4)  $\dfrac{d}{ds} e^{X(s)} = e^{X(s)} \int_0^1 e^{-\tau X(s)} X'(s)\, e^{\tau X(s)}\, d\tau.$

As shown there, we can rewrite this as

(6.5)  $\dfrac{d}{ds} e^{X(s)} = e^{X(s)}\, \Xi(\operatorname{ad} X(s))\, X'(s).$

Here, ad is defined as a linear operator on the space of $n \times n$ matrices by

(6.6)  $\operatorname{ad} X(Y) = XY - YX;$


the function $\Xi$ is


(6.7)  $\Xi(z) = \int_0^1 e^{-\tau z}\, d\tau = \dfrac{1 - e^{-z}}{z},$


an entire holomorphic function of $z$; and a holomorphic function of an operator is
defined either as in Exercise 10 of that set, or as in §5 of Appendix A. Comparing
(6.3) and (6.5), we obtain


(6.8)  $\Xi(\operatorname{ad} X(s))\, X'(s) = Y, \quad X(0) = X.$

We can obtain a more convenient ODE for $X(s)$ as follows. Note that

(6.9)  $e^{\operatorname{ad} X(s)} = \operatorname{Ad} e^{X(s)} = \operatorname{Ad} e^X \cdot \operatorname{Ad} e^{sY} = e^{\operatorname{ad} X} e^{s \operatorname{ad} Y}.$

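Now let $\Psi(\zeta)$ be holomorphic near $\zeta = 1$ and satisfy


(6.10)  $\Psi(e^z) = \dfrac{1}{\Xi(z)} = \dfrac{z}{1 - e^{-z}};$

explicitly,

(6.11)  $\Psi(\zeta) = \dfrac{\zeta \log \zeta}{\zeta - 1}.$

It follows that

(6.12)  $\Psi\big(e^{\operatorname{ad} X} e^{s \operatorname{ad} Y}\big)\, \Xi(\operatorname{ad} X(s)) = I,$

so we can transform (6.8) to

(6.13)  $X'(s) = \Psi\big(e^{\operatorname{ad} X} e^{s \operatorname{ad} Y}\big) Y, \quad X(0) = X.$


Integrating gives the Campbell-Hausdorff formula for $X(s)$ in (6.2):

(6.14)  $X(s) = X + \int_0^s \Psi\big(e^{\operatorname{ad} X} e^{t \operatorname{ad} Y}\big) Y\, dt.$


This is valid for $\|sY\|$ small enough, if also $X$ is close enough to 0.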
Taking the $s = 1$ case, we can rewrite this formula as

(6.15)  $e^X e^Y = e^{C(X,Y)}, \quad C(X,Y) = X + \int_0^1 \Psi\big(e^{\operatorname{ad} X} e^{t \operatorname{ad} Y}\big) Y\, dt.$

The formula (6.15) gives a power series in $\operatorname{ad} X$ and $\operatorname{ad} Y$ which is norm-summable
provided


(6.16)  $\|\operatorname{ad} X\| \le x, \quad \|\operatorname{ad} Y\| \le y,$

with $e^{x+y} - 1 < 1$, that is,

(6.17)  $x + y < \log 2.$
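
Formula (6.15) can be evaluated numerically for small matrices. The sketch below is one possible arrangement, not the text's own procedure: it uses the expansion $\Psi(1+w) = 1 + \sum_{k \ge 1} (-1)^{k+1} w^k / (k(k+1))$ of (6.11) about $\zeta = 1$, applies $e^{\operatorname{ad} X} e^{t \operatorname{ad} Y}$ to a matrix $Z$ as $gZg^{-1}$ with $g = e^X e^{tY}$ (by (6.9)), and integrates in $t$ with the composite Simpson rule. All helper names, the truncation orders, and the sample matrices are assumptions chosen for illustration.

```python
def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_add(a, b, s=1.0):
    # Returns a + s * b.
    return [[a[i][j] + s * b[i][j] for j in range(len(a))] for i in range(len(a))]

def scale(a, s):
    return [[s * v for v in row] for row in a]

def mat_exp(a, terms=40):
    n = len(a)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, a)]
        result = mat_add(result, term)
    return result

def psi_applied(x, y, t, terms=40):
    """Psi(e^{ad X} e^{t ad Y}) Y, using the series of Psi about zeta = 1."""
    g = mat_mul(mat_exp(x), mat_exp(scale(y, t)))
    g_inv = mat_mul(mat_exp(scale(y, -t)), mat_exp(scale(x, -1.0)))
    power, total = y, y                       # W^0 Y, with W = e^{ad X} e^{t ad Y} - I
    for k in range(1, terms):
        power = mat_add(mat_mul(mat_mul(g, power), g_inv), power, -1.0)   # W applied once more
        total = mat_add(total, power, (-1.0) ** (k + 1) / (k * (k + 1)))
    return total

def campbell_hausdorff(x, y, panels=100):
    """C(X, Y) of (6.15), with the t-integral computed by Simpson's rule."""
    h = 1.0 / (2 * panels)
    integral = [[0.0] * len(x) for _ in x]
    for i in range(2 * panels + 1):
        weight = 1.0 if i in (0, 2 * panels) else (4.0 if i % 2 else 2.0)
        integral = mat_add(integral, psi_applied(x, y, i * h), weight * h / 3.0)
    return mat_add(x, integral)

X = [[0.0, 0.1], [0.0, 0.0]]
Y = [[0.0, 0.0], [0.1, 0.0]]
C = campbell_hausdorff(X, Y)
lhs = mat_mul(mat_exp(X), mat_exp(Y))   # e^X e^Y
rhs = mat_exp(C)                        # e^{C(X, Y)}
print(C)
```

For these small $X$ and $Y$ the operator norms of $\operatorname{ad} X$ and $\operatorname{ad} Y$ are well inside the range (6.17), so the series converges quickly; to second order $C(X,Y)$ agrees with the familiar leading terms $X + Y + \tfrac12 [X,Y]$.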


We can extend the analysis above to the case where $X$ and $Y$ are vector fields
on a manifold $M$, asking for a vector field $X(s)$ such that


(6.18)  $\mathcal{F}^1_{X(s)} = \mathcal{F}^1_X\, \mathcal{F}^s_Y,$


where $\mathcal{F}^t_X$ is the flow generated by $X$, evaluated at time $t$. If there is such a family
$X(s)$, depending smoothly on $s$, material in §6 of Chap. 1, in place of material in
§4 cited above, leads to a formula parallel to (6.4), and hence to (6.8), in this
context. However, we cannot always solve (6.8), because $\operatorname{ad} X(s)$ tends not to act
as a bounded operator on a Banach space of vector fields, and in fact one cannot
always solve (6.18) for $X(s)$ in this case. However, if there is a finite-dimensional
Lie algebra $\mathfrak{g}$ of vector fields containing $X$ and $Y$, then the analysis (6.9)-(6.17)
extends. We have


(6.19)  $\mathcal{F}^t_X\, \mathcal{F}^t_Y = \mathcal{F}^1_{C(t,X,Y)},$

with

(6.20)  $C(t,X,Y) = tX + \int_0^1 \Psi\big(e^{t \operatorname{ad} X} e^{st \operatorname{ad} Y}\big)\, tY\, ds,$


provided $\|\operatorname{ad} tX\| + \|\operatorname{ad} tY\| < \log 2$, the operator norm $\|\operatorname{ad} X\|$ being computed
using any convenient norm on $\mathfrak{g}$. In particular, if $M = G$ is a Lie group with Lie
algebra $\mathfrak{g}$, and $X, Y \in \mathfrak{g}$, this analysis applies to yield the Campbell-Hausdorff
formula for general Lie groups.


7. Representations of Lie groups and Lie algebras 


We define a representation of a Lie group $G$ on a finite-dimensional vector space
$V$ to be a smooth map


(7.1)  $\pi : G \to \operatorname{End}(V)$

such that


(7.2)  $\pi(e) = I, \quad \pi(gg') = \pi(g)\pi(g'), \quad g, g' \in G.$

If $F \in C_0(G)$, that is, if $F$ is continuous with compact support, we can define
$\pi(F) \in \operatorname{End}(V)$ by


(7.3)  $\pi(F)v = \int_G F(g)\pi(g)v\, dg.$


We get different results depending on whether left or right Haar measure is used.
Right now, let us use right Haar measure. Then, for $g \in G$, we have


(7.4)  $\pi(F)\pi(g)v = \int_G F(x)\pi(xg)v\, dx = \int_G F(xg^{-1})\pi(x)v\, dx.$

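We also define the derived representation


(7.5)  $d\pi : \mathfrak{g} \to \operatorname{End}(V)$

by

(7.6)  $d\pi = D\pi(e) : T_eG \to \operatorname{End}(V),$


using the identification $\mathfrak{g} \approx T_eG$. Thus, for $X \in \mathfrak{g}$,


(7.7)  $d\pi(X)v = \lim_{t \to 0} \dfrac{1}{t}\big[\pi(\operatorname{Exp} tX)v - v\big].$


The following result states that $d\pi$ is a Lie algebra homomorphism.

Proposition 7.1. For $X, Y \in \mathfrak{g}$, we have

(7.8)  $[d\pi(X), d\pi(Y)] = d\pi([X,Y]).$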

Proof. We will first produce a formula for $\pi(F)d\pi(X)$, given $F \in C_0^\infty(G)$.
In fact, making use of (7.4), we have


(7.9)  $\pi(F)d\pi(X)v = \lim_{t \to 0} \dfrac{1}{t} \int_G \big[F(g)\pi(g)\pi(\operatorname{Exp} tX) - F(g)\pi(g)\big]v\, dg$
       $= \lim_{t \to 0} \dfrac{1}{t} \int_G \big[F(g \cdot \operatorname{Exp}(-tX)) - F(g)\big]\pi(g)v\, dg$
       $= -\pi(XF)v,$


where $XF$ denotes the left-invariant vector field $X$ applied to $F$. It follows that


(7.10)  $\pi(F)\big[d\pi(X)d\pi(Y) - d\pi(Y)d\pi(X)\big]v = \pi(YXF - XYF)v = -\pi([X,Y]F)v,$


which by (7.9) is equal to $\pi(F)d\pi([X,Y])v$. Now, if $F$ is supported near $e \in G$
and integrates to 1, it is easily seen that $\pi(F)$ is close to the identity $I$, so this implies
(7.8).

There is a representation of $G$ on $\mathfrak{g}$, called the adjoint representation, defined
as follows. Consider


(7.11)  $K_g : G \to G, \quad K_g(h) = ghg^{-1}.$

Then $K_g(e) = e$, and we set

(7.12)  $\operatorname{Ad}(g) = DK_g(e) : T_eG \to T_eG,$


identifying $T_eG \approx \mathfrak{g}$. Note that $K_g \circ K_{g'} = K_{gg'}$, so the chain rule implies
$\operatorname{Ad}(g)\operatorname{Ad}(g') = \operatorname{Ad}(gg')$.

Note that $\gamma(t) = g \operatorname{Exp}(tX) g^{-1}$ is a one-parameter subgroup of $G$ satisfying
$\gamma'(0) = \operatorname{Ad}(g)X$. Hence


(7.13)  $\operatorname{Exp}(t \operatorname{Ad}(g)X) = g \operatorname{Exp}(tX) g^{-1}.$

In particular,

(7.14)  $\operatorname{Exp}\big((\operatorname{Ad} \operatorname{Exp} sY)\, tX\big) = \operatorname{Exp}(sY)\operatorname{Exp}(tX)\operatorname{Exp}(-sY).$


Now, the right side of (7.14) is equal to $\mathcal{F}^{-s}_Y \circ \mathcal{F}^t_X \circ \mathcal{F}^s_Y(e)$, so by results on the
Lie derivative of a vector field given in (8.1)-(8.3) of Chap. 1, we have


(7.15)  $\operatorname{Ad}(\operatorname{Exp} sY)X = \mathcal{F}^s_{Y\#} X.$


If we take the $s$-derivative at $s = 0$, we get a formula for the derived representation
of Ad, which is denoted ad, rather than $d\operatorname{Ad}$. Using (8.3)-(8.5) of Chap. 1, we
have


(7.16)  $\operatorname{ad}(Y)X = [Y, X].$


In other words, the adjoint representation of $\mathfrak{g}$ on $\mathfrak{g}$ is given by the Lie bracket.
We mention that Jacobi’s identity for Lie algebras is equivalent to the statement
that


(7.17)  $\operatorname{ad}([X,Y]) = [\operatorname{ad}(X), \operatorname{ad}(Y)], \quad \forall\, X, Y \in \mathfrak{g}.$
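
For a matrix group one has $\operatorname{Ad}(g)X = gXg^{-1}$, a standard fact assumed in the following sketch. It approximates the $s$-derivative of $\operatorname{Ad}(\operatorname{Exp} sY)X$ at $s = 0$ by a central difference, to compare with (7.16), and evaluates both sides of (7.17) on a third matrix $Z$. The helper names and the choice of the standard basis of $sl(2,\mathbb{R})$ are illustrative.

```python
def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_add(a, b, s=1.0):
    # Returns a + s * b.
    return [[a[i][j] + s * b[i][j] for j in range(len(a))] for i in range(len(a))]

def scale(a, s):
    return [[s * v for v in row] for row in a]

def mat_exp(a, terms=40):
    n = len(a)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, a)]
        result = mat_add(result, term)
    return result

def bracket(a, b):
    return mat_add(mat_mul(a, b), mat_mul(b, a), -1.0)

def Ad_exp(y, s, x):
    # Ad(Exp sY) X = e^{sY} X e^{-sY} for matrices.
    return mat_mul(mat_mul(mat_exp(scale(y, s)), x), mat_exp(scale(y, -s)))

def ad_by_difference(y, x, s=1e-4):
    # Central-difference approximation of d/ds Ad(Exp sY) X at s = 0.
    return scale(mat_add(Ad_exp(y, s, x), Ad_exp(y, -s, x), -1.0), 1.0 / (2.0 * s))

X = [[0.0, 1.0], [0.0, 0.0]]
Y = [[1.0, 0.0], [0.0, -1.0]]
Z = [[0.0, 0.0], [1.0, 0.0]]

numeric = ad_by_difference(Y, X)
exact = bracket(Y, X)                                   # (7.16)

jacobi_lhs = bracket(bracket(X, Y), Z)                  # ad([X, Y]) Z
jacobi_rhs = mat_add(bracket(X, bracket(Y, Z)),         # [ad X, ad Y] Z
                     bracket(Y, bracket(X, Z)), -1.0)
print(numeric, exact)
```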


If $V$ has a positive-definite inner product, we say that the representation (7.1) is
unitary provided $\pi(g)$ is a unitary operator on $V$, for each $g \in G$ (i.e., $\pi(g)^{-1} =
\pi(g)^*$).

We say the representation (7.1) is irreducible if $V$ has no proper linear subspace
invariant under $\pi(g)$ for all $g \in G$. Irreducible unitary representations are
particularly important. The following version of Schur’s lemma is useful.


Proposition 7.2. A unitary representation $\pi$ of $G$ on $V$ is irreducible if and only
if, for any $A \in \operatorname{End}(V)$,


(7.18)  $\pi(g)A = A\pi(g), \ \forall\, g \in G \implies A = \lambda I.$


Proof. First, suppose $\pi$ is irreducible and $A$ commutes with $\pi(g)$ for all $g$. Then
so does $A^*$, hence $A + A^*$ and $(1/i)(A - A^*)$, so we may as well suppose $A = A^*$.
Now, any polynomial $p(A)$ commutes with $\pi(g)$ for all $g$, so it follows that each
projection $P_\lambda$ onto an eigenspace of $A$ commutes with all $\pi(g)$. Hence the range
of $P_\lambda$ is invariant under $\pi$, so if $P_\lambda \ne 0$, it must be $I$, and $A = \lambda I$.

Conversely, suppose the implication (7.18) holds. Then if $W \subset V$ is invariant
under $\pi$, the orthogonal projection $P$ of $V$ onto $W$ must commute with all $\pi(g)$,
so $P$ is a scalar multiple of $I$, hence either 0 or $I$. This completes the proof.


Corollary 7.3. Assume $G$ is connected. Then a unitary representation of $G$ on $V$
is irreducible if and only if, for any $A \in \operatorname{End}(V)$,


(7.19)  $d\pi(X)A = A\,d\pi(X), \ \forall\, X \in \mathfrak{g} \implies A = \lambda I.$

Proof. We mention that

(7.20)  $\pi(\operatorname{Exp} tX) = e^{t\, d\pi(X)}$

and leave the details to the reader.


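Given a representation $\pi$ of $G$ on $V$, there is also a representation of the universal
enveloping algebra $\mathfrak{U}(\mathfrak{g})$, defined as follows. If


(7.21)  $P = \sum_{\nu \le m} c_{i_1 \cdots i_\nu} X_{i_1} \cdots X_{i_\nu}, \quad X_{i_j} \in \mathfrak{g},$

with $c_{i_1 \cdots i_\nu} \in \mathbb{C}$, we have

(7.22)  $d\pi(P) = \sum_{\nu \le m} c_{i_1 \cdots i_\nu}\, d\pi(X_{i_1}) \cdots d\pi(X_{i_\nu}).$


Proposition 7.4. Suppose $G$ is connected. Let $P \in \mathfrak{U}(\mathfrak{g})$, and assume

(7.23)  $PX = XP, \quad \forall\, X \in \mathfrak{g}.$


If $\pi$ is an irreducible unitary representation of $G$ on $V$, then $d\pi(P)$ is a scalar
multiple of the identity, that is,


$d\pi(P) = \lambda I.$

Proof. Immediate from Corollary 7.3.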


So far in this section we have concentrated on finite-dimensional representations.
It is also of interest to consider infinite-dimensional representations. One
example is the right-regular representation of $G$ on $L^2(G)$:


(7.24)  $R(g)f(x) = f(xg).$


If $G$ has right-invariant Haar measure, then $R(g)$ is a unitary operator on $L^2(G)$
for each $g \in G$, and one readily verifies that $R(g)R(g') = R(gg')$. However, the
smoothness hypothesis made on $\pi$ in (7.1) does not hold here. When working with
an infinite-dimensional representation $\pi$ of $G$ on a Banach space $V$, one makes
instead the hypothesis of strong continuity: For each $v \in V$, the map $g \mapsto \pi(g)v$ is
continuous from $G$ to $V$, with its norm topology. If the map is $C^\infty$, one says $v$ is a
smooth vector for the representation $\pi$. For example, each $f \in C_0^\infty(G)$ is a smooth
vector for the representation (7.24). Of course, $C_0^\infty(G)$ is dense in $L^2(G)$. More
generally, the set of smooth vectors for any strongly continuous representation $\pi$
of $G$ on a Banach space $V$ is dense in $V$. In fact, for $F \in C_0^\infty(G)$, $\pi(F)$ is still
well defined by (7.3), and the space


(7.25)  $\mathcal{G}_\pi = \{\pi(F)v : F \in C_0^\infty(G),\ v \in V\}$


is readily verified to be a dense subspace of $V$ consisting of smooth vectors. If $V$
is finite dimensional, this implies that $\mathcal{G}_\pi = V$, so any strongly continuous,
finite-dimensional representation of a Lie group automatically possesses the smoothness
property used above.

The occasional use made of Lie group representations in this book will not 
require much development of the theory of infinite-dimensional representations, 
so we will not go further into it here. One can find treatments in many places, 
including [HT, Kn, T, Var2, Wal1]. 


8. Representations of compact Lie groups 


Throughout this section, $G$ will be a compact Lie group. If $\pi$ is a representation
of $G$ on a finite-dimensional complex vector space $V$, we can always put an inner
product on $V$ so that $\pi$ is unitary. Indeed, let $((u,v))$ be any Hermitian inner
product on $V$, and set

(8.1)    $(u,v) = \int_G ((\pi(g)u, \pi(g)v))\, dg.$
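For a finite group, which is a compact group whose normalized Haar measure is the counting measure divided by the group order, the integral in (8.1) becomes a finite average and the construction can be checked exactly. The following is a minimal sketch under that assumption; the two-element group and the matrix `M` are illustrative choices, not taken from the text.

```python
from fractions import Fraction

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    return [list(row) for row in zip(*a)]

def averaged_gram(reps):
    """Gram matrix H of (u, v) = average over g of ((pi(g)u, pi(g)v)),
    where ((u, v)) is the standard inner product and all matrices are real."""
    n = len(reps[0])
    total = [[Fraction(0)] * n for _ in range(n)]
    for m in reps:
        mtm = mat_mul(transpose(m), m)
        total = [[total[i][j] + mtm[i][j] for j in range(n)] for i in range(n)]
    return [[x / len(reps) for x in row] for row in total]

def preserves(h, m):
    """True if m^T h m = h, i.e. m is an isometry for the form u^T h v."""
    return mat_mul(transpose(m), mat_mul(h, m)) == h

# Hypothetical example: a non-orthogonal representation of {e, s} with s^2 = e.
IDENTITY = [[Fraction(1), Fraction(0)], [Fraction(0), Fraction(1)]]
M = [[Fraction(1), Fraction(1)], [Fraction(0), Fraction(-1)]]
REPS = [IDENTITY, M]

H = averaged_gram(REPS)
```

Here `preserves(IDENTITY, M)` fails while `preserves(H, M)` holds, which is the finite-group analogue of the statement that $\pi$ becomes unitary for the averaged inner product.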


Note that if $V_1$ is a subspace of $V$ invariant under $\pi(g)$ for all $g \in G$, and if
$\pi$ is unitary, then the orthogonal complement of $V_1$ is also invariant. Thus, if $\pi$ is
not irreducible on $V$, we can decompose it, and we can obviously continue this
process only a finite number of times if $\dim V$ is finite. Thus $\pi$ breaks up into a
direct sum of irreducible unitary representations of $G$.

Let $\pi$ and $\lambda$ be two representations of $G$, on $V$ and $W$, respectively. We say
they are equivalent if there is $A \in \mathcal{L}(V, W)$, invertible, such that

(8.2)    $\pi(g) = A^{-1} \lambda(g) A, \quad \forall\, g \in G.$

If these representations are unitary, we say they are unitarily equivalent if $A$ can
be taken to be unitary.

Suppose that $\pi$ and $\lambda$ are irreducible and unitary, and (8.2) holds. Then
$\pi(g)^* = A^* \lambda(g)^* (A^{-1})^*$, for all $g \in G$, so $\pi(g) = (A^*A)\pi(g)(A^*A)^{-1}$. By
Schur's lemma, $A^*A$ must be a (positive) scalar, say $b^2$. Replacing $A$ by $b^{-1}A$
makes it unitary. Breaking up a general $\pi$ into irreducible representations, we
deduce that whenever $\pi$ and $\lambda$ are finite-dimensional, unitary representations, if
they are equivalent, then they are unitarily equivalent.

We now derive some results known as Weyl orthogonality relations, which play
an important role in the study of representations of compact Lie groups. To begin,
let $\pi$ and $\lambda$ be two irreducible representations of a compact group $G$, on
finite-dimensional spaces $V$ and $W$, respectively. Consider the representation $\nu = \pi \otimes \bar{\lambda}$
on $V \otimes W' \approx \mathcal{L}(W,V)$, defined by

(8.3)    $\nu(g)(A) = \pi(g) A \lambda(g)^{-1}, \quad g \in G, \ A \in \mathcal{L}(W,V).$

Let $Z$ be the linear subspace of $\mathcal{L}(W,V)$ on which $\nu$ acts trivially. We want to
specify $Z$. Note that $A_0 \in Z$ if and only if

(8.4)    $\pi(g)A_0 = A_0\lambda(g), \quad \forall\, g \in G.$

Since this implies that the range of $A_0$ is invariant under $\pi$ and $\operatorname{Ker} A_0$ is invariant
under $\lambda$, we see that either $A_0 = 0$ or $A_0$ is an isomorphism from $W$ to $V$. In the
latter case, we have $\pi(g) = A_0\lambda(g)A_0^{-1}$, so the representations $\pi$ and $\lambda$ would
have to be equivalent. In this case, for arbitrary $A \in Z$, we would have

    $\pi(g)A = A\lambda(g) = AA_0^{-1}\pi(g)A_0,$

or $\pi(g)AA_0^{-1} = AA_0^{-1}\pi(g)$, so Schur's lemma implies that $AA_0^{-1}$ is a scalar. We
have proved the following result:

Proposition 8.1. If $\pi$ and $\lambda$ are finite-dimensional, irreducible representations of
$G$ and if $\nu = \pi \otimes \bar{\lambda}$, then the trivial representation occurs not at all in $\nu$ if $\pi$
and $\lambda$ are not equivalent, and it occurs acting on a one-dimensional subspace of
$V \otimes W'$ if $\pi$ and $\lambda$ are equivalent.


The next ingredient for the orthogonality relation is the study of the operator

(8.5)    $P = \int_G \pi(g)\, dg.$

Here $\pi$ is a finite-dimensional representation of the compact group $G$, not
necessarily irreducible, and $dg$ denotes Haar measure, with total mass 1. Note that

(8.6)    $\pi(y)P = \int_G \pi(yg)\, dg = P = P\pi(y),$

for all $y \in G$. Hence

(8.7)    $P^2 = P \int_G \pi(g)\, dg = \int_G P\pi(g)\, dg = P,$

so $P$ is a projection. Also, if $\pi$ is unitary, we see that $P = P^*$.

Now, if $\pi$ is unitary, it gives a representation both on the range $R(P)$ and on
the kernel $\operatorname{Ker} P$. It is clear from (8.5) that, given $v \in V$, $\|Pv\| < \|v\|$ unless
$\pi(g)v = v$, for all $g \in G$. Consequently, $\pi$ operates like the identity on $R(P)$,
but we do not have $\pi(g)v = v$ for all $g \in G$, for any nonzero $v \in \operatorname{Ker} P$. We have
proved:

Proposition 8.2. If $\pi$ is a unitary representation of $G$ on $V$, then $P$, given by
(8.5), is the orthogonal projection onto the subspace of $V$ on which $\pi$ acts trivially.

The following is a special case:

Corollary 8.3. If $\pi$ is a nontrivial, irreducible, unitary representation, and $P$ is
given by (8.5), then $P = 0$.
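The properties (8.6) and (8.7) can be checked exactly when $G$ is a finite group, with the Haar integral replaced by the group average. The sketch below assumes the cyclic group of order 3 acting on $\mathbb{C}^3$ by cyclic shifts; that choice and the helper names are illustrative only.

```python
from fractions import Fraction

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def shift_matrix(n, s):
    """Permutation matrix sending the basis vector e_j to e_{(j+s) mod n}."""
    return [[Fraction(1) if i == (j + s) % n else Fraction(0)
             for j in range(n)] for i in range(n)]

def averaged_operator(reps):
    """P = (1/|G|) * sum of pi(g), the finite-group version of (8.5)."""
    n = len(reps[0])
    return [[sum(m[i][j] for m in reps) / len(reps) for j in range(n)]
            for i in range(n)]

N = 3
REPS = [shift_matrix(N, s) for s in range(N)]
P = averaged_operator(REPS)

# P should be a projection that absorbs every pi(y), as in (8.6) and (8.7).
is_projection = mat_mul(P, P) == P
absorbs_group = all(mat_mul(m, P) == P and mat_mul(P, m) == P for m in REPS)
```

In this example every entry of `P` equals 1/3, so `P` is the orthogonal projection onto the constant vectors, which form the subspace on which the shifts act trivially, in agreement with Proposition 8.2.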


We apply Proposition 8.2 to

(8.8)    $Q = \int_G \pi(g) \otimes \bar{\lambda}(g)\, dg,$

with $\pi$ and $\lambda$ irreducible. By Proposition 8.1, we see that

(8.9)    $Q = 0$ if $\pi$ and $\lambda$ are not equivalent.

On the other hand, if $\lambda = \pi$, then $Q$ has as its range the set of scalar multiples of
the identity operator on $V$ (if $\pi$ acts on $V$). Note that $\pi \otimes \bar{\pi}$ leaves invariant the
space of elements $A \in \mathcal{L}(V, V)$ of trace zero, which is the orthogonal complement
(with respect to the Hilbert-Schmidt inner product) of the space of scalars, so $Q$
must annihilate this space. Thus $Q$ is given by

(8.10)    $Q(A) = (d^{-1} \operatorname{Tr} A)\, I, \quad \pi = \lambda, \quad d = \dim V.$

The identities (8.9) and (8.10) are equivalent to the Weyl orthogonality relations.
If we express $\pi$ and $\lambda$ as matrices, with respect to some orthonormal bases, we
get the following theorem:

Theorem 8.4. Let $\pi$ and $\lambda$ be inequivalent irreducible, unitary representations of
$G$, on $V$ and $W$, with matrix entries $\pi_{ij}$ and $\lambda_{k\ell}$, respectively. Then

(8.11)    $\int_G \pi_{ij}(g)\, \overline{\lambda_{k\ell}(g)}\, dg = 0.$

Also,

(8.12)    $\int_G \pi_{ij}(g)\, \overline{\pi_{k\ell}(g)}\, dg = 0,$ unless $i = k$ and $j = \ell$.

Furthermore,

(8.13)    $\int_G |\pi_{ij}(g)|^2\, dg = d^{-1},$

where $d = \dim V = \operatorname{Tr} \pi(e)$.
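The relations (8.11)-(8.13) can be tested numerically on a finite group, where the Haar integral is the average over group elements. The sketch below uses the two-dimensional real orthogonal representation of the symmetry group of an equilateral triangle (six elements) as a stand-in for $\pi$, and the trivial representation as $\lambda$; both choices are assumptions made for illustration.

```python
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def triangle_group():
    """The six matrices r^a s^b: r is rotation by 120 degrees, s a reflection."""
    c, s = math.cos(2 * math.pi / 3), math.sin(2 * math.pi / 3)
    rotation = [[c, -s], [s, c]]
    reflection = [[1.0, 0.0], [0.0, -1.0]]
    elements = []
    power = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(3):
        elements.append(power)
        elements.append(mat_mul(power, reflection))
        power = mat_mul(power, rotation)
    return elements

def mean_product(elements, i, j, k, l):
    """Group average of pi_ij(g) * conj(pi_kl(g)); the entries are real here."""
    return sum(g[i][j] * g[k][l] for g in elements) / len(elements)

def mean_entry(elements, i, j):
    """Group average of pi_ij(g), i.e. the pairing with the trivial representation."""
    return sum(g[i][j] for g in elements) / len(elements)

ELEMENTS = triangle_group()
```

With $d = 2$, the averages `mean_product(ELEMENTS, i, j, k, l)` should be close to 1/2 when $(i,j) = (k,l)$ and close to 0 otherwise, and every `mean_entry(ELEMENTS, i, j)` should be close to 0.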
Hence, if $\{\pi^k\}$ is a complete set of inequivalent, irreducible, unitary
representations of $G$ on spaces $V_k$, of dimension $d_k$, then

(8.14)    $d_k^{1/2}\, \pi^k_{ij}(g)$

forms an orthonormal set in $L^2(G)$. The following is the Peter-Weyl theorem:

Theorem 8.5. The orthonormal set (8.14) is complete.

In other words, the linear span of (8.14) is dense. If $G$ is given as a group of
unitary $N \times N$ matrices, this result is elementary. In fact, the linear span of (8.14)
is an algebra (take tensor products of $\pi^k$ and $\pi^\ell$ and decompose into irreducibles),
and is closed under complex conjugates (pass from $\pi^k$ to $\bar{\pi}^k$), so if we know it
separates points (which is clear if $G \subset U(N)$), the Stone-Weierstrass theorem
applies.

If we do not know a priori that $G \subset U(N)$, we can prove the theorem by
considering the right-regular representation of $G$ on $L^2(G)$:

(8.15)    $R(g) f(x) = f(xg).$

If we endow $G$ with a bi-invariant Riemannian metric and consider the associated
Laplace operator $\Delta$, which is then a bi-invariant differential operator, we see that
the representation $R$ leaves invariant each eigenspace $E_\ell$ of $\Delta$. Now, $E_\ell$ is
finite-dimensional, and the restriction $R_\ell$ of $R$ to $E_\ell$ splits into irreducibles:

(8.16)    $E_\ell = E_{\ell 1} \oplus \cdots \oplus E_{\ell N}, \quad N = N(\ell),$

say $R_\ell|_{E_{\ell m}} = R_{\ell m}$. Thus there is a unitary map $A : E_{\ell m} \to V_k$, for some
$k = k(\ell, m)$, such that $R_{\ell m} = A^{-1}\pi^k A$. If $\{e_j\}$ is an orthonormal basis of $V_k$
with respect to which the matrix of $\pi^k(g)$ is $(\pi^k_{ij}(g))$, then $u_j = A^{-1}e_j$ gives an
orthonormal basis of $E_{\ell m}$, and we have

(8.17)    $u_j(xg) = \sum_i \pi^k_{ij}(g)\, u_i(x).$

In particular, taking $x = e$,

(8.18)    $u_j(g) = \sum_i c_i\, \pi^k_{ij}(g), \quad c_i = u_i(e).$

This shows that each space $E_{\ell m}$ consists of finite linear combinations of the
functions in (8.14). Since

    $L^2(G) = \bigoplus_\ell E_\ell = \bigoplus_{\ell, m} E_{\ell m},$

this proves Theorem 8.5.
The following corollary will be useful in the next section. 
Corollary 8.6. If $G_1$ and $G_2$ are two compact Lie groups, then the irreducible,
unitary representations of $G = G_1 \times G_2$ are, up to unitary equivalence, precisely
those of the form

(8.19)    $\pi(g) = \pi_1(g_1) \otimes \pi_2(g_2),$

where $g = (g_1, g_2) \in G$, and $\pi_j \in \hat{G}_j$ is a general, irreducible, unitary
representation of $G_j$.

Proof. Given irreducible, unitary representations $\pi_j$ of $G_j$, the irreducibility and
unitarity of (8.19) are clear. It remains to prove the completeness of the set of
such representations. For this, it suffices to show that the matrix entries of such
representations have dense linear span in $L^2(G_1 \times G_2)$. This follows from the
general elementary fact that tensor products of orthonormal bases of $L^2(G_1)$ and
$L^2(G_2)$ form an orthonormal basis of $L^2(G_1 \times G_2)$.


9. Representations of SU(2) and related groups 


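The group $SU(2)$ is the group of $2 \times 2$, complex, unitary matrices of determinant 1,
that is,

(9.1)    $SU(2) = \left\{ \begin{pmatrix} z_1 & z_2 \\ -\bar{z}_2 & \bar{z}_1 \end{pmatrix} : |z_1|^2 + |z_2|^2 = 1, \ z_j \in \mathbb{C} \right\}.$

As a set, $SU(2)$ is naturally identified with the unit sphere $S^3$ in $\mathbb{C}^2$. Its Lie algebra
$su(2)$ consists of $2 \times 2$, complex, skew-adjoint matrices of trace zero. A basis of
$su(2)$ is formed by

(9.2)    $X_1 = \frac{1}{2}\begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}, \quad X_2 = \frac{1}{2}\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}, \quad X_3 = \frac{1}{2}\begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}.$

Note the commutation relations

(9.3)    $[X_1, X_2] = X_3, \quad [X_2, X_3] = X_1, \quad [X_3, X_1] = X_2.$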
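The commutation relations (9.3) can be confirmed by direct matrix multiplication. The following sketch encodes the basis (9.2) as nested lists of complex numbers; the helper names are illustrative.

```python
def mat_mul(a, b):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def commutator(a, b):
    """[a, b] = ab - ba."""
    ab, ba = mat_mul(a, b), mat_mul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]

def close(a, b, tol=1e-12):
    """Entrywise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))

def is_skew_adjoint(a, tol=1e-12):
    """True if a^* = -a."""
    return all(abs(a[i][j] + a[j][i].conjugate()) < tol
               for i in range(2) for j in range(2))

# The basis (9.2) of su(2).
X1 = [[0.5j, 0j], [0j, -0.5j]]
X2 = [[0j, 0.5 + 0j], [-0.5 + 0j, 0j]]
X3 = [[0j, 0.5j], [0.5j, 0j]]

relations_hold = (close(commutator(X1, X2), X3)
                  and close(commutator(X2, X3), X1)
                  and close(commutator(X3, X1), X2))
```

The same helpers also confirm that each $X_j$ is skew-adjoint with trace zero, as required for membership in $su(2)$.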


The group $SO(3)$ is the group of linear isometries of $\mathbb{R}^3$ with determinant 1. Its
Lie algebra $so(3)$ is spanned by elements $J_\ell$, $\ell = 1, 2, 3$, which generate rotations
about the $x_\ell$-axis. One readily verifies that these satisfy the same commutation
relations as in (9.3). Thus $SU(2)$ and $SO(3)$ have isomorphic Lie algebras. There is
an explicit homomorphism

(9.4)    $p : SU(2) \longrightarrow SO(3),$

which exhibits $SU(2)$ as a double cover of $SO(3)$. One way to construct $p$ is the
following. The linear span $\mathfrak{g}$ of (9.2) over $\mathbb{R}$ is a three-dimensional, real vector
space, with an inner product given by $(X,Y) = -\operatorname{Tr} XY$. It is clear that the
representation $p$ of $SU(2)$ by a group of linear transformations on $\mathfrak{g}$ given by
$p(g)X = gXg^{-1}$ preserves this inner product and gives (9.4). Note that $\operatorname{Ker} p = \{\pm I\}$.
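The construction of $p$ can be made concrete by computing the $3 \times 3$ matrix of $X \mapsto gXg^{-1}$ in the basis (9.2). Since $(X_i, X_i) = -\operatorname{Tr} X_i^2 = 1/2$ and the basis is orthogonal, the coefficient of $X_i$ in $Y$ is $-2\operatorname{Tr}(YX_i)$. The sketch below uses that formula; the function names and the sample group element are illustrative assumptions.

```python
import math

X1 = [[0.5j, 0j], [0j, -0.5j]]
X2 = [[0j, 0.5 + 0j], [-0.5 + 0j, 0j]]
X3 = [[0j, 0.5j], [0.5j, 0j]]
BASIS = [X1, X2, X3]

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    """Conjugate transpose; equals the inverse for a unitary matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def su2_element(z1, z2):
    """Matrix of the form (9.1), after scaling (z1, z2) to unit length."""
    r = math.sqrt(abs(z1) ** 2 + abs(z2) ** 2)
    z1, z2 = z1 / r, z2 / r
    return [[z1, z2], [-z2.conjugate(), z1.conjugate()]]

def covering_matrix(g):
    """3x3 real matrix of X -> g X g^{-1} in the basis X1, X2, X3."""
    g_inv = dagger(g)
    columns = []
    for xj in BASIS:
        y = mat_mul(mat_mul(g, xj), g_inv)
        # coefficient of X_i in y is (y, X_i) / (X_i, X_i) = -2 Tr(y X_i)
        columns.append([(-2 * sum(mat_mul(y, xi)[m][m] for m in range(2))).real
                        for xi in BASIS])
    return [[columns[j][i] for j in range(3)] for i in range(3)]

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def is_rotation(r, tol=1e-12):
    """True if r is orthogonal with determinant 1."""
    gram_ok = all(abs(sum(r[m][i] * r[m][j] for m in range(3)) - (i == j)) < tol
                  for i in range(3) for j in range(3))
    return gram_ok and abs(det3(r) - 1) < tol
```

For any `g = su2_element(z1, z2)`, `covering_matrix(g)` should pass `is_rotation`, and replacing `g` by `-g` should leave the result unchanged, reflecting $\operatorname{Ker} p = \{\pm I\}$.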
Regarding the $X_j$ as left-invariant vector fields on $SU(2)$, we set

(9.5)    $\Delta = X_1^2 + X_2^2 + X_3^2,$

a second-order, left-invariant differential operator. It follows easily from (9.3) that
$X_j$ and $\Delta$ commute:

(9.6)    $\Delta X_j = X_j \Delta, \quad 1 \le j \le 3.$


Suppose $\pi$ is an irreducible unitary representation of $SU(2)$ on $V$. Then $\pi$
induces a skew-adjoint representation $d\pi$ of the Lie algebra $su(2)$ and an algebraic
representation of the universal enveloping algebra. By (9.6), $d\pi(\Delta)$ commutes
with $d\pi(X_j)$, $j = 1, \dots, 3$. Thus, if $\pi$ is irreducible, Proposition 7.4 implies

(9.7)    $d\pi(\Delta) = -\lambda^2 I,$

for some $\lambda \in \mathbb{R}$. (Since $d\pi(\Delta)$ is a sum of squares of skew-adjoint operators, it
must be negative.) Let

(9.8)    $L_j = d\pi(X_j).$

Now we will diagonalize $L_1$ on $V$. Set

(9.9)    $V_\mu = \{v \in V : L_1 v = i\mu v\}, \quad V = \bigoplus_{i\mu \in \operatorname{spec} L_1} V_\mu.$

The structure of $\pi$ is defined by how $L_2$ and $L_3$ behave on $V_\mu$. It is convenient
to set

(9.10)    $L_\pm = L_2 \mp iL_3.$

We have the following key identity, as a direct consequence of (9.3):

(9.11)    $[L_1, L_\pm] = \pm i L_\pm.$

Using this, we can establish the following:

Lemma 9.1. We have

(9.12)    $L_\pm : V_\mu \longrightarrow V_{\mu \pm 1}.$

In particular, if $i\mu \in \operatorname{spec} L_1$, then either $L_+ = 0$ on $V_\mu$ or $i(\mu + 1) \in \operatorname{spec} L_1$, and
also either $L_- = 0$ on $V_\mu$ or $i(\mu - 1) \in \operatorname{spec} L_1$.

Proof. Let $v \in V_\mu$. By (9.11) we have

    $L_1 L_\pm v = L_\pm L_1 v \pm i L_\pm v = i(\mu \pm 1) L_\pm v,$

which establishes the lemma. The operators $L_\pm$ are called ladder operators.
To continue, if $\pi$ is irreducible on $V$, we claim that $\operatorname{spec} i^{-1}L_1$ must consist of
a sequence

(9.13)    $\operatorname{spec} i^{-1}L_1 = \{\mu_0, \mu_0 + 1, \dots, \mu_0 + k = \mu_1\},$

with

(9.14)    $L_+ : V_{\mu_0 + j} \to V_{\mu_0 + j + 1}$ isomorphism, for $0 \le j \le k - 1$,

and

(9.15)    $L_- : V_{\mu_1 - j} \to V_{\mu_1 - j - 1}$ isomorphism, for $0 \le j \le k - 1$.

In fact, we can compute

(9.16)    $L_- L_+ = L_2^2 + L_3^2 + i[L_3, L_2] = -\lambda^2 - L_1^2 - iL_1$

on $V$, and

(9.17)    $L_+ L_- = -\lambda^2 - L_1^2 + iL_1$

on $V$, so

(9.18)    $L_- L_+ = \mu(\mu + 1) - \lambda^2$ on $V_\mu$,
          $L_+ L_- = \mu(\mu - 1) - \lambda^2$ on $V_\mu$.

Note that since $L_2$ and $L_3$ are skew-adjoint, $L_+ = -L_-^*$, so

    $L_- L_+ = -L_+^* L_+, \quad L_+ L_- = -L_-^* L_-.$

Thus

    $\operatorname{Ker} L_+ = \operatorname{Ker} L_- L_+, \quad \operatorname{Ker} L_- = \operatorname{Ker} L_+ L_-.$

These observations establish (9.13)-(9.15).

Considering that $d\pi$ acts on the linear span of $\{v, L_+ v, \dots, L_+^{\mu_1 - \mu_0} v\}$ for any
nonzero $v \in V_{\mu_0}$, and that irreducibility implies this must be all of $V$, we have

(9.19)    $\dim V_\mu = 1, \quad \mu_0 \le \mu \le \mu_1.$

From (9.18) we see that $\mu_1(\mu_1 + 1) = \lambda^2 = \mu_0(\mu_0 - 1)$. Hence,

(9.20)    $\mu_1 - \mu_0 = k \implies \mu_0 = -\frac{k}{2}, \quad \mu_1 = \frac{k}{2},$

and we have

(9.21)    $\dim V = k + 1, \quad \lambda^2 = \frac{1}{4}k(k + 2) = \frac{1}{4}\left((\dim V)^2 - 1\right).$
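The algebra behind (9.20) and (9.21) is short enough to write out. The starting identities come from (9.18): $L_+ = 0$ on $V_{\mu_1}$ forces $\mu_1(\mu_1 + 1) - \lambda^2 = 0$, and $L_- = 0$ on $V_{\mu_0}$ forces $\mu_0(\mu_0 - 1) - \lambda^2 = 0$. A worked version, as a sketch:

```latex
\begin{align*}
\mu_1(\mu_1 + 1) &= \mu_0(\mu_0 - 1), \qquad \mu_1 = \mu_0 + k, \\
(\mu_0 + k)(\mu_0 + k + 1) &= \mu_0^2 - \mu_0, \\
\mu_0^2 + (2k + 1)\mu_0 + k(k + 1) &= \mu_0^2 - \mu_0, \\
2(k + 1)\mu_0 &= -k(k + 1), \\
\mu_0 = -\tfrac{k}{2}, &\qquad \mu_1 = \mu_0 + k = \tfrac{k}{2}, \\
\lambda^2 = \mu_1(\mu_1 + 1) &= \tfrac{k}{2}\cdot\tfrac{k + 2}{2}
  = \tfrac14 k(k + 2) = \tfrac14\bigl((k + 1)^2 - 1\bigr).
\end{align*}
```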


A nonzero element $v \in V$ such that $L_+ v = 0$ is called a "highest-weight
vector" for the representation $\pi$ of $SU(2)$ on $V$. It follows from the analysis above
that all highest-weight vectors for an irreducible representation on $V$ belong to
the one-dimensional space $V_{\mu_1}$.

The calculations above establish that an irreducible, unitary representation $\pi$
of $SU(2)$ on $V$ is determined uniquely up to equivalence by $\dim V$. We are ready
to prove the following:

Proposition 9.2. There is precisely one equivalence class of irreducible, unitary
representations of $SU(2)$ on $\mathbb{C}^{k+1}$, for each $k = 0, 1, 2, \dots$.


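We will realize each such representation, which is denoted $D_{k/2}$, on the space

(9.22)    $\mathcal{P}_k = \{p(z) : p \text{ homogeneous polynomial of degree } k \text{ on } \mathbb{C}^2\},$

with $SU(2)$ acting on $\mathcal{P}_k$ by

(9.23)    $D_{k/2}(g) f(z) = f(g^{-1}z), \quad g \in SU(2), \ z \in \mathbb{C}^2.$

Note that, for $X \in su(2)$,

(9.24)    $dD_{k/2}(X) f(z) = \frac{d}{dt} f(e^{-tX}z)\Big|_{t=0} = -(\partial_1 f, \partial_2 f) \cdot X \begin{pmatrix} z_1 \\ z_2 \end{pmatrix},$

where $\partial_j f = \partial f/\partial z_j$. A calculation gives

(9.25)    $L_1 f(z) = -\frac{i}{2}(z_1 \partial_1 f - z_2 \partial_2 f),$
          $L_2 f(z) = -\frac{1}{2}(z_2 \partial_1 f - z_1 \partial_2 f),$
          $L_3 f(z) = -\frac{i}{2}(z_2 \partial_1 f + z_1 \partial_2 f).$

In particular, for

(9.26)    $\varphi_{kj}(z) = z_1^{k-j} z_2^j \in \mathcal{P}_k, \quad 0 \le j \le k,$

we have

(9.27)    $L_1 \varphi_{kj} = i\left(-\frac{k}{2} + j\right)\varphi_{kj},$

so

(9.28)    $V = \mathcal{P}_k \implies \operatorname{span} \varphi_{kj} = V_{-k/2 + j}, \quad 0 \le j \le k.$

Note that

(9.29)    $L_+ f(z) = -z_2 \partial_1 f(z), \quad L_- f(z) = z_1 \partial_2 f(z),$

so

(9.30)    $L_+ \varphi_{kj} = -(k - j)\varphi_{k,j+1}, \quad L_- \varphi_{kj} = j\varphi_{k,j-1}.$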
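Formulas (9.27) and (9.30) determine the matrices of $L_1$, $L_+$, $L_-$ on $\mathcal{P}_k$ in the monomial basis $\varphi_{k0}, \dots, \varphi_{kk}$, so the structure found in (9.3), (9.11), and (9.21) can be checked numerically. In the sketch below, $L_2$ and $L_3$ are recovered from (9.10); the function names are illustrative. The monomial basis is not normalized, so these matrices need not be skew-adjoint, but the commutation relations do not depend on the choice of basis.

```python
def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][m] * b[m][j] for m in range(n)) for j in range(n)]
            for i in range(n)]

def lin_comb(ca, a, cb, b):
    """Entrywise ca * a + cb * b."""
    n = len(a)
    return [[ca * a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]

def commutator(a, b):
    return lin_comb(1, mat_mul(a, b), -1, mat_mul(b, a))

def close(a, b, tol=1e-9):
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(n) for j in range(n))

def ladder_matrices(k):
    """Matrices of L_1, L_+, L_- on P_k; column j holds the image of phi_{kj}."""
    n = k + 1
    l1 = [[0j] * n for _ in range(n)]
    lp = [[0j] * n for _ in range(n)]
    lm = [[0j] * n for _ in range(n)]
    for j in range(n):
        l1[j][j] = 1j * (j - k / 2)           # (9.27)
        if j + 1 <= k:
            lp[j + 1][j] = complex(-(k - j))  # (9.30), raising
        if j - 1 >= 0:
            lm[j - 1][j] = complex(j)         # (9.30), lowering
    return l1, lp, lm

def check_structure(k):
    """Verify (9.11), (9.3), and the Casimir value from (9.7) and (9.21)."""
    n = k + 1
    l1, lp, lm = ladder_matrices(k)
    l2 = lin_comb(0.5, lp, 0.5, lm)       # L_+ + L_- = 2 L_2
    l3 = lin_comb(0.5j, lp, -0.5j, lm)    # L_- - L_+ = 2i L_3
    casimir = lin_comb(1, lin_comb(1, mat_mul(l1, l1), 1, mat_mul(l2, l2)),
                       1, mat_mul(l3, l3))
    expected = [[(-k * (k + 2) / 4 if i == j else 0) for j in range(n)]
                for i in range(n)]
    return (close(commutator(l1, lp), lin_comb(1j, lp, 0, lp))
            and close(commutator(l1, lm), lin_comb(-1j, lm, 0, lm))
            and close(commutator(l1, l2), l3)
            and close(commutator(l2, l3), l1)
            and close(commutator(l3, l1), l2)
            and close(casimir, expected))
```

Calling `check_structure(k)` for small values of `k` should return `True`, consistent with $\lambda^2 = \frac{1}{4}k(k+2)$.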




We see that the structure of the representation $D_{k/2}$ of $SU(2)$ on $\mathcal{P}_k$ is as
described in (9.12)-(9.21). The last detail is to show that $D_{k/2}$ is irreducible.
If not, then $\mathcal{P}_k$ splits into a direct sum of several irreducible subspaces, each of
which has a one-dimensional space of highest-weight vectors, annihilated by $L_+$.
But as seen above, within $\mathcal{P}_k$, only multiples of $z_2^k$ are annihilated by $L_+$, so the
representation $D_{k/2}$ of $SU(2)$ on $\mathcal{P}_k$ is irreducible.

We can deduce the classification of irreducible, unitary representations of
$SO(3)$ from the result above as follows. We have the covering homomorphism
(9.4), and $\operatorname{Ker} p = \{\pm I\}$. Now each irreducible representation $d_j$ of $SO(3)$ defines
an irreducible representation $d_j \circ p$ of $SU(2)$, which must be equivalent to one
of the representations $D_{k/2}$ described above. On the other hand, $D_{k/2}$ factors
through to yield a representation of $SO(3)$ if and only if $D_{k/2}$ is the identity on
$\operatorname{Ker} p$, that is, if and only if $D_{k/2}(-I) = I$. Clearly, this holds if and only if $k$
is even. Thus all the irreducible, unitary representations of $SO(3)$ are given by
representations $\overline{D}_j$ on $\mathcal{P}_{2j}$, uniquely defined by

(9.31)    $\overline{D}_j(p(g)) = D_j(g), \quad g \in SU(2).$

It is conventional to use $D_j$ instead of $\overline{D}_j$ to denote such a representation of
$SO(3)$. Note that $D_j$ represents $SO(3)$ on a space of dimension $2j + 1$, and

(9.32)    $dD_j(\Delta) = -j(j + 1).$


Also, we can classify the irreducible representations of $U(2)$, using the results
on $SU(2)$. To do this, use the exact sequence

(9.33)    $1 \to K \to S^1 \times SU(2) \to U(2) \to 1,$

where "1" denotes the trivial multiplicative group, and

(9.34)    $K = \{(\omega, g) \in S^1 \times SU(2) : g = \omega^{-1}I, \ \omega^2 = 1\}.$

The irreducible representations of $S^1 \times SU(2)$ are given by

(9.35)    $\pi_{mk}(\omega, g) = \omega^m D_{k/2}(g)$ on $\mathcal{P}_k$,

with $m, k \in \mathbb{Z}$, $k \ge 0$. Those giving a complete set of irreducible
representations of $U(2)$ are those for which $\pi_{mk}(K) = I$, that is, those for which
$(-1)^m D_{k/2}(-I) = I$. Since $D_{k/2}(-I) = (-1)^k I$, we see the condition is that
$m + k$ be an even integer.

We now consider the representations of $SO(4)$. First note that $SO(4)$ is covered
by $SU(2) \times SU(2)$. To see this, equate the unit sphere $S^3 \subset \mathbb{R}^4$, with its standard
metric, to $SU(2)$, with a bi-invariant metric. Then $SO(4)$ is the connected
component of the identity in the isometry group of $S^3$. Meanwhile, $SU(2) \times SU(2)$ acts
as a group of isometries, by

(9.36)    $(g_1, g_2) \cdot x = g_1 x g_2^{-1}, \quad g_j \in SU(2).$

Thus we have a map

(9.37)    $\tau : SU(2) \times SU(2) \longrightarrow SO(4).$

This is a group homomorphism. Note that $(g_1, g_2) \in \operatorname{Ker} \tau$ implies $g_1 = g_2 = \pm I$.
Furthermore, a dimension count shows $\tau$ must be surjective, so

(9.38)    $SO(4) \approx SU(2) \times SU(2)/\{\pm(I, I)\}.$
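The action (9.36) can be illustrated by identifying $x = (a, b, c, d) \in \mathbb{R}^4$ with the matrix of the form (9.1) having $z_1 = a + ib$, $z_2 = c + id$, without the unit-length constraint. Its determinant is then $|x|^2$, so the action preserves length. This identification and the helper names below are assumptions made for the sketch.

```python
import math

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    """Conjugate transpose; equals the inverse for a unitary matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def su2_element(z1, z2):
    """Matrix of the form (9.1), after scaling (z1, z2) to unit length."""
    r = math.sqrt(abs(z1) ** 2 + abs(z2) ** 2)
    z1, z2 = z1 / r, z2 / r
    return [[z1, z2], [-z2.conjugate(), z1.conjugate()]]

def point_to_matrix(x):
    """Matrix [[z1, z2], [-conj(z2), conj(z1)]] with z1 = a + ib, z2 = c + id."""
    a, b, c, d = x
    return [[complex(a, b), complex(c, d)], [complex(-c, d), complex(a, -b)]]

def matrix_to_point(m):
    return (m[0][0].real, m[0][0].imag, m[0][1].real, m[0][1].imag)

def act(g1, g2, x):
    """The action (9.36): x -> g1 x g2^{-1}."""
    return matrix_to_point(mat_mul(mat_mul(g1, point_to_matrix(x)), dagger(g2)))

def norm(x):
    return math.sqrt(sum(t * t for t in x))
```

For any pair `g1`, `g2` built with `su2_element`, `norm(act(g1, g2, x))` should agree with `norm(x)`, and negating both `g1` and `g2` should give the same image point, matching the kernel $\{\pm(I, I)\}$ in (9.38).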


As shown in §8 (Corollary 8.6), if $G_1$ and $G_2$ are compact Lie groups, and $G = G_1 \times G_2$, then
the set of all irreducible, unitary representations of $G$, up to unitary equivalence,
is given by

(9.39)    $\{\pi(g) = \pi_1(g_1) \otimes \pi_2(g_2) : \pi_j \in \hat{G}_j\},$

where $g = (g_1, g_2) \in G$ and $\hat{G}_j$ parameterizes the irreducible, unitary
representations of $G_j$. In particular, the irreducible unitary representations of $SU(2) \times SU(2)$,
up to equivalence, are precisely the representations of the form

(9.40)    $\gamma_{k\ell}(g) = D_{k/2}(g_1) \otimes D_{\ell/2}(g_2), \quad k, \ell \in \{0, 1, 2, \dots\},$

acting on $\mathcal{P}_k \otimes \mathcal{P}_\ell \approx \mathbb{C}^{k+1} \otimes \mathbb{C}^{\ell+1}$. By (9.38), the irreducible, unitary
representations of $SO(4)$ are given by all $\gamma_{k\ell}$ such that $k + \ell$ is even, since, for
$p_0 = (-I, -I) \in SU(2) \times SU(2)$, $\gamma_{k\ell}(p_0) = (-1)^{k+\ell} I$.

We next consider the problem of decomposing the tensor-product representations
$D_{k/2} \otimes D_{\ell/2}$ of $SU(2)$ (i.e., the composition of (9.40) with the diagonal map
$SU(2) \to SU(2) \times SU(2)$) into irreducible representations. We may as well assume
that $\ell \le k$. Note that $\tau_{k\ell} = D_{k/2} \otimes D_{\ell/2}$ acts on

(9.41)    $\mathcal{P}_{k\ell} = \{f(z, w) : \text{polynomial on } \mathbb{C}^2 \times \mathbb{C}^2, \text{ homogeneous of degree } k \text{ in } z, \ \ell \text{ in } w\},$

as

(9.42)    $\tau_{k\ell}(g) f(z, w) = f(g^{-1}z, g^{-1}w).$

Parallel to (9.25) and (9.29), we have, on $\mathcal{P}_{k\ell}$,

(9.43)    $L_1 f = -\frac{i}{2}\left(z_1 \partial_{z_1} f - z_2 \partial_{z_2} f + w_1 \partial_{w_1} f - w_2 \partial_{w_2} f\right),$
          $L_+ f = -z_2 \partial_{z_1} f - w_2 \partial_{w_1} f, \quad L_- f = z_1 \partial_{z_2} f + w_1 \partial_{w_2} f.$

To decompose $\mathcal{P}_{k\ell}$ into irreducible subspaces, we specify $\operatorname{Ker} L_+$. In fact, a
holomorphic function $f(z, w)$ annihilated by $L_+$ is of the form

(9.44)    $f(z, w) = g(z_2, w_2, w_2 z_1 - z_2 w_1),$

and the kernel of $L_+$ in $\mathcal{P}_{k\ell}$ is the linear span of

(9.45)    $\psi_{k\ell\mu}(z, w) = z_2^{k-\mu} w_2^{\ell-\mu} (w_2 z_1 - z_2 w_1)^\mu, \quad 0 \le \mu \le \ell.$

A calculation gives

(9.46)    $L_1 \psi_{k\ell\mu} = \frac{i}{2}(k + \ell - 2\mu)\, \psi_{k\ell\mu}.$

It follows that, for fixed $k, \ell$, $0 \le \ell \le k$, and for each $\mu = 0, \dots, \ell$, $\psi_{k\ell\mu}$ is the
highest-weight vector of a representation equivalent to $D_{(k+\ell-2\mu)/2}$, so we have

(9.47)    $D_{k/2} \otimes D_{\ell/2} \approx \bigoplus_{\mu=0}^{\ell} D_{(k+\ell-2\mu)/2} = D_{(k+\ell)/2} \oplus D_{(k+\ell-2)/2} \oplus \cdots \oplus D_{(k-\ell)/2}.$


This is called the Clebsch-Gordan series. Extensions of the results presented here
to more general compact Lie groups, due mainly to E. Cartan and H. Weyl, can
be found in a number of places, including [T, T2, Var1, Wal1].
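As a small consistency check on the decomposition (9.47), the dimensions of the summands should add up to $\dim(\mathcal{P}_k \otimes \mathcal{P}_\ell) = (k+1)(\ell+1)$. The following sketch lists the labels $k + \ell - 2\mu$ and performs that count; the function names are illustrative.

```python
def tensor_summands(k, l):
    """Labels m such that D_{k/2} (x) D_{l/2} decomposes into the D_{m/2}, per (9.47)."""
    if l > k:
        k, l = l, k  # (9.47) is stated for l <= k; the product is symmetric
    return [k + l - 2 * mu for mu in range(l + 1)]

def dimensions_match(k, l):
    """Compare dim(P_k (x) P_l) with the total dimension of the summands,
    using dim D_{m/2} = m + 1."""
    return (k + 1) * (l + 1) == sum(m + 1 for m in tensor_summands(k, l))
```

For example, `tensor_summands(2, 1)` lists the labels 3 and 1, corresponding to $D_{1} \otimes D_{1/2} \approx D_{3/2} \oplus D_{1/2}$, with dimensions $4 + 2 = 3 \cdot 2$.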